Home
/
Authors
/
Ivan Vo

Author

Ivan Vo

Bio: Ivan Vo is an academic researcher from IBM. The author has contributed to research in topics: Voltage regulator & CMOS. The author has an hindex of 11, co-authored 22 publications receiving 2976 citations.

Topics: Voltage regulator, CMOS, Microarchitecture, Logic gate, Jitter ...read more

Papers

PDF

Open Access

More filters

Journal Article•DOI•

A million spiking-neuron integrated circuit with a scalable communication network and interface

[...]

Paul A. Merolla¹, John V. Arthur¹, Rodrigo Alvarez-Icaza¹, Andrew S. Cassidy¹, Jun Sawada¹, Filipp Akopyan¹, Bryan L. Jackson¹, Nabil Imam², Chen Guo¹, Yutaka Nakamura¹, Bernard Brezzo¹, Ivan Vo¹, Steven K. Esser¹, Rathinakumar Appuswamy¹, Brian Taba¹, Arnon Amir¹, Myron D. Flickner¹, William P. Risk¹, Rajit Manohar², Dharmendra S. Modha¹ - Show less +16 more•Institutions (2)

IBM¹, Cornell University²

08 Aug 2014-Science

TL;DR: Inspired by the brain’s structure, an efficient, scalable, and flexible non–von Neumann architecture is developed that leverages contemporary silicon technology and is well suited to many applications that use complex neural networks in real time, for example, multiobject detection and classification.

...read moreread less

Abstract: Inspired by the brain’s structure, we have developed an efficient, scalable, and flexible non–von Neumann architecture that leverages contemporary silicon technology. To demonstrate, we built a 5.4-billion-transistor chip with 4096 neurosynaptic cores interconnected via an intrachip network that integrates 1 million programmable spiking neurons and 256 million configurable synapses. Chips can be tiled in two dimensions via an interchip communication interface, seamlessly scaling the architecture to a cortexlike sheet of arbitrary size. The architecture is well suited to many applications that use complex neural networks in real time, for example, multiobject detection and classification. With 400-pixel-by-240-pixel video input at 30 frames per second, the chip consumes 63 milliwatts.

...read moreread less

3,253 citations

Patent•

High Performance eDRAM Sense Amplifier

[...]

Fadi H. Gebara¹, Jente B. Kuang¹, Jayakumaran Sivagnaname¹, Ivan Vo¹•Institutions (1)

IBM¹

01 Feb 2010

TL;DR: In this paper, a bit line connected to each of a first plurality of eDRAM cells is controlled by cell control lines tied to each cell, and during a READ operation, the EDRAM cell releases its charge indicating its digital state.

...read moreread less

Abstract: Embedded dynamic random access memory (eDRAM) sense amplifier circuitry in which a bit line connected to each of a first plurality of eDRAM cells is controlled by cell control lines tied to each of the cells. During a READ operation the eDRAM cell releases its charge indicating its digital state. The digital charge propagates through the eDRAM sense amplifier circuitry to a mid-rail amplifier inverter circuit which amplifies the charge and provides it to a latch circuit. The latch circuit, in turn, inverts the charge to correctly represent at its output the logical value stored in the eDRAM cell being read, and returns the charge through the eDRAM sense amplifier circuitry to replenish the eDRAM cell.

...read moreread less

124 citations

Proceedings Article•DOI•

Real-time scalable cortical computing at 46 giga-synaptic OPS/watt with ~100× speedup in time-to-solution and ~100,000× reduction in energy-to-solution

[...]

Andrew S. Cassidy¹, Rodrigo Alvarez-Icaza¹, Filipp Akopyan¹, Jun Sawada¹, John V. Arthur¹, Paul A. Merolla¹, Pallab Datta¹, Marc Gonzalez Tallada¹, Brian Taba¹, Alexander Andreopoulos¹, Arnon Amir¹, Steven K. Esser¹, Jeff Kusnitz¹, Rathinakumar Appuswamy¹, C. Haymes¹, Bernard Brezzo¹, Roger Moussalli¹, Ralph Bellofatto¹, Christian W. Baks¹, Michael Mastro¹, Kai Schleupen¹, Charles Edwin Cox¹, Ken Inoue¹, Steve Millman¹, Nabil Imam², Emmett McQuinn¹, Yutaka Nakamura¹, Ivan Vo¹, Chen Guok¹, Don Nguyen, Scott Lekuch¹, Sameh W. Asaad¹, Daniel Friedman¹, Bryan L. Jackson¹, Myron D. Flickner¹, William P. Risk¹, Rajit Manohar², Dharmendra S. Modha¹ - Show less +34 more•Institutions (2)

IBM¹, Cornell University²

16 Nov 2014

TL;DR: True North is a 4,096 core, 1 million neuron, and 256 million synapse brain-inspired neurosynaptic processor, that consumes 65mW of power running at real-time and delivers performance of 46 Giga-Synaptic OPS/Watt.

...read moreread less

Abstract: Drawing on neuroscience, we have developed a parallel, event-driven kernel for neurosynaptic computation, that is efficient with respect to computation, memory, and communication. Building on the previously demonstrated highly optimized software expression of the kernel, here, we demonstrate True North, a co-designed silicon expression of the kernel. True North achieves five orders of magnitude reduction in energy to-solution and two orders of magnitude speedup in time-to solution, when running computer vision applications and complex recurrent neural network simulations. Breaking path with the von Neumann architecture, True North is a 4,096 core, 1 million neuron, and 256 million synapse brain-inspired neurosynaptic processor, that consumes 65mW of power running at real-time and delivers performance of 46 Giga-Synaptic OPS/Watt. We demonstrate seamless tiling of True North chips into arrays, forming a foundation for cortex-like scalability. True North's unprecedented time-to-solution, energy-to-solution, size, scalability, and performance combined with the underlying flexibility of the kernel enable a broad range of cognitive applications.

...read moreread less

82 citations

Journal Article•DOI•

A 1.0-GHz single-issue 64-bit powerPC integer processor

[...]

Joel Abraham Silberman¹, Naoaki Aoki, David W. Boerstler, Jeffrey L. Burns, Sang Hoo Dhong, A. Essbaum, Uttam Shyamalindu Ghoshal, David F. Heidel, Peter Hofstee, Kyung Tek Lee, David Meltzer, Hung Ngo, Kevin J. Nowka, S. Posluszny, O. Takahashi, Ivan Vo, B. Zoric - Show less +13 more•Institutions (1)

IBM¹

05 Feb 1998

TL;DR: This 64 b single-issue integer processor, comprised of about one million transistors, is fabricated in a 0.15 /spl mu/m effective channel length, six-metal-layer CMOS technology and intended as a vehicle to explore circuit, clocking, microarchitecture, and methodology options for high-frequency processors.

...read moreread less

Abstract: This 64 b single-issue integer processor, comprised of about one million transistors, is fabricated in a 0.15 /spl mu/m effective channel length, six-metal-layer CMOS technology. Intended as a vehicle to explore circuit, clocking, microarchitecture, and methodology options for high-frequency processors, the processor prototype implements 60 fixed-point compare, logical, arithmetic, and rotate-merge-mask instructions of the PowerPC instruction-set architecture with single-cycle latency. The processor executes programs written in this instruction subset from cache with a 1 ns cycle. In addition, the prototype implements 36 PowerPC load/store instructions that execute as single-cycle operations (zero wait cycles) with 1.15 ns latency. Full data forwarding and full at speed scan testing are supported.

...read moreread less

75 citations

Proceedings Article•DOI•

Design methodology for a 1.0 GHz microprocessor

[...]

S. Posluszny¹, Naoaki Aoki¹, David William Boerstler¹, Jeffrey L. Burns¹, Sang Hoo Dhong¹, Uttam Shyamalindu Ghoshal¹, P. Hofstee¹, D. LaPotin¹, Kyung Tek Lee¹, David Meltzer¹, H.C. Ngo¹, Kevin J. Nowka¹, Joel Abraham Silberman¹, Osamu Takahashi¹, Ivan Vo¹ - Show less +11 more•Institutions (1)

IBM¹

05 Oct 1998

TL;DR: The design methodology used to build an experimental 1.0 GigaHertz PowerPC integer microprocessor at IBM's Austin Research Laboratory will cover design and verification tools as well as circuit constraints and microarchitecture philosophy.

...read moreread less

Abstract: This paper describes the design methodology used to build an experimental 1.0 GigaHertz PowerPC integer microprocessor at IBM's Austin Research Laboratory. The high frequency requirements dictated the chip composition to be almost entirely custom macros using dynamic circuit techniques. The methodology presented will cover design and verification tools as well as circuit constraints and microarchitecture philosophy. The microarchitecture, circuits and tools were defined by the high frequency requirements of the processor as well as the aggressive design schedule and size of the design team.

...read moreread less

49 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Deep learning in neural networks

[...]

Jürgen Schmidhuber¹•Institutions (1)

University of Lugano¹

01 Jan 2015-Neural Networks

TL;DR: This historical survey compactly summarizes relevant work, much of it from the previous millennium, review deep supervised learning, unsupervised learning, reinforcement learning & evolutionary computation, and indirect search for short programs encoding deep and large networks.

...read moreread less

14,635 citations

Journal Article•DOI•

Efficient Processing of Deep Neural Networks: A Tutorial and Survey

[...]

Vivienne Sze¹, Yu-Hsin Chen¹, Tien-Ju Yang¹, Joel Emer¹•Institutions (1)

Massachusetts Institute of Technology¹

20 Nov 2017

TL;DR: In this paper, the authors provide a comprehensive tutorial and survey about the recent advances toward the goal of enabling efficient processing of DNNs, and discuss various hardware platforms and architectures that support DNN, and highlight key trends in reducing the computation cost of deep neural networks either solely via hardware design changes or via joint hardware and DNN algorithm changes.

...read moreread less

Abstract: Deep neural networks (DNNs) are currently widely used for many artificial intelligence (AI) applications including computer vision, speech recognition, and robotics. While DNNs deliver state-of-the-art accuracy on many AI tasks, it comes at the cost of high computational complexity. Accordingly, techniques that enable efficient processing of DNNs to improve energy efficiency and throughput without sacrificing application accuracy or increasing hardware cost are critical to the wide deployment of DNNs in AI systems. This article aims to provide a comprehensive tutorial and survey about the recent advances toward the goal of enabling efficient processing of DNNs. Specifically, it will provide an overview of DNNs, discuss various hardware platforms and architectures that support DNNs, and highlight key trends in reducing the computation cost of DNNs either solely via hardware design changes or via joint hardware design and DNN algorithm changes. It will also summarize various development resources that enable researchers and practitioners to quickly get started in this field, and highlight important benchmarking metrics and design considerations that should be used for evaluating the rapidly growing number of DNN hardware designs, optionally including algorithmic codesigns, being proposed in academia and industry. The reader will take away the following concepts from this article: understand the key design considerations for DNNs; be able to evaluate different DNN hardware implementations with benchmarks and comparison metrics; understand the tradeoffs between various hardware architectures and platforms; be able to evaluate the utility of various DNN design techniques for efficient processing; and understand recent implementation trends and opportunities.

...read moreread less

2,391 citations

Journal Article•DOI•

Loihi: A Neuromorphic Manycore Processor with On-Chip Learning

[...]

Michael Davies¹, Narayan Srinivasa, Tsung-Han Lin¹, Gautham N. Chinya¹, Cao Yongqiang¹, Sri Harsha Choday¹, Georgios D. Dimou, Prasad Joshi¹, Nabil Imam¹, Shweta Jain¹, Yuyun Liao¹, Chit-Kwan Lin¹, Andrew Lines¹, Ruokun Liu¹, Deepak A. Mathaikutty¹, Steven McCoy¹, Arnab Paul¹, Jonathan Tse¹, Guruguhanathan Venkataramanan¹, Yi-Hsin Weng¹, Andreas Wild¹, Yoon Seok Yang¹, Hong Wang¹ - Show less +19 more•Institutions (1)

Intel¹

16 Jan 2018-IEEE Micro

TL;DR: Loihi is a 60-mm2 chip fabricated in Intels 14-nm process that advances the state-of-the-art modeling of spiking neural networks in silicon, and can solve LASSO optimization problems with over three orders of magnitude superior energy-delay-product compared to conventional solvers running on a CPU iso-process/voltage/area.

...read moreread less

Abstract: Loihi is a 60-mm2 chip fabricated in Intels 14-nm process that advances the state-of-the-art modeling of spiking neural networks in silicon. It integrates a wide range of novel features for the field, such as hierarchical connectivity, dendritic compartments, synaptic delays, and, most importantly, programmable synaptic learning rules. Running a spiking convolutional form of the Locally Competitive Algorithm, Loihi can solve LASSO optimization problems with over three orders of magnitude superior energy-delay-product compared to conventional solvers running on a CPU iso-process/voltage/area. This provides an unambiguous example of spike-based computation, outperforming all known conventional solutions.

...read moreread less

2,331 citations

Journal Article•DOI•

Training and operation of an integrated neuromorphic network based on metal-oxide memristors

[...]

Mirko Prezioso¹, Farnood Merrikh-Bayat¹, Brian D. Hoskins¹, Gina C. Adam¹, Konstantin K. Likharev², Dmitri B. Strukov¹ - Show less +2 more•Institutions (2)

University of California, Santa Barbara¹, Stony Brook University²

07 May 2015-Nature

TL;DR: The experimental implementation of transistor-free metal-oxide memristor crossbars, with device variability sufficiently low to allow operation of integrated neural networks, in a simple network: a single-layer perceptron (an algorithm for linear classification).

...read moreread less

Abstract: Despite much progress in semiconductor integrated circuit technology, the extreme complexity of the human cerebral cortex, with its approximately 10(14) synapses, makes the hardware implementation of neuromorphic networks with a comparable number of devices exceptionally challenging. To provide comparable complexity while operating much faster and with manageable power dissipation, networks based on circuits combining complementary metal-oxide-semiconductors (CMOSs) and adjustable two-terminal resistive devices (memristors) have been developed. In such circuits, the usual CMOS stack is augmented with one or several crossbar layers, with memristors at each crosspoint. There have recently been notable improvements in the fabrication of such memristive crossbars and their integration with CMOS circuits, including first demonstrations of their vertical integration. Separately, discrete memristors have been used as artificial synapses in neuromorphic networks. Very recently, such experiments have been extended to crossbar arrays of phase-change memristive devices. The adjustment of such devices, however, requires an additional transistor at each crosspoint, and hence these devices are much harder to scale than metal-oxide memristors, whose nonlinear current-voltage curves enable transistor-free operation. Here we report the experimental implementation of transistor-free metal-oxide memristor crossbars, with device variability sufficiently low to allow operation of integrated neural networks, in a simple network: a single-layer perceptron (an algorithm for linear classification). The network can be taught in situ using a coarse-grain variety of the delta rule algorithm to perform the perfect classification of 3 × 3-pixel black/white images into three classes (representing letters). This demonstration is an important step towards much larger and more complex memristive neuromorphic networks.

...read moreread less

2,222 citations

Journal Article•DOI•

The computational brain: Patricia S. Churchland and Terrence J. Sejnowski (MIT Press, Cambridge, MA, 1992); xi, 544 pages, $39.95

[...]

George N. Reeke

01 Apr 1996-Artificial Intelligence

TL;DR: The Computational Brain this paper provides a broad overview of neuroscience and computational theory, followed by a study of some of the most recent and sophisticated modeling work in the context of relevant neurobiological research.

...read moreread less

1,472 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse