Author

Kai Ni

Bio: Kai Ni is an academic researcher at the Rochester Institute of Technology whose work spans computer science and non-volatile memory. He has an h-index of 19 and has co-authored 99 publications receiving 1,571 citations. His previous affiliations include Katholieke Universiteit Leuven and the University of Notre Dame.

Papers published on a yearly basis

Papers
Proceedings ArticleDOI
01 Dec 2017
TL;DR: A transient Preisach model is developed that accurately predicts minor-loop trajectories and remnant polarization charge of FeFET synapses for arbitrary pulse width, voltage, and history, and benchmarking reveals a 10³ to 10⁶ acceleration in online learning latency over multi-state RRAM-based analog synapses.
Abstract: The memory requirements of at-scale deep neural networks (DNN) dictate that synaptic weight values be stored and updated in off-chip memory such as DRAM, limiting energy efficiency and training time. Monolithic cross-bar / pseudo cross-bar arrays with analog non-volatile memories capable of storing and updating weights on-chip offer the possibility of accelerating DNN training. Here, we harness the dynamics of voltage-controlled partial polarization switching in ferroelectric FETs (FeFET) to demonstrate such an analog synapse. We develop a transient Preisach model that accurately predicts minor-loop trajectories and remnant polarization charge (Pr) for arbitrary pulse width, voltage, and history. We experimentally demonstrate a 5-bit FeFET synapse with symmetric potentiation and depression characteristics, and a 45x tunable range in conductance with a 75 ns update pulse. A circuit macro-model is used to evaluate and benchmark the on-chip learning performance (area, latency, energy, accuracy) of the FeFET synaptic core, revealing a 10³ to 10⁶ acceleration in online learning latency over multi-state RRAM-based analog synapses.
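The Preisach picture behind this model can be sketched in a few lines: the ferroelectric is treated as many independent relay operators (hysterons) whose switched fraction depends on the full voltage history, which is what produces minor loops and history-dependent remanent charge. A minimal sketch, with a uniform hysteron grid whose thresholds are illustrative assumptions rather than the paper's calibrated parameters:

```python
import numpy as np

def preisach_response(voltage_sequence, n_grid=8):
    """Toy scalar Preisach hysteresis model (illustrative, not the
    calibrated transient model from the paper). The ferroelectric is a
    collection of ideal relays, each with an 'up' switching threshold
    alpha and a 'down' threshold beta (alpha > beta). The normalized
    remanent polarization is the mean relay state after the whole
    voltage history has been applied."""
    grid = np.linspace(-1.0, 1.0, n_grid)
    pairs = [(a, b) for a in grid for b in grid if a > b]
    alphas = np.array([p[0] for p in pairs])
    betas = np.array([p[1] for p in pairs])
    state = -np.ones_like(alphas)                   # start fully 'down'
    for v in voltage_sequence:
        state = np.where(v >= alphas, 1.0, state)   # relays switch up
        state = np.where(v <= betas, -1.0, state)   # relays switch down
    return float(state.mean())                      # normalized P_r in [-1, 1]

# History dependence: the same final voltage (0 V) leaves a different
# remanent polarization depending on the preceding pulse amplitude.
p_strong = preisach_response([1.0, 0.0])
p_weak = preisach_response([0.3, 0.0])
```

Even this toy version reproduces the qualitative behavior the paper exploits for analog synapses: partial switching accumulates with pulse history, so intermediate remanent states can be programmed deterministically.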

367 citations

Journal ArticleDOI
01 Aug 2018
TL;DR: This Perspective argues that electronics is poised to enter a new era of scaling – hyper-scaling – driven by advances in beyond-Boltzmann transistors, embedded non-volatile memories, monolithic three-dimensional integration and heterogeneous integration techniques.
Abstract: In the past five decades, the semiconductor industry has gone through two distinct eras of scaling: the geometric (or classical) scaling era and the equivalent (or effective) scaling era. As transistor and memory features approach 10 nanometres, it is apparent that room for further scaling in the horizontal direction is running out. In addition, the rise of data abundant computing is exacerbating the interconnect bottleneck that exists in conventional computing architecture between the compute cores and the memory blocks. Here we argue that electronics is poised to enter a new, third era of scaling — hyper-scaling — in which resources are added when needed to meet the demands of data abundant workloads. This era will be driven by advances in beyond-Boltzmann transistors, embedded non-volatile memories, monolithic three-dimensional integration and heterogeneous integration techniques.

343 citations

Journal ArticleDOI
TL;DR: In this paper, the critical design criteria of Hf0.5Zr0.5O2 (HZO)-based ferroelectric field effect transistors (FeFET) for nonvolatile memory application were established.
Abstract: We fabricate, characterize, and establish the critical design criteria of Hf0.5Zr0.5O2 (HZO)-based ferroelectric field effect transistors (FeFET) for nonvolatile memory application. We quantify the V_TH shift from electron (hole) trapping in the vicinity of the ferroelectric (FE)/interlayer (IL) interface, induced by the erase (program) pulse, and the V_TH shift from polarization switching, to determine the true memory window (MW). The devices exhibit extrapolated retention up to 10 years at 85 °C and endurance up to 5 × 10⁶ cycles, limited by IL breakdown. Endurance up to 10¹² cycles of partial polarization switching is shown in a metal–FE–metal capacitor, in the absence of an IL. A comprehensive metal–FE–insulator–semiconductor FeFET model is developed to quantify the electric field distribution in the gate stack, and an IL design guideline is established to markedly enhance the MW, retention characteristics, and cycling endurance.
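The memory-window accounting described above is simple arithmetic: polarization switching separates the program and erase threshold voltages, while trapping near the FE/IL interface shifts V_TH in the opposite direction and eats into that separation. A sketch with hypothetical voltage numbers (the real values come from the measured devices):

```python
def memory_window(dv_polarization, dv_trap_erase, dv_trap_program):
    """Net FeFET memory window (volts): the V_TH separation produced by
    polarization switching, reduced by the opposing V_TH shifts from
    electron/hole trapping near the FE/IL interface."""
    return dv_polarization - (dv_trap_erase + dv_trap_program)

# Hypothetical numbers chosen only for illustration
mw = memory_window(2.0, 0.4, 0.3)  # -> 1.3 V of true memory window
```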

247 citations

Journal ArticleDOI
01 Nov 2019
TL;DR: It is shown that ternary content-addressable memories (TCAMs) can be used as attentional memories, in which the distance between a query vector and each stored entry is computed within the memory itself, thus avoiding data transfer.
Abstract: Deep neural networks are efficient at learning from large sets of labelled data, but struggle to adapt to previously unseen data. In pursuit of generalized artificial intelligence, one approach is to augment neural networks with an attentional memory so that they can draw on already learnt knowledge patterns and adapt to new but similar tasks. In current implementations of such memory augmented neural networks (MANNs), the content of a network’s memory is typically transferred from the memory to the compute unit (a central processing unit or graphics processing unit) to calculate similarity or distance norms. The processing unit hardware incurs substantial energy and latency penalties associated with transferring the data from the memory and updating the data at random memory addresses. Here, we show that ternary content-addressable memories (TCAMs) can be used as attentional memories, in which the distance between a query vector and each stored entry is computed within the memory itself, thus avoiding data transfer. Our compact and energy-efficient TCAM cell is based on two ferroelectric field-effect transistors. We evaluate the performance of our ferroelectric TCAM array prototype for one- and few-shot learning applications. When compared with a MANN where cosine distance calculations are performed on a graphics processing unit, the ferroelectric TCAM approach provides a 60-fold reduction in energy and 2,700-fold reduction in latency for a single memory search operation.
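The in-memory search idea is easy to state functionally: every stored ternary row compares itself against the query in parallel, wildcard ("don't care") bits never mismatch, and the row with the smallest Hamming distance wins. A toy sketch of that behavior (not the two-FeFET cell or its analog match-line circuit; the entry values and the wildcard encoding are assumptions for illustration):

```python
import numpy as np

WILDCARD = -1  # ternary 'don't care' state (encoding assumed for this sketch)

def tcam_search(entries, query):
    """Functional model of a TCAM used as an attentional memory: each
    stored row counts its mismatching bits against the query (wildcards
    never mismatch), and the row with the smallest Hamming distance is
    the best match. In hardware this happens in parallel inside the
    array, so the stored entries never leave the memory."""
    entries = np.asarray(entries)
    query = np.asarray(query)
    mismatches = (entries != query) & (entries != WILDCARD)
    distances = mismatches.sum(axis=1)  # Hamming distance per row
    return int(distances.argmin()), distances

entries = [
    [1, 0, 1, 1],
    [1, -1, 0, 1],   # second bit is 'don't care'
    [0, 0, 0, 0],
]
best, dists = tcam_search(entries, [1, 1, 0, 1])
```

The energy and latency savings reported in the paper come precisely from computing `distances` inside the array rather than shipping every entry to a GPU for a cosine-distance calculation.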

190 citations

Proceedings ArticleDOI
18 Jun 2018
TL;DR: In this paper, the authors developed a compact model of ferroelectric field effect transistors (FeFET) for memory applications, enabling their exploration at the circuit and architecture level.
Abstract: In this work we develop a compact model of ferroelectric field-effect transistors (FeFET) for memory applications, enabling their exploration at the circuit and architecture level. In contrast to Landau-Khalatnikov (L-K) based approaches, the presented model is founded on the combination of a nucleation-dominated multi-domain Preisach theory of ferroelectric switching with a conventional transistor model. The model successfully reproduces the evolution of the FeFET memory window as a function of the program and erase conditions (amplitude, pulse width, and history). To calibrate the model, we fabricated 10 nm thick Hf0.4Zr0.6O2 (HZO) MFM capacitors and FeFETs and characterized the polarization switching dynamics. Our results highlight the importance of accounting for the switching history, minor-loop trajectory, and coupled time-voltage response of the ferroelectric to quantitatively reproduce the measured FeFET characteristics.
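The coupling that ties the ferroelectric model to the transistor model can be illustrated with a one-line estimate: the remanent polarization charge set by the pulse history appears across the ferroelectric layer's capacitance as a threshold-voltage shift. A back-of-the-envelope sketch, with thickness and permittivity values that are illustrative assumptions, not the calibrated device parameters:

```python
def vth_shift(p_r, t_fe, eps_fe):
    """Rough threshold-voltage shift from remanent polarization in a
    metal-ferroelectric-insulator-semiconductor stack: the polarization
    charge density p_r (C/m^2) dropped across the ferroelectric layer's
    capacitance per unit area, C_fe = eps_fe / t_fe (F/m^2)."""
    c_fe = eps_fe / t_fe
    return p_r / c_fe  # volts

# Illustrative values: 20 uC/cm^2 remanent polarization, 10 nm HZO,
# relative permittivity ~30 (assumed, not taken from the paper)
shift = vth_shift(0.2, 10e-9, 30 * 8.85e-12)
```

In the real device the shift is moderated by the interlayer and channel capacitances, which is exactly why a full compact model, rather than this estimate, is needed for circuit-level exploration.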

120 citations


Cited by
Journal ArticleDOI
TL;DR: This Review provides an overview of memory devices for in-memory computing and the key computational primitives they enable, as well as their applications spanning scientific computing, signal processing, optimization, machine learning, deep learning and stochastic computing.
Abstract: Traditional von Neumann computing systems involve separate processing and memory units. However, data movement is costly in terms of time and energy, and this problem is aggravated by the recent explosive growth in highly data-centric applications related to artificial intelligence. This calls for a radical departure from the traditional systems, and one such non-von Neumann computational approach is in-memory computing, whereby certain computational tasks are performed in place in the memory itself by exploiting the physical attributes of the memory devices. Both charge-based and resistance-based memory devices are being explored for in-memory computing. In this Review, we provide a broad overview of the key computational primitives enabled by these memory devices as well as their applications spanning scientific computing, signal processing, optimization, machine learning, deep learning and stochastic computing.

841 citations

Journal ArticleDOI
23 Jan 2018
TL;DR: This comprehensive review summarizes the state of the art, challenges, and prospects of neuro-inspired computing with emerging nonvolatile memory devices, and presents a device-circuit-algorithm codesign methodology to evaluate the impact of nonideal device effects on system-level performance.
Abstract: This comprehensive review summarizes the state of the art, challenges, and prospects of neuro-inspired computing with emerging nonvolatile memory devices. First, we discuss the demand for developing neuro-inspired architecture beyond today’s von Neumann architecture. Second, we summarize the various approaches to designing the neuromorphic hardware (digital versus analog, spiking versus nonspiking, online training versus offline training) and discuss why emerging nonvolatile memory is attractive for implementing the synapses in the neural network. Then, we discuss the desired device characteristics of the synaptic devices (e.g., multilevel states, weight update nonlinearity/asymmetry, variation/noise), and survey a few representative material systems and device prototypes reported in the literature that show analog conductance tuning. These candidates include phase change memory, resistive memory, ferroelectric memory, floating-gate transistors, etc. Next, we introduce the crossbar array architecture to accelerate the weighted sum and weight update operations that are commonly used in neuro-inspired machine learning algorithms, and review recent progress in array-level experimental demonstrations for pattern recognition tasks. In addition, we discuss the peripheral neuron circuit design issues and present a device-circuit-algorithm codesign methodology to evaluate the impact of nonideal device effects on the system-level performance (e.g., learning accuracy). Finally, we give an outlook on the customization of the learning algorithms for efficient hardware implementation.
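The two crossbar operations the review centers on, the weighted sum and the weight update, are straightforward to sketch: column currents implement an analog vector-matrix multiply, and a nonideal (nonlinear, saturating) conductance update is one of the device effects that degrades learning accuracy. A minimal sketch in which the update rule and its parameters are illustrative assumptions, not a measured device model:

```python
import numpy as np

def crossbar_weighted_sum(conductances, input_voltages):
    """Idealized crossbar weighted sum: the output current on each
    column is the sum of (input voltage x device conductance) down that
    column, i.e. Ohm's law plus Kirchhoff's current law performing an
    analog vector-matrix multiply in one step."""
    return np.asarray(input_voltages) @ np.asarray(conductances)

def nonlinear_update(g, pulses, g_max=1.0, nonlinearity=5.0):
    """Toy nonlinear weight update: each potentiation pulse moves the
    conductance toward g_max with a saturating step, mimicking the
    nonideal analog synapse behavior the review discusses. The form and
    parameters are assumptions for illustration."""
    for _ in range(pulses):
        g = g + (g_max - g) * (1.0 - np.exp(-1.0 / nonlinearity))
    return g

# Example: 2x2 conductance matrix (siemens) driven by two input voltages
currents = crossbar_weighted_sum([[1.0, 2.0], [3.0, 4.0]], [0.5, 0.5])
```

Because the per-pulse step shrinks as the conductance approaches `g_max`, identical pulses produce unequal weight changes, which is exactly the nonlinearity/asymmetry problem the codesign methodology quantifies at the system level.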

730 citations

Journal ArticleDOI
TL;DR: This Review surveys the four physical mechanisms behind resistive switching materials (RSMs), which enable novel in-memory information processing that may resolve the von Neumann bottleneck, and examines the device requirements for systems based on RSMs.
Abstract: The rapid increase in information in the big-data era calls for changes to information-processing paradigms, which, in turn, demand new circuit-building blocks to overcome the decreasing cost-effectiveness of transistor scaling and the intrinsic inefficiency of using transistors in non-von Neumann computing architectures. Accordingly, resistive switching materials (RSMs) based on different physical principles have emerged for memories that could enable energy-efficient and area-efficient in-memory computing. In this Review, we survey the four physical mechanisms that lead to such resistive switching: redox reactions, phase transitions, spin-polarized tunnelling and ferroelectric polarization. We discuss how these mechanisms equip RSMs with desirable properties for representation capability, switching speed and energy, reliability and device density. These properties are the key enablers of processing-in-memory platforms, with applications ranging from neuromorphic computing and general-purpose memcomputing to cybersecurity. Finally, we examine the device requirements for such systems based on RSMs and provide suggestions to address challenges in materials engineering, device optimization, system integration and algorithm design.

564 citations

Journal ArticleDOI
23 Apr 2020-Nature
TL;DR: This work shifts the search for the fundamental limits of ferroelectricity to simpler transition-metal oxide systems—that is, from perovskite-derived complex oxides to fluorite-structure binary oxides—in which ‘reverse’ size effects counterintuitively stabilize polar symmetry in the ultrathin regime.
Abstract: Ultrathin ferroelectric materials could potentially enable low-power logic and nonvolatile memories [1,2]. As ferroelectric materials are made thinner, however, the ferroelectricity is usually suppressed. Size effects in ferroelectrics have been thoroughly investigated in perovskite oxides—the archetypal ferroelectric system [3]. Perovskites, however, have so far proved unsuitable for thickness scaling and integration with modern semiconductor processes [4]. Here we report ferroelectricity in ultrathin doped hafnium oxide (HfO2), a fluorite-structure oxide grown by atomic layer deposition on silicon. We demonstrate the persistence of inversion symmetry breaking and spontaneous, switchable polarization down to a thickness of one nanometre. Our results indicate not only the absence of a ferroelectric critical thickness but also enhanced polar distortions as film thickness is reduced, unlike in perovskite ferroelectrics. This approach to enhancing ferroelectricity in ultrathin layers could provide a route towards polarization-driven memories and ferroelectric-based advanced transistors.

431 citations

Journal ArticleDOI
TL;DR: A comprehensive review of emerging artificial neuromorphic devices and their applications is offered, showing that anion/cation-migration-based memristive devices, phase-change synapses, and spintronic synapses are relatively mature and possess excellent stability as memory devices, yet still suffer from challenges in weight-update linearity and symmetry.
Abstract: The rapid development of information technology has led to urgent requirements for high efficiency and ultralow power consumption. In the past few decades, neuromorphic computing has drawn extensive attention due to its promising capability in processing massive data with extremely low power consumption. Here, we offer a comprehensive review of emerging artificial neuromorphic devices and their applications. In light of the underlying physical processes, we classify the devices into nine major categories and discuss their respective strengths and weaknesses. We show that anion/cation-migration-based memristive devices and phase-change and spintronic synapses are relatively mature and possess excellent stability as memory devices, yet they still suffer from challenges in weight-update linearity and symmetry. Meanwhile, the recently developed electrolyte-gated synaptic transistors have demonstrated outstanding energy efficiency, linearity, and symmetry, but their stability and scalability still need to be optimized. Other emerging synaptic structures, such as ferroelectric, metal–insulator-transition-based, photonic, and purely electronic devices, also have limitations in some aspects, leaving a need for further development of high-performance synaptic devices. Additional efforts are also needed to enhance the functionality of artificial neurons while maintaining a relatively low cost in area and power, and it will be important to explore the intrinsic neuronal stochasticity in computing and to optimize the neurons' driving capability. Finally, by looking into the correlations between the operation mechanisms, material systems, device structures, and performance, we provide clues to future material selections, device designs, and integrations for artificial synapses and neurons.

373 citations