Author

Michael Niemier

Bio: Michael Niemier is an academic researcher from the University of Notre Dame. The author has contributed to research in topics: Logic gate & Nanomagnet. The author has an h-index of 30 and has co-authored 194 publications receiving 3,449 citations. Previous affiliations of Michael Niemier include the Georgia Institute of Technology & University of California, Berkeley.


Papers
Journal ArticleDOI
01 Apr 2018
TL;DR: In situations where edge inference is required, there are increasing gaps between the computational complexity and energy efficiency demanded by the continued scaling of deep neural networks and the hardware capacity actually available with current CMOS technology scaling.
Abstract: Deep neural networks offer considerable potential across a range of applications, from advanced manufacturing to autonomous cars. A clear trend in deep neural networks is the exponential growth of network size and the associated increases in computational complexity and memory consumption. However, the performance and energy efficiency of edge inference, in which the inference (the application of a trained network to new data) is performed locally on embedded platforms that have limited area and power budget, is bounded by technology scaling. Here we analyse recent data and show that there are increasing gaps between the computational complexity and energy efficiency required by data scientists and the hardware capacity made available by hardware architects. We then discuss various architecture and algorithm innovations that could help to bridge the gaps. This Perspective highlights the existence of gaps between the computational complexity and energy efficiency required for the continued scaling of deep neural networks and the hardware capacity actually available with current CMOS technology scaling, in situations where edge inference is required; it then discusses various architecture and algorithm innovations that could help to bridge these gaps.

354 citations
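The gap the Perspective describes is between the multiply-accumulate (MAC) and memory demands of growing networks and what edge hardware can supply. A minimal sketch, with a hypothetical three-layer network of my own (not from the paper), of how those two quantities are tallied for convolution layers:

```python
# Illustrative cost model: MAC count and parameter count of conv layers,
# the two quantities whose growth the Perspective contrasts with the
# hardware capacity available for edge inference.

def conv_cost(in_ch, out_ch, k, out_h, out_w):
    """MACs and parameters for one k x k convolution layer (stride 1)."""
    params = in_ch * out_ch * k * k           # one weight per (in, out, ky, kx)
    macs = params * out_h * out_w             # every weight fires at each output pixel
    return macs, params

# Hypothetical 3-layer network on a 32x32 RGB input, same-padded.
layers = [(3, 16, 3, 32, 32), (16, 32, 3, 32, 32), (32, 64, 3, 32, 32)]
total_macs = total_params = 0
for spec in layers:
    m, p = conv_cost(*spec)
    total_macs += m
    total_params += p

print(f"MACs: {total_macs:,}, parameters: {total_params:,}")
```

Doubling channel widths roughly quadruples both totals, which is the scaling pressure the paper's data quantify.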

Journal ArticleDOI
01 Nov 2019
TL;DR: It is shown that ternary content-addressable memories (TCAMs) can be used as attentional memories, in which the distance between a query vector and each stored entry is computed within the memory itself, thus avoiding data transfer.
Abstract: Deep neural networks are efficient at learning from large sets of labelled data, but struggle to adapt to previously unseen data. In pursuit of generalized artificial intelligence, one approach is to augment neural networks with an attentional memory so that they can draw on already learnt knowledge patterns and adapt to new but similar tasks. In current implementations of such memory augmented neural networks (MANNs), the content of a network’s memory is typically transferred from the memory to the compute unit (a central processing unit or graphics processing unit) to calculate similarity or distance norms. The processing unit hardware incurs substantial energy and latency penalties associated with transferring the data from the memory and updating the data at random memory addresses. Here, we show that ternary content-addressable memories (TCAMs) can be used as attentional memories, in which the distance between a query vector and each stored entry is computed within the memory itself, thus avoiding data transfer. Our compact and energy-efficient TCAM cell is based on two ferroelectric field-effect transistors. We evaluate the performance of our ferroelectric TCAM array prototype for one- and few-shot learning applications. When compared with a MANN where cosine distance calculations are performed on a graphics processing unit, the ferroelectric TCAM approach provides a 60-fold reduction in energy and 2,700-fold reduction in latency for a single memory search operation. A compact ternary content-addressable memory cell, which is based on two ferroelectric field-effect transistors, can provide memory augmented neural networks with improved energy and latency performance compared with traditional approaches based on graphics processing units.

190 citations
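The core idea above is that a ternary content-addressable memory returns the stored entry closest to a query without moving data to a processor. A software sketch of that behavior (function names, the `*` wildcard encoding, and mismatch counting as the distance are my illustrative assumptions, not the paper's circuit):

```python
# Illustrative model of a TCAM used as an attentional memory: each stored
# entry is a ternary pattern over {'0', '1', '*'} ('*' = don't care), and
# the distance to a binary query is the number of mismatching specified
# bits -- evaluated for every entry "in parallel" inside the array.

WILDCARD = "*"

def tcam_distance(entry, query):
    """Count mismatches between a ternary entry and a binary query."""
    return sum(1 for e, q in zip(entry, query)
               if e != WILDCARD and e != q)

def tcam_search(entries, query):
    """Return the index of the nearest stored entry (the best match)."""
    distances = [tcam_distance(e, query) for e in entries]
    return min(range(len(entries)), key=distances.__getitem__)

memory = ["10*1", "0011", "111*"]
print(tcam_search(memory, "1011"))  # entry "10*1" matches with distance 0
```

In the hardware version, the mismatch count manifests as a match-line discharge, so the search energy is paid once per query instead of once per transferred entry.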

Journal ArticleDOI
TL;DR: Progress toward complete and reliable NML systems is reviewed, including the fundamental characteristics a device must possess if it is to be used in a digital system.
Abstract: Quoting the International Technology Roadmap for Semiconductors (ITRS) 2009 Emerging Research Devices section, 'Nanomagnetic logic (NML) has potential advantages relative to CMOS of being non-volatile, dense, low-power, and radiation-hard. Such magnetic elements are compatible with MRAM technology, which can provide input–output interfaces. Compatibility with MRAM also promises a natural integration of memory and logic. Nanomagnetic logic also appears to be scalable to the ultimate limit of using individual atomic spins.' This article reviews progress toward complete and reliable NML systems. More specifically, we (i) review experimental progress toward fundamental characteristics a device must possess if it is to be used in a digital system, (ii) consider how the NML design space may impact the system-level energy (especially when considering the clock needed to drive a computation), (iii) explain—using both the NML design space and a discussion of clocking as context—how reliable circuit operation may be achieved, (iv) highlight experimental efforts regarding CMOS friendly clock structures for NML systems, (v) explain how electrical I/O could be achieved, and (vi) conclude with a brief discussion of suitable architectures for this technology. Throughout the article, we attempt to identify important areas for future work.

182 citations
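The logic primitive underlying NML (and QCA generally) is the three-input majority vote: a magnet settles to the orientation held by the majority of its neighbors, and fixing one input turns the gate into AND or OR. A minimal functional sketch of that primitive (pure software, ignoring clocking and magnetization physics):

```python
# Three-input majority vote, the native gate of nanomagnetic logic.
# Fixing one input to a constant bias yields AND (bias 0) or OR (bias 1),
# which is how NML composes arbitrary Boolean logic.

def majority(a, b, c):
    """Return the value held by at least two of the three inputs."""
    return 1 if a + b + c >= 2 else 0

def nml_and(a, b):
    return majority(a, b, 0)  # bias input pinned to 0

def nml_or(a, b):
    return majority(a, b, 1)  # bias input pinned to 1

print(nml_and(1, 0), nml_or(1, 0))  # prints "0 1"
```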

Journal ArticleDOI
TL;DR: In this article, the first demonstration of deterministically placed quantum-dot cellular automata (QCA) devices is presented, where devices are controlled by on-chip local fields.
Abstract: We report local control of nanomagnets that can be arranged to perform computation in a cellular automata-like architecture. This letter represents the first demonstration of deterministically placed quantum-dot cellular automata (QCA) devices (of any implementation), where devices are controlled by on-chip local fields.

158 citations

Journal ArticleDOI
TL;DR: The design of dataflow components for a simple microprocessor being designed exclusively in QCA is discussed, along with problems associated with initial designs and enumerated solutions to these problems.
Abstract: SUMMARY Quantum cellular automata (QCA) are currently being investigated as an alternative to CMOS VLSI. While some simple logical circuits and devices have been studied, little if any work has been done in considering the architecture for systems of QCA devices. This work discusses the progress of one of the first such efforts. Namely, the design of dataflow components for a simple microprocessor being designed exclusively in QCA is discussed. Problems associated with initial designs and enumerated solutions to these problems (usually stemming from floorplanning techniques) are explained. Finally, areas of future research direction for circuit design in QCA are presented. Copyright © 2001 John Wiley & Sons, Ltd.

129 citations


Cited by

Journal ArticleDOI
01 Jun 2018
TL;DR: This Review Article examines the development of in-memory computing using resistive switching devices, where the two-terminal structure of the devices, their resistive switching properties, and direct data processing in the memory can enable area- and energy-efficient computation.
Abstract: Modern computers are based on the von Neumann architecture in which computation and storage are physically separated: data are fetched from the memory unit, shuttled to the processing unit (where computation takes place) and then shuttled back to the memory unit to be stored. The rate at which data can be transferred between the processing unit and the memory unit represents a fundamental limitation of modern computers, known as the memory wall. In-memory computing is an approach that attempts to address this issue by designing systems that compute within the memory, thus eliminating the energy-intensive and time-consuming data movement that plagues current designs. Here we review the development of in-memory computing using resistive switching devices, where the two-terminal structure of the devices, their resistive switching properties, and direct data processing in the memory can enable area- and energy-efficient computation. We examine the different digital, analogue, and stochastic computing schemes that have been proposed, and explore the microscopic physical mechanisms involved. Finally, we discuss the challenges in-memory computing faces, including the required scaling characteristics, in delivering next-generation computing. This Review Article examines the development of in-memory computing using resistive switching devices.

1,193 citations
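The in-memory computation the Review describes is most often an analog matrix-vector multiply: conductances encode the matrix, row voltages encode the input, and each column current is the Kirchhoff sum of Ohm's-law products. An idealized sketch of that mapping (device non-idealities, wire resistance, and ADC quantization are all ignored):

```python
# Ideal resistive-crossbar matrix-vector multiply: conductances G[i][j]
# store the matrix, voltages V[i] drive the rows, and each column current
# I_j = sum_i G[i][j] * V[i] -- Ohm's law performs the multiplications,
# the column wiring performs the additions.

def crossbar_mvm(conductances, voltages):
    """Column currents of an ideal crossbar."""
    cols = len(conductances[0])
    return [sum(g_row[j] * v for g_row, v in zip(conductances, voltages))
            for j in range(cols)]

G = [[1.0, 0.5],
     [0.0, 2.0]]   # conductances (arbitrary units)
V = [0.2, 0.1]     # row input voltages

print(crossbar_mvm(G, V))  # column currents, approximately [0.2, 0.3]
```

The whole multiply happens in one read cycle, which is why eliminating the memory-to-processor shuttle pays off for workloads dominated by such products.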

01 Jan 2016
Design of Analog CMOS Integrated Circuits (textbook)

1,038 citations

Journal ArticleDOI
TL;DR: This Review provides an overview of memory devices and the key computational primitives enabled by these memory devices as well as their applications spanning scientific computing, signal processing, optimization, machine learning, deep learning and stochastic computing.
Abstract: Traditional von Neumann computing systems involve separate processing and memory units. However, data movement is costly in terms of time and energy and this problem is aggravated by the recent explosive growth in highly data-centric applications related to artificial intelligence. This calls for a radical departure from the traditional systems and one such non-von Neumann computational approach is in-memory computing. Hereby certain computational tasks are performed in place in the memory itself by exploiting the physical attributes of the memory devices. Both charge-based and resistance-based memory devices are being explored for in-memory computing. In this Review, we provide a broad overview of the key computational primitives enabled by these memory devices as well as their applications spanning scientific computing, signal processing, optimization, machine learning, deep learning and stochastic computing. This Review provides an overview of memory devices and the key computational primitives for in-memory computing, and examines the possibilities of applying this computing approach to a wide range of applications.

841 citations
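Among the computational primitives the Review surveys is stochastic computing, where a value in [0, 1] is encoded as the fraction of 1s in a random bitstream and a single AND gate multiplies two such values. A hedged software sketch of that primitive (stream length, seeding, and function names are my own choices):

```python
# Stochastic-computing multiply: encode p and q as independent random
# bitstreams whose density of 1s equals the value, AND them bitwise, and
# read the product p*q back as the density of 1s in the result.

import random

def to_stream(p, n, rng):
    """Random bitstream of length n with P(bit = 1) = p."""
    return [1 if rng.random() < p else 0 for _ in range(n)]

def stochastic_multiply(p, q, n=100_000, seed=0):
    rng = random.Random(seed)
    a = to_stream(p, n, rng)
    b = to_stream(q, n, rng)
    anded = [x & y for x, y in zip(a, b)]   # one AND gate per bit position
    return sum(anded) / n                   # estimates p * q

est = stochastic_multiply(0.5, 0.4)
print(round(est, 2))  # close to 0.20
```

Accuracy grows only as the square root of the stream length, which is the usual trade-off: extremely cheap logic per bit in exchange for long streams.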