Home
/
Authors
/
Irem Boybat

Author

Irem Boybat

Other affiliations: École Polytechnique Fédérale de Lausanne, University of Patras, Sabancı University

Bio: Irem Boybat is an academic researcher from IBM. The author has contributed to research in topics: Artificial neural network & Neuromorphic engineering. The author has an hindex of 16, co-authored 43 publications receiving 2721 citations. Previous affiliations of Irem Boybat include École Polytechnique Fédérale de Lausanne & University of Patras.

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2013

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Neuromorphic computing using non-volatile memory

[...]

Geoffrey W. Burr¹, Robert M. Shelby¹, Abu Sebastian¹, Sangbum Kim¹, Seyoung Kim¹, Severin Sidler², Kumar Virwani¹, Masatoshi Ishii¹, Pritish Narayanan¹, Alessandro Fumarola¹, Lucas L. Sanches¹, Irem Boybat¹, Manuel Le Gallo¹, Kibong Moon³, Jiyoo Woo³, Hyunsang Hwang³, Yusuf Leblebici² - Show less +13 more•Institutions (3)

IBM¹, École Polytechnique Fédérale de Lausanne², Pohang University of Science and Technology³

02 Jan 2017

TL;DR: The relevant virtues and limitations of these devices are assessed, in terms of properties such as conductance dynamic range, (non)linearity and (a)symmetry of conductance response, retention, endurance, required switching power, and device variability.

...read moreread less

Abstract: Dense crossbar arrays of non-volatile memory (NVM) devices represent one possible path for implementing massively-parallel and highly energy-efficient neuromorphic computing systems. We first revie...

...read moreread less

800 citations

Journal Article•DOI•

Experimental Demonstration and Tolerancing of a Large-Scale Neural Network (165 000 Synapses) Using Phase-Change Memory as the Synaptic Weight Element

[...]

Geoffrey W. Burr¹, Robert M. Shelby¹, Severin Sidler¹, Carmelo di Nolfo¹, Junwoo Jang², Irem Boybat³, Rohit S. Shenoy⁴, Pritish Narayanan¹, Kumar Virwani¹, E.U. Giacometti¹, B. N. Kurdi¹, Hyunsang Hwang² - Show less +8 more•Institutions (4)

IBM¹, Pohang University of Science and Technology², École Polytechnique Fédérale de Lausanne³, Intel⁴

07 Jul 2015-IEEE Transactions on Electron Devices

TL;DR: Using 2 phase-change memory devices per synapse, a 3-layer perceptron network is trained on a subset of the MNIST database of handwritten digits using a backpropagation variant suitable for NVM+selector crossbar arrays, obtaining a training (generalization) accuracy of 82.2%.

...read moreread less

Abstract: Using two phase-change memory devices per synapse, a three-layer perceptron network with 164 885 synapses is trained on a subset (5000 examples) of the MNIST database of handwritten digits using a backpropagation variant suitable for nonvolatile memory (NVM) + selector crossbar arrays, obtaining a training (generalization) accuracy of 82.2% (82.9%). Using a neural network simulator matched to the experimental demonstrator, extensive tolerancing is performed with respect to NVM variability, yield, and the stochasticity, linearity, and asymmetry of the NVM-conductance response. We show that a bidirectional NVM with a symmetric, linear conductance response of high dynamic range is capable of delivering the same high classification accuracies on this problem as a conventional, software-based implementation of this same network.

...read moreread less

759 citations

Journal Article•DOI•

Equivalent-accuracy accelerated neural-network training using analogue memory

[...]

Stefano Ambrogio¹, Pritish Narayanan¹, Hsinyu Tsai¹, Robert M. Shelby¹, Irem Boybat¹, Irem Boybat², Carmelo di Nolfo¹, Carmelo di Nolfo², Severin Sidler², Severin Sidler¹, Massimo Giordano¹, Martina Bodini¹, Martina Bodini², Nathan C. P. Farinha¹, Benjamin Killeen¹, Christina Cheng¹, Yassine Jaoudi¹, Geoffrey W. Burr¹ - Show less +14 more•Institutions (2)

IBM¹, École Polytechnique Fédérale de Lausanne²

06 Jun 2018-Nature

TL;DR: Mixed hardware–software neural-network implementations that involve up to 204,900 synapses and that combine long-term storage in phase-change memory, near-linear updates of volatile capacitors and weight-data transfer with ‘polarity inversion’ to cancel out inherent device-to-device variations are demonstrated.

...read moreread less

Abstract: Neural-network training can be slow and energy intensive, owing to the need to transfer the weight data for the network between conventional digital memory chips and processor chips. Analogue non-volatile memory can accelerate the neural-network training algorithm known as backpropagation by performing parallelized multiply-accumulate operations in the analogue domain at the location of the weight data. However, the classification accuracies of such in situ training using non-volatile-memory hardware have generally been less than those of software-based training, owing to insufficient dynamic range and excessive weight-update asymmetry. Here we demonstrate mixed hardware-software neural-network implementations that involve up to 204,900 synapses and that combine long-term storage in phase-change memory, near-linear updates of volatile capacitors and weight-data transfer with 'polarity inversion' to cancel out inherent device-to-device variations. We achieve generalization accuracies (on previously unseen data) equivalent to those of software-based training on various commonly used machine-learning test datasets (MNIST, MNIST-backrand, CIFAR-10 and CIFAR-100). The computational energy efficiency of 28,065 billion operations per second per watt and throughput per area of 3.6 trillion operations per second per square millimetre that we calculate for our implementation exceed those of today's graphical processing units by two orders of magnitude. This work provides a path towards hardware accelerators that are both fast and energy efficient, particularly on fully connected neural-network layers.

...read moreread less

693 citations

Journal Article•DOI•

Neuromorphic computing with multi-memristive synapses

[...]

Irem Boybat¹, Irem Boybat², Manuel Le Gallo², S. R. Nandakumar², S. R. Nandakumar³, Timoleon Moraitis², Thomas Parnell², Tomas Tuma², Bipin Rajendran³, Yusuf Leblebici¹, Abu Sebastian², Evangelos Eleftheriou² - Show less +8 more•Institutions (3)

École Polytechnique Fédérale de Lausanne¹, IBM², New Jersey Institute of Technology³

28 Jun 2018-Nature Communications

TL;DR: A multi-memristive synaptic architecture with an efficient global counter-based arbitration scheme to address challenges associated with the non-ideal memristive device behavior is proposed.

...read moreread less

Abstract: Neuromorphic computing has emerged as a promising avenue towards building the next generation of intelligent computing systems. It has been proposed that memristive devices, which exhibit history-dependent conductivity modulation, could efficiently represent the synaptic weights in artificial neural networks. However, precise modulation of the device conductance over a wide dynamic range, necessary to maintain high network accuracy, is proving to be challenging. To address this, we present a multi-memristive synaptic architecture with an efficient global counter-based arbitration scheme. We focus on phase change memory devices, develop a comprehensive model and demonstrate via simulations the effectiveness of the concept for both spiking and non-spiking neural networks. Moreover, we present experimental results involving over a million phase change memory devices for unsupervised learning of temporal correlations using a spiking neural network. The work presents a significant step towards the realization of large-scale and energy-efficient neuromorphic computing systems.

...read moreread less

543 citations

Journal Article•DOI•

Accurate deep neural network inference using computational phase-change memory.

[...]

Vinay Joshi¹, Vinay Joshi², Manuel Le Gallo², Simon Haefeli², Simon Haefeli³, Irem Boybat⁴, Irem Boybat², S. R. Nandakumar², Christophe Piveteau², Christophe Piveteau³, Martino Dazzi³, Martino Dazzi², Bipin Rajendran¹, Abu Sebastian², Evangelos Eleftheriou² - Show less +11 more•Institutions (4)

King's College London¹, IBM², ETH Zurich³, École Polytechnique Fédérale de Lausanne⁴

18 May 2020-Nature Communications

TL;DR: In this article, the authors propose a methodology to train ResNet-type convolutional neural networks that results in no appreciable accuracy loss when transferring weights to phase-change memory (PCM) devices.

...read moreread less

Abstract: In-memory computing using resistive memory devices is a promising non-von Neumann approach for making energy-efficient deep learning inference hardware. However, due to device variability and noise, the network needs to be trained in a specific way so that transferring the digitally trained weights to the analog resistive memory devices will not result in significant loss of accuracy. Here, we introduce a methodology to train ResNet-type convolutional neural networks that results in no appreciable accuracy loss when transferring weights to phase-change memory (PCM) devices. We also propose a compensation technique that exploits the batch normalization parameters to improve the accuracy retention over time. We achieve a classification accuracy of 93.7% on CIFAR-10 and a top-1 accuracy of 71.6% on ImageNet benchmarks after mapping the trained weights to PCM. Our hardware results on CIFAR-10 with ResNet-32 demonstrate an accuracy above 93.5% retained over a one-day period, where each of the 361,722 synaptic weights is programmed on just two PCM devices organized in a differential configuration.

...read moreread less

206 citations

1
2
3
4
…
5
6
7
8
9
10
11

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

Principles of Neural Science

[...]

Michael P. Alexander

06 Jun 1986-JAMA

TL;DR: The editors have done a masterful job of weaving together the biologic, the behavioral, and the clinical sciences into a single tapestry in which everyone from the molecular biologist to the practicing psychiatrist can find and appreciate his or her own research.

...read moreread less

Abstract: I have developed "tennis elbow" from lugging this book around the past four weeks, but it is worth the pain, the effort, and the aspirin. It is also worth the (relatively speaking) bargain price. Including appendixes, this book contains 894 pages of text. The entire panorama of the neural sciences is surveyed and examined, and it is comprehensive in its scope, from genomes to social behaviors. The editors explicitly state that the book is designed as "an introductory text for students of biology, behavior, and medicine," but it is hard to imagine any audience, interested in any fragment of neuroscience at any level of sophistication, that would not enjoy this book. The editors have done a masterful job of weaving together the biologic, the behavioral, and the clinical sciences into a single tapestry in which everyone from the molecular biologist to the practicing psychiatrist can find and appreciate his or

...read moreread less

7,563 citations

Journal Article•DOI•

ISAAC: a convolutional neural network accelerator with in-situ analog arithmetic in crossbars

[...]

Ali Shafiee¹, Anirban Nag¹, Naveen Muralimanohar², Rajeev Balasubramonian¹, John Paul Strachan², Miao Hu², R. Stanley Williams², Vivek Srikumar¹ - Show less +4 more•Institutions (2)

University of Utah¹, Hewlett-Packard²

18 Jun 2016

TL;DR: This work explores an in-situ processing approach, where memristor crossbar arrays not only store input weights, but are also used to perform dot-product operations in an analog manner.

...read moreread less

Abstract: A number of recent efforts have attempted to design accelerators for popular machine learning algorithms, such as those involving convolutional and deep neural networks (CNNs and DNNs). These algorithms typically involve a large number of multiply-accumulate (dot-product) operations. A recent project, DaDianNao, adopts a near data processing approach, where a specialized neural functional unit performs all the digital arithmetic operations and receives input weights from adjacent eDRAM banks.This work explores an in-situ processing approach, where memristor crossbar arrays not only store input weights, but are also used to perform dot-product operations in an analog manner. While the use of crossbar memory as an analog dot-product engine is well known, no prior work has designed or characterized a full-fledged accelerator based on crossbars. In particular, our work makes the following contributions: (i) We design a pipelined architecture, with some crossbars dedicated for each neural network layer, and eDRAM buffers that aggregate data between pipeline stages. (ii) We define new data encoding techniques that are amenable to analog computations and that can reduce the high overheads of analog-to-digital conversion (ADC). (iii) We define the many supporting digital components required in an analog CNN accelerator and carry out a design space exploration to identify the best balance of memristor storage/compute, ADCs, and eDRAM storage on a chip. On a suite of CNN and DNN workloads, the proposed ISAAC architecture yields improvements of 14.8×, 5.5×, and 7.5× in throughput, energy, and computational density (respectively), relative to the state-of-the-art DaDianNao architecture.

...read moreread less

1,558 citations

Journal Article•DOI•

Machine learning and the physical sciences

[...]

Giuseppe Carleo, J. Ignacio Cirac¹, Kyle Cranmer², Laurent Daudet, Maria Schuld³, Naftali Tishby⁴, Leslie Vogt-Maranto², Lenka Zdeborová⁵ - Show less +4 more•Institutions (5)

Max Planck Society¹, New York University², University of KwaZulu-Natal³, Hebrew University of Jerusalem⁴, Université Paris-Saclay⁵

06 Dec 2019-Reviews of Modern Physics

TL;DR: This article reviews in a selective way the recent research on the interface between machine learning and the physical sciences, including conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross fertilization between the two fields.

...read moreread less

Abstract: Machine learning (ML) encompasses a broad range of algorithms and modeling tools used for a vast array of data processing tasks, which has entered most scientific disciplines in recent years. This article reviews in a selective way the recent research on the interface between machine learning and the physical sciences. This includes conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross fertilization between the two fields. After giving a basic notion of machine learning methods and principles, examples are described of how statistical physics is used to understand methods in ML. This review then describes applications of ML methods in particle physics and cosmology, quantum many-body physics, quantum computing, and chemical and material physics. Research and development into novel computing architectures aimed at accelerating ML are also highlighted. Each of the sections describe recent successes as well as domain-specific methodology and challenges.

...read moreread less

1,504 citations

Journal Article•DOI•

The future of electronics based on memristive systems

[...]

Mohammed A. Zidan¹, John Paul Strachan², Wei Lu¹•Institutions (2)

University of Michigan¹, Hewlett-Packard²

01 Jan 2018

TL;DR: The state of the art in memristor-based electronics is evaluated and the future development of such devices in on-chip memory, biologically inspired computing and general-purpose in-memory computing is explored.

...read moreread less

Abstract: A memristor is a resistive device with an inherent memory. The theoretical concept of a memristor was connected to physically measured devices in 2008 and since then there has been rapid progress in the development of such devices, leading to a series of recent demonstrations of memristor-based neuromorphic hardware systems. Here, we evaluate the state of the art in memristor-based electronics and explore where the future of the field lies. We highlight three areas of potential technological impact: on-chip memory and storage, biologically inspired computing and general-purpose in-memory computing. We analyse the challenges, and possible solutions, associated with scaling the systems up for practical applications, and consider the benefits of scaling the devices down in terms of geometry and also in terms of obtaining fundamental control of the atomic-level dynamics. Finally, we discuss the ways we believe biology will continue to provide guiding principles for device innovation and system optimization in the field. This Perspective evaluates the state of the art in memristor-based electronics and explores the future development of such devices in on-chip memory, biologically inspired computing and general-purpose in-memory computing.

...read moreread less

1,231 citations

Journal Article•DOI•

PRIME: a novel processing-in-memory architecture for neural network computation in ReRAM-based main memory

[...]

Ping Chi¹, Shuangchen Li¹, Cong Xu², Tao Zhang³, Jishen Zhao¹, Yongpan Liu⁴, Yu Wang⁴, Yuan Xie¹ - Show less +4 more•Institutions (4)

University of California¹, Hewlett-Packard², Nvidia³, Tsinghua University⁴

18 Jun 2016

TL;DR: This work proposes a novel PIM architecture, called PRIME, to accelerate NN applications in ReRAM based main memory, and distinguishes itself from prior work on NN acceleration, with significant performance improvement and energy saving.

...read moreread less

Abstract: Processing-in-memory (PIM) is a promising solution to address the "memory wall" challenges for future computer systems. Prior proposed PIM architectures put additional computation logic in or near memory. The emerging metal-oxide resistive random access memory (ReRAM) has showed its potential to be used for main memory. Moreover, with its crossbar array structure, ReRAM can perform matrix-vector multiplication efficiently, and has been widely studied to accelerate neural network (NN) applications. In this work, we propose a novel PIM architecture, called PRIME, to accelerate NN applications in ReRAM based main memory. In PRIME, a portion of ReRAM crossbar arrays can be configured as accelerators for NN applications or as normal memory for a larger memory space. We provide microarchitecture and circuit designs to enable the morphable functions with an insignificant area overhead. We also design a software/hardware interface for software developers to implement various NNs on PRIME. Benefiting from both the PIM architecture and the efficiency of using ReRAM for NN computation, PRIME distinguishes itself from prior work on NN acceleration, with significant performance improvement and energy saving. Our experimental results show that, compared with a state-of-the-art neural processing unit design, PRIME improves the performance by ~2360× and the energy consumption by ~895×, across the evaluated machine learning benchmarks.

...read moreread less

1,197 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse