Deep learning with coherent nanophotonic circuits

doi:10.1038/NPHOTON.2017.93

Home
/
Papers
/
Deep learning with coherent nanophotonic circuits

Journal Article•DOI•

Deep learning with coherent nanophotonic circuits

Yichen Shen¹, Nicholas C. Harris¹, Scott Skirlo¹, Dirk Englund¹, Marin Soljacic¹ - Show less +1 more•Institutions (1)

Massachusetts Institute of Technology¹

01 Jul 2017-Vol. 11, Iss: 7, pp 441-446

TL;DR: A new architecture for a fully optical neural network is demonstrated that enables a computational speed enhancement of at least two orders of magnitude and three order of magnitude in power efficiency over state-of-the-art electronics.

read less

Abstract: Artificial Neural Networks have dramatically improved performance for many machine learning tasks. We demonstrate a new architecture for a fully optical neural network that enables a computational speed enhancement of at least two orders of magnitude and three orders of magnitude in power efficiency over state-of-the-art electronics.

...read moreread less

Citations

PDF

Open Access

More filters

Journal Article•DOI•

Machine learning and the physical sciences

[...]

Giuseppe Carleo, J. Ignacio Cirac¹, Kyle Cranmer², Laurent Daudet, Maria Schuld³, Naftali Tishby⁴, Leslie Vogt-Maranto², Lenka Zdeborová⁵ - Show less +4 more•Institutions (5)

Max Planck Society¹, New York University², University of KwaZulu-Natal³, Hebrew University of Jerusalem⁴, Université Paris-Saclay⁵

06 Dec 2019-Reviews of Modern Physics

TL;DR: This article reviews in a selective way the recent research on the interface between machine learning and the physical sciences, including conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross fertilization between the two fields.

...read moreread less

Abstract: Machine learning (ML) encompasses a broad range of algorithms and modeling tools used for a vast array of data processing tasks, which has entered most scientific disciplines in recent years. This article reviews in a selective way the recent research on the interface between machine learning and the physical sciences. This includes conceptual developments in ML motivated by physical insights, applications of machine learning techniques to several domains in physics, and cross fertilization between the two fields. After giving a basic notion of machine learning methods and principles, examples are described of how statistical physics is used to understand methods in ML. This review then describes applications of ML methods in particle physics and cosmology, quantum many-body physics, quantum computing, and chemical and material physics. Research and development into novel computing architectures aimed at accelerating ML are also highlighted. Each of the sections describe recent successes as well as domain-specific methodology and challenges.

...read moreread less

1,504 citations

Journal Article•DOI•

Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages

[...]

Cheng Wang¹, Cheng Wang², Mian Zhang², Xi Chen³, Maxime Bertrand⁴, Maxime Bertrand², Amirhassan Shams-Ansari², Amirhassan Shams-Ansari⁵, Sethumadhavan Chandrasekhar³, Peter J. Winzer³, Marko Loncar² - Show less +7 more•Institutions (5)

City University of Hong Kong¹, Harvard University², Bell Labs³, University of Bordeaux⁴, University of Washington⁵

24 Sep 2018-Nature

TL;DR: Monolithically integrated lithium niobate electro-optic modulators that feature a CMOS-compatible drive voltage, support data rates up to 210 gigabits per second and show an on-chip optical loss of less than 0.5 decibels are demonstrated.

...read moreread less

Abstract: Electro-optic modulators translate high-speed electronic signals into the optical domain and are critical components in modern telecommunication networks1,2 and microwave-photonic systems3,4. They are also expected to be building blocks for emerging applications such as quantum photonics5,6 and non-reciprocal optics7,8. All of these applications require chip-scale electro-optic modulators that operate at voltages compatible with complementary metal–oxide–semiconductor (CMOS) technology, have ultra-high electro-optic bandwidths and feature very low optical losses. Integrated modulator platforms based on materials such as silicon, indium phosphide or polymers have not yet been able to meet these requirements simultaneously because of the intrinsic limitations of the materials used. On the other hand, lithium niobate electro-optic modulators, the workhorse of the optoelectronic industry for decades9, have been challenging to integrate on-chip because of difficulties in microstructuring lithium niobate. The current generation of lithium niobate modulators are bulky, expensive, limited in bandwidth and require high drive voltages, and thus are unable to reach the full potential of the material. Here we overcome these limitations and demonstrate monolithically integrated lithium niobate electro-optic modulators that feature a CMOS-compatible drive voltage, support data rates up to 210 gigabits per second and show an on-chip optical loss of less than 0.5 decibels. We achieve this by engineering the microwave and photonic circuits to achieve high electro-optical efficiencies, ultra-low optical losses and group-velocity matching simultaneously. Our scalable modulator devices could provide cost-effective, low-power and ultra-high-speed solutions for next-generation optical communication networks and microwave photonic systems. Furthermore, our approach could lead to large-scale ultra-low-loss photonic circuits that are reconfigurable on a picosecond timescale, enabling a wide range of quantum and classical applications5,10,11 including feed-forward photonic quantum computation. Chip-scale lithium niobate electro-optic modulators that rapidly convert electrical to optical signals and use CMOS-compatible voltages could prove useful in optical communication networks, microwave photonic systems and photonic computation.

...read moreread less

1,358 citations

Journal Article•DOI•

All-optical machine learning using diffractive deep neural networks

[...]

Xing Lin¹, Yair Rivenson¹, Nezih T. Yardimci¹, Muhammed Veli¹, Yi Luo¹, Mona Jarrahi¹, Aydogan Ozcan - Show less +3 more•Institutions (1)

University of California, Los Angeles¹

07 Sep 2018-Science

TL;DR: 3D-printed D2NNs are created that implement classification of images of handwritten digits and fashion products, as well as the function of an imaging lens at a terahertz spectrum.

...read moreread less

Abstract: Deep learning has been transforming our ability to execute advanced inference tasks using computers. Here we introduce a physical mechanism to perform machine learning by demonstrating an all-optical diffractive deep neural network (D2NN) architecture that can implement various functions following the deep learning-based design of passive diffractive layers that work collectively. We created 3D-printed D2NNs that implement classification of images of handwritten digits and fashion products, as well as the function of an imaging lens at a terahertz spectrum. Our all-optical deep learning framework can perform, at the speed of light, various complex functions that computer-based neural networks can execute; will find applications in all-optical image analysis, feature detection, and object classification; and will also enable new camera designs and optical components that perform distinctive tasks using D2NNs.

...read moreread less

1,145 citations

Journal Article•DOI•

All-optical spiking neurosynaptic networks with self-learning capabilities.

[...]

Johannes Feldmann¹, Nathan Youngblood², C.D. Wright³, Harish Bhaskaran², Wolfram H. P. Pernice¹ - Show less +1 more•Institutions (3)

University of Münster¹, University of Oxford², University of Exeter³

08 May 2019-Nature

TL;DR: An optical version of a brain-inspired neurosynaptic system, using wavelength division multiplexing techniques, is presented that is capable of supervised and unsupervised learning.

...read moreread less

Abstract: Software implementations of brain-inspired computing underlie many important computational tasks, from image processing to speech recognition, artificial intelligence and deep learning applications. Yet, unlike real neural tissue, traditional computing architectures physically separate the core computing functions of memory and processing, making fast, efficient and low-energy computing difficult to achieve. To overcome such limitations, an attractive alternative is to design hardware that mimics neurons and synapses. Such hardware, when connected in networks or neuromorphic systems, processes information in a way more analogous to brains. Here we present an all-optical version of such a neurosynaptic system, capable of supervised and unsupervised learning. We exploit wavelength division multiplexing techniques to implement a scalable circuit architecture for photonic neural networks, successfully demonstrating pattern recognition directly in the optical domain. Such photonic neurosynaptic networks promise access to the high speed and high bandwidth inherent to optical systems, thus enabling the direct processing of optical telecommunication and visual data. An optical version of a brain-inspired neurosynaptic system, using wavelength division multiplexing techniques, is presented that is capable of supervised and unsupervised learning.

...read moreread less

862 citations

Journal Article•DOI•

Training Deep Neural Networks for the Inverse Design of Nanophotonic Structures

[...]

Dianjing Liu¹, Yixuan Tan¹, Erfan Khoram¹, Zongfu Yu¹•Institutions (1)

University of Wisconsin-Madison¹

25 Feb 2018-ACS Photonics

TL;DR: A tandem neural network architecture is demonstrated that tolerates inconsistent training instances in inverse design of nanophotonic devices and provides a way to train large neural networks for the inverseDesign of complex photonic structures.

...read moreread less

Abstract: Data inconsistency leads to a slow training process when deep neural networks are used for the inverse design of photonic devices, an issue that arises from the fundamental property of nonuniqueness in all inverse scattering problems. Here we show that by combining forward modeling and inverse design in a tandem architecture, one can overcome this fundamental issue, allowing deep neural networks to be effectively trained by data sets that contain nonunique electromagnetic scattering instances. This paves the way for using deep neural networks to design complex photonic structures that require large training data sets.

...read moreread less

619 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

References

PDF

Open Access

More filters

Proceedings Article•

Unitary evolution recurrent neural networks

[...]

Martin Arjovsky¹, Amar Shah¹, Yoshua Bengio²•Institutions (2)

University of Buenos Aires¹, Université de Montréal²

19 Jun 2016

TL;DR: This work constructs an expressive unitary weight matrix by composing several structured matrices that act as building blocks with parameters to be learned, and demonstrates the potential of this architecture by achieving state of the art results in several hard tasks involving very long-term dependencies.

...read moreread less

Abstract: Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and exploding gradients, especially when trying to learn long-term dependencies. To circumvent this problem, we propose a new architecture that learns a unitary weight matrix, with eigenvalues of absolute value exactly 1. The challenge we address is that of parametrizing unitary matrices in a way that does not require expensive computations (such as eigendecomposition) after each weight update. We construct an expressive unitary weight matrix by composing several structured matrices that act as building blocks with parameters to be learned. Optimization with this parameterization becomes feasible only when considering hidden states in the complex domain. We demonstrate the potential of this architecture by achieving state of the art results in several hard tasks involving very longterm dependencies.

...read moreread less

630 citations

"Deep learning with coherent nanopho..." refers background in this paper

...Furthermore, with this on-chip training scheme, one can readily parametrize and train unitary matrices–an approach known to be particularly useful for deep neural networks [42]....
[...]

Journal Article•DOI•

Optoelectronic reservoir computing.

[...]

Yvan Paquot¹, Francois Duport¹, Anteo Smerieri¹, Joni Dambre², Benjamin Schrauwen², Marc Haelterman¹, Serge Massar¹ - Show less +3 more•Institutions (2)

Université libre de Bruxelles¹, Ghent University²

27 Feb 2012-Scientific Reports

TL;DR: This work reports an optoelectronic implementation of reservoir computing based on a recently proposed architecture consisting of a single non linear node and a delay line that is sufficiently fast for real time information processing.

...read moreread less

Abstract: Reservoir computing is a recently introduced, highly efficient bio-inspired approach for processing time dependent data. The basic scheme of reservoir computing consists of a non linear recurrent dynamical system coupled to a single input layer and a single output layer. Within these constraints many implementations are possible. Here we report an optoelectronic implementation of reservoir computing based on a recently proposed architecture consisting of a single non linear node and a delay line. Our implementation is sufficiently fast for real time information processing. We illustrate its performance on tasks of practical importance such as nonlinear channel equalization and speech recognition, and obtain results comparable to state of the art digital implementations.

...read moreread less

606 citations

Journal Article•DOI•

Optical implementation of the Hopfield model.

[...]

Nabil H. Farhat¹, Demetri Psaltis², Aluizio Prata², Eung Gi Paek²•Institutions (2)

University of Pennsylvania¹, California Institute of Technology²

15 May 1985-Applied Optics

TL;DR: Numerical and experimental results presented show that the approach is capable of introducing accuracy and robustness to optical processing while maintaining the traditional advantages of optics, namely, parallelism and massive interconnection capability.

...read moreread less

Abstract: Optical implementation of content addressable associative memory based on the Hopfield model for neural networks and on the addition of nonlinear iterative feedback to a vector–matrix multiplier is described. Numerical and experimental results presented show that the approach is capable of introducing accuracy and robustness to optical processing while maintaining the traditional advantages of optics, namely, parallelism and massive interconnection capability. Moreover a potentially useful link between neural processing and optics that can be of interest in pattern recognition and machine vision is established.

...read moreread less

584 citations

Journal Article•DOI•

Zero-bias 40Gbit/s germanium waveguide photodetector on silicon.

[...]

Laurent Vivien¹, A. Polzer², Delphine Marris-Morini¹, Johann Osmond¹, Jean-Michel Hartmann, Paul Crozat¹, Eric Cassan¹, Christophe Kopp, Horst Zimmermann², Jean-Marc Fedeli - Show less +6 more•Institutions (2)

Centre national de la recherche scientifique¹, Vienna University of Technology²

16 Jan 2012-Optics Express

TL;DR: A very high optical bandwidth, estimated up to 120GHz, was evidenced in 10 µm long Ge photodetectors selectively grown at the end of silicon waveguides using three kinds of experimental set-ups.

...read moreread less

Abstract: We report on lateral pin germanium photodetectors selectively grown at the end of silicon waveguides. A very high optical bandwidth, estimated up to 120GHz, was evidenced in 10 µm long Ge photodetectors using three kinds of experimental set-ups. In addition, a responsivity of 0.8 A/W at 1550 nm was measured. An open eye diagrams at 40Gb/s were demonstrated under zero-bias at a wavelength of 1.55 µm.

...read moreread less

417 citations

Journal Article•DOI•

Monolayer Graphene as a Saturable Absorber in a Mode-Locked Laser

[...]

Qiaoliang Bao¹, Han Zhang², Zhenhua Ni², Yu Wang¹, Lakshminarayana Polavarapu¹, Zexiang Shen², Qing-Hua Xu¹, Dingyuan Tang², Kian Ping Loh¹ - Show less +5 more•Institutions (2)

National University of Singapore¹, Nanyang Technological University²

01 Mar 2011-Nano Research

TL;DR: In this article, the intrinsic properties of monolayer graphene allow it to act as a more effective saturable absorber for mode-locking fiber lasers when compared to multilayer graphene.

...read moreread less

Abstract: We demonstrate that the intrinsic properties of monolayer graphene allow it to act as a more effective saturable absorber for mode-locking fiber lasers when compared to multilayer graphene. The absorption of monolayer graphene can be saturated at lower excitation intensity compared to multilayer graphene, graphene with wrinkle-like defects, or functionalized graphene. Monolayer graphene has a remarkably large modulation depth of 65.9%, whereas the modulation depth of multilayer graphene is greatly reduced due to nonsaturable absorption and scattering loss. Picosecond ultrafast laser pulses (1.23 ps) can be generated using monolayer graphene as a saturable absorber. Due to the ultrafast relaxation time, larger modulation depth and lower scattering loss of monolayer graphene, it performs better than multilayer graphene in terms of pulse shaping ability, pulse stability, and output energy.

...read moreread less

406 citations