Yunsup Lee

Other affiliations: Singapore General Hospital, University of California

Bio: Yunsup Lee is an academic researcher from the University of California, Berkeley. The author has contributed to research on the topics of code generation and compilers, has an h-index of 23, and has co-authored 40 publications receiving 3,801 citations. Previous affiliations of Yunsup Lee include Singapore General Hospital and the University of California.

Topics: Code generation, Compiler, Microarchitecture, Instruction set, RISC-V

Papers

Journal Article

Single-chip microprocessor that communicates directly using light

Chen Sun¹,², Mark T. Wade³, Yunsup Lee¹, Jason S. Orcutt²,⁴, Luca Alloatti², Michael Georgas², Andrew Waterman¹, Jeffrey M. Shainline³,⁴, Rimas Avizienis¹, Sen Lin¹, Benjamin Moss², Rajesh Kumar³, Fabio Pavanello³, Amir H. Atabaki², Henry Cook¹, Albert Ou¹, Jonathan Leu², Yu-Hsin Chen², Krste Asanovic¹, Rajeev J. Ram², Milos A. Popovic³, Vladimir Stojanovic¹

University of California, Berkeley¹, Massachusetts Institute of Technology², University of Colorado Boulder³, National Institute of Standards and Technology⁴

24 Dec 2015-Nature

TL;DR: This demonstration could represent the beginning of an era of chip-scale electronic–photonic systems with the potential to transform computing system architectures, enabling more powerful computers, from network infrastructure to data centres and supercomputers.

Abstract: An electronic–photonic microprocessor chip manufactured using a conventional microelectronics foundry process is demonstrated; the chip contains 70 million transistors and 850 photonic components and directly uses light to communicate to other chips. The rapid transfer of data between chips in computer systems and data centres has become one of the bottlenecks in modern information processing. One way of increasing speeds is to use optical connections rather than electrical wires and the past decade has seen significant efforts to develop silicon-based nanophotonic approaches to integrate such links within silicon chips, but incompatibility between the manufacturing processes used in electronics and photonics has proved a hindrance. Now Chen Sun et al. describe a 'system on a chip' microprocessor that successfully integrates electronics and photonics yet is produced using standard microelectronic chip fabrication techniques. The resulting microprocessor combines 70 million transistors and 850 photonic components and can communicate optically with the outside world. This result promises a way forward for new fast, low-power computing systems architectures. Data transport across short electrical wires is limited by both bandwidth and power density, which creates a performance bottleneck for semiconductor microchips in modern computer systems—from mobile phones to large-scale data centres. These limitations can be overcome [1–3] by using optical communications based on chip-scale electronic–photonic systems [4–7] enabled by silicon-based nanophotonic devices [8]. However, combining electronics and photonics on the same chip has proved challenging, owing to microchip manufacturing conflicts between electronics and photonics. Consequently, current electronic–photonic chips [9–11] are limited to niche manufacturing processes and include only a few optical devices alongside simple circuits.
Here we report an electronic–photonic system on a single chip integrating over 70 million transistors and 850 photonic components that work together to provide logic, memory, and interconnect functions. This system is a realization of a microprocessor that uses on-chip photonic devices to directly communicate with other chips using light. To integrate electronics and photonics at the scale of a microprocessor chip, we adopt a ‘zero-change’ approach to the integration of photonics. Instead of developing a custom process to enable the fabrication of photonics [12], which would complicate or eliminate the possibility of integration with state-of-the-art transistors at large scale and at high yield, we design optical devices using a standard microelectronics foundry process that is used for modern microprocessors [13–16]. This demonstration could represent the beginning of an era of chip-scale electronic–photonic systems with the potential to transform computing system architectures, enabling more powerful computers, from network infrastructure to data centres and supercomputers.

1,058 citations

Proceedings Article

Chisel: constructing hardware in a Scala embedded language

Jonathan Bachrach¹, Huy Vo¹, Brian Richards¹, Yunsup Lee¹, Andrew Waterman¹, Rimas Avizienis¹, John Wawrzynek¹, Krste Asanovic¹

University of California, Berkeley¹

03 Jun 2012

TL;DR: Chisel, a new hardware construction language that supports advanced hardware design using highly parameterized generators and layered domain-specific hardware languages, is introduced by embedding Chisel in the Scala programming language, raising the level of hardware design abstraction.

Abstract: In this paper we introduce Chisel, a new hardware construction language that supports advanced hardware design using highly parameterized generators and layered domain-specific hardware languages. By embedding Chisel in the Scala programming language, we raise the level of hardware design abstraction by providing concepts including object orientation, functional programming, parameterized types, and type inference. Chisel can generate a high-speed C++-based cycle-accurate software simulator, or low-level Verilog designed to map to either FPGAs or to a standard ASIC flow for synthesis. This paper presents Chisel, its embedding in Scala, hardware examples, and results for C++ simulation, Verilog emulation and ASIC synthesis.

697 citations

The RISC-V Instruction Set Manual

Andrew Waterman, Yunsup Lee, David A. Patterson, Krste Asanovic

01 Jan 2014

TL;DR: This draft specification may change before being accepted as standard by the RISC-V Foundation, and it remains possible that implementations made to this draft specification will not conform to the future standard.

Abstract: Volume II: Privileged Architecture, Version 1.10 (Document Version 1.10). Warning! This draft specification may change before being accepted as standard by the RISC-V Foundation. While the editors intend future changes to this specification to be forward compatible, it remains possible that implementations made to this draft specification will not conform to the future standard.

583 citations

Journal Article

PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation

Andreas Klöckner¹, Nicolas Pinto², Yunsup Lee³, Bryan Catanzaro³, Paul Ivanov³, Ahmed Fasih⁴

Courant Institute of Mathematical Sciences¹, McGovern Institute for Brain Research², University of California, Berkeley³, Ohio State University⁴

01 Mar 2012

TL;DR: This article proposes the combination of a dynamic, high-level scripting language with the massive performance of a GPU as a compelling two-tiered computing platform, potentially offering significant performance and productivity advantages over conventional single-tier, static systems.

Abstract: High-performance computing has recently seen a surge of interest in heterogeneous systems, with an emphasis on modern Graphics Processing Units (GPUs). These devices offer tremendous potential for performance and efficiency in important large-scale applications of computational science. However, exploiting this potential can be challenging, as one must adapt to the specialized and rapidly evolving computing environment currently exhibited by GPUs. One way of addressing this challenge is to embrace better techniques and develop tools tailored to their needs. This article presents one simple technique, GPU run-time code generation (RTCG), along with PyCUDA and PyOpenCL, two open-source toolkits that support this technique. In introducing PyCUDA and PyOpenCL, this article proposes the combination of a dynamic, high-level scripting language with the massive performance of a GPU as a compelling two-tiered computing platform, potentially offering significant performance and productivity advantages over conventional single-tier, static systems. The concept of RTCG is simple and easily implemented using existing, robust infrastructure. Nonetheless, it is powerful enough to support (and encourage) the creation of custom application-specific tools by its users. The premise of the paper is illustrated by a wide range of examples where the technique has been applied with considerable success.

561 citations

Report

The RISC-V Instruction Set Manual. Volume 1: User-Level ISA, Version 2.0

Andrew Waterman, Yunsup Lee, David A. Patterson, Krste Asanovic

06 May 2014

TL;DR: RISC-V (pronounced risk-five) is a new instruction set architecture (ISA) that was originally designed to support computer architecture research and education, but which it is hoped will become a standard open architecture for industry implementations.

Abstract: RISC-V (pronounced risk-five) is a new instruction set architecture (ISA) that was originally designed to support computer architecture research and education, but which we now hope will become a standard open architecture for industry implementations. The RISC-V manual is structured in two volumes. This volume covers the user-level ISA design, including optional ISA extensions. The second volume provides examples of supervisor-level ISA design.

233 citations

Cited by

Journal Article

Contour Detection and Hierarchical Image Segmentation

Pablo Arbeláez¹, Michael Maire², Charless C. Fowlkes³, Jitendra Malik¹

University of California, Berkeley¹, California Institute of Technology², University of California, Irvine³

01 May 2011-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper investigates two fundamental problems in computer vision: contour detection and image segmentation and presents state-of-the-art algorithms for both of these tasks.

Abstract: This paper investigates two fundamental problems in computer vision: contour detection and image segmentation. We present state-of-the-art algorithms for both of these tasks. Our contour detector combines multiple local cues into a globalization framework based on spectral clustering. Our segmentation algorithm consists of generic machinery for transforming the output of any contour detector into a hierarchical region tree. In this manner, we reduce the problem of image segmentation to that of contour detection. Extensive experimental evaluation demonstrates that both our contour detection and segmentation methods significantly outperform competing algorithms. The automatically generated hierarchical segmentations can be interactively refined by user-specified annotations. Computation at multiple image resolutions provides a means of coupling our system to recognition applications.

5,068 citations

Journal Article

Deep learning with coherent nanophotonic circuits

Yichen Shen¹, Nicholas C. Harris¹, Scott Skirlo¹, Dirk Englund¹, Marin Soljacic¹

Massachusetts Institute of Technology¹

01 Jul 2017

TL;DR: A new architecture for a fully optical neural network is demonstrated that enables a computational speed enhancement of at least two orders of magnitude and three orders of magnitude in power efficiency over state-of-the-art electronics.

Abstract: Artificial Neural Networks have dramatically improved performance for many machine learning tasks. We demonstrate a new architecture for a fully optical neural network that enables a computational speed enhancement of at least two orders of magnitude and three orders of magnitude in power efficiency over state-of-the-art electronics.

1,955 citations

Journal Article

Integrated lithium niobate electro-optic modulators operating at CMOS-compatible voltages

Cheng Wang¹,², Mian Zhang², Xi Chen³, Maxime Bertrand²,⁴, Amirhassan Shams-Ansari²,⁵, Sethumadhavan Chandrasekhar³, Peter J. Winzer³, Marko Loncar²

City University of Hong Kong¹, Harvard University², Bell Labs³, University of Bordeaux⁴, University of Washington⁵

24 Sep 2018-Nature

TL;DR: Monolithically integrated lithium niobate electro-optic modulators that feature a CMOS-compatible drive voltage, support data rates up to 210 gigabits per second and show an on-chip optical loss of less than 0.5 decibels are demonstrated.

Abstract: Electro-optic modulators translate high-speed electronic signals into the optical domain and are critical components in modern telecommunication networks [1,2] and microwave-photonic systems [3,4]. They are also expected to be building blocks for emerging applications such as quantum photonics [5,6] and non-reciprocal optics [7,8]. All of these applications require chip-scale electro-optic modulators that operate at voltages compatible with complementary metal–oxide–semiconductor (CMOS) technology, have ultra-high electro-optic bandwidths and feature very low optical losses. Integrated modulator platforms based on materials such as silicon, indium phosphide or polymers have not yet been able to meet these requirements simultaneously because of the intrinsic limitations of the materials used. On the other hand, lithium niobate electro-optic modulators, the workhorse of the optoelectronic industry for decades [9], have been challenging to integrate on-chip because of difficulties in microstructuring lithium niobate. The current generation of lithium niobate modulators are bulky, expensive, limited in bandwidth and require high drive voltages, and thus are unable to reach the full potential of the material. Here we overcome these limitations and demonstrate monolithically integrated lithium niobate electro-optic modulators that feature a CMOS-compatible drive voltage, support data rates up to 210 gigabits per second and show an on-chip optical loss of less than 0.5 decibels. We achieve this by engineering the microwave and photonic circuits to achieve high electro-optical efficiencies, ultra-low optical losses and group-velocity matching simultaneously. Our scalable modulator devices could provide cost-effective, low-power and ultra-high-speed solutions for next-generation optical communication networks and microwave photonic systems.
Furthermore, our approach could lead to large-scale ultra-low-loss photonic circuits that are reconfigurable on a picosecond timescale, enabling a wide range of quantum and classical applications [5,10,11] including feed-forward photonic quantum computation. Chip-scale lithium niobate electro-optic modulators that rapidly convert electrical to optical signals and use CMOS-compatible voltages could prove useful in optical communication networks, microwave photonic systems and photonic computation.

1,358 citations

Journal Article

Single-chip microprocessor that communicates directly using light

Chen Sun¹,², Mark T. Wade³, Yunsup Lee², Jason S. Orcutt¹,⁴, Luca Alloatti¹, Michael Georgas¹, Andrew Waterman², Jeffrey M. Shainline³,⁴, Rimas Avizienis², Sen Lin², Benjamin Moss¹, Rajesh Kumar³, Fabio Pavanello³, Amir H. Atabaki¹, Henry Cook², Albert Ou², Jonathan Leu¹, Yu-Hsin Chen¹, Krste Asanovic², Rajeev J. Ram¹, Milos A. Popovic³, Vladimir Stojanovic²

Massachusetts Institute of Technology¹, University of California, Berkeley², University of Colorado Boulder³, National Institute of Standards and Technology⁴

24 Dec 2015-Nature

1,058 citations

Proceedings Article

Structured Forests for Fast Edge Detection

Piotr Dollár¹, C. Lawrence Zitnick¹

Microsoft¹

01 Dec 2013

TL;DR: This paper formulates the problem of predicting local edge masks in a structured learning framework applied to random decision forests and develops a novel approach to learning decision trees that robustly maps the structured labels to a discrete space on which standard information gain measures may be evaluated.

Abstract: Edge detection is a critical component of many vision systems, including object detectors and image segmentation algorithms. Patches of edges exhibit well-known forms of local structure, such as straight lines or T-junctions. In this paper we take advantage of the structure present in local image patches to learn both an accurate and computationally efficient edge detector. We formulate the problem of predicting local edge masks in a structured learning framework applied to random decision forests. Our novel approach to learning decision trees robustly maps the structured labels to a discrete space on which standard information gain measures may be evaluated. The result is an approach that obtains real time performance that is orders of magnitude faster than many competing state-of-the-art approaches, while also achieving state-of-the-art edge detection results on the BSDS500 Segmentation dataset and NYU Depth dataset. Finally, we show the potential of our approach as a general purpose edge detector by showing our learned edge models generalize well across datasets.

981 citations
