Joo-Young Kim
Researcher at KAIST
Publications - 91
Citations - 3740
Joo-Young Kim is an academic researcher at KAIST. The author has contributed to research topics including computer science and network-on-chip. The author has an h-index of 20 and has co-authored 68 publications receiving 3,415 citations. Previous affiliations of Joo-Young Kim include Microsoft.
Papers
Journal ArticleDOI
A reconfigurable fabric for accelerating large-scale datacenter services
Andrew Putnam,Adrian M. Caulfield,Eric S. Chung,Derek Chiou,Kypros Constantinides,John Demme,Hadi Esmaeilzadeh,Jeremy Fowers,Gopi Prashanth Gopal,Jan Gray,Michael Haselman,Scott Hauck,Stephen F. Heil,Amir Hormati,Joo-Young Kim,Sitaram Lanka,James R. Larus,Eric C. Peterson,Simon Pope,Aaron L. Smith,Jason Thong,Phillip Yi Xiao,Doug Burger +22 more
TL;DR: The authors deployed the reconfigurable fabric in a bed of 1,632 servers and FPGAs in a production datacenter and successfully used it to accelerate the ranking portion of the Bing Web search engine by nearly a factor of two.
Journal ArticleDOI
A reconfigurable fabric for accelerating large-scale datacenter services
Andrew Putnam,Adrian M. Caulfield,Eric S. Chung,Derek Chiou,Kypros Constantinides,John Demme,Hadi Esmaeilzadeh,Jeremy Fowers,Gopi Prashanth Gopal,Jan Gray,Michael Haselman,Scott Hauck,Stephen F. Heil,Amir Hormati,Joo-Young Kim,Sitaram Lanka,James R. Larus,Eric C. Peterson,Simon Pope,Aaron L. Smith,Jason Thong,Phillip Yi Xiao,Doug Burger +22 more
TL;DR: The requirements and architecture of the fabric are described, the critical engineering challenges and solutions needed to make the system robust in the presence of failures are detailed, and the performance, power, and resilience of the system when ranking candidate documents are measured.
Proceedings ArticleDOI
A cloud-scale acceleration architecture
Adrian M. Caulfield,Eric S. Chung,Andrew Putnam,Hari Angepat,Jeremy Fowers,Michael Haselman,Stephen F. Heil,Matt Humphrey,Puneet Kaur,Joo-Young Kim,Daniel Lo,Todd Massengill,Kalin Ovtcharov,Michael K. Papamichael,Lisa Woods,Sitaram Lanka,Derek Chiou,Doug Burger +17 more
TL;DR: A new cloud architecture uses reconfigurable logic to accelerate both network-plane functions and applications, and is much more scalable than prior work, which used secondary rack-scale networks for inter-FPGA communication.
Accelerating Deep Convolutional Neural Networks Using Specialized Hardware
TL;DR: Hardware specialization in the form of GPGPUs, FPGAs, and ASICs offers a promising path toward major leaps in processing capability while achieving high energy efficiency, and combining multiple FPGAs over a low-latency communication fabric offers a further opportunity to train and evaluate models of unprecedented size and quality.
Journal ArticleDOI
A 201.4 GOPS 496 mW Real-Time Multi-Object Recognition Processor With Bio-Inspired Neural Perception Engine
Joo-Young Kim,Minsu Kim,Seungjin Lee,Jinwook Oh,Kwanho Kim,Sejong Oh,Jeong-Ho Woo,Dong-Hyun Kim,Hoi-Jun Yoo +8 more
TL;DR: In the proposed hardware architecture, three recognition tasks (visual perception, descriptor generation, and object decision) are mapped directly to the neural perception engine, 16 SIMD processors comprising 128 processing elements, and the decision processor, and are executed in a pipeline to maximize object-recognition throughput.
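The three-stage pipelining described above can be sketched in software. The following is a minimal, hypothetical illustration (not the paper's implementation): each stage runs concurrently on its own worker and passes results downstream through a queue, so a new frame can enter the first stage while earlier frames are still being processed, which is what raises throughput. The stage functions (`visual_perception`, `descriptor_generation`, `object_decision`) are toy stand-ins invented here for illustration.

```python
from queue import Queue
from threading import Thread

SENTINEL = None  # marks the end of the frame stream


def stage(fn, inq, outq):
    """Generic pipeline stage: apply fn to each item, forward the sentinel."""
    while True:
        item = inq.get()
        if item is SENTINEL:
            outq.put(SENTINEL)
            return
        outq.put(fn(item))


# Hypothetical stand-ins for the three recognition tasks.
def visual_perception(frame):
    # "Attention": keep only salient (positive) values.
    return [x for x in frame if x > 0]


def descriptor_generation(region):
    # "Descriptor": summarize the region as a single value.
    return sum(region)


def object_decision(descriptor):
    # "Decision": threshold the descriptor.
    return "object" if descriptor >= 10 else "background"


def run_pipeline(frames):
    q0, q1, q2, q3 = Queue(), Queue(), Queue(), Queue()
    workers = [
        Thread(target=stage, args=(visual_perception, q0, q1)),
        Thread(target=stage, args=(descriptor_generation, q1, q2)),
        Thread(target=stage, args=(object_decision, q2, q3)),
    ]
    for t in workers:
        t.start()
    for f in frames:
        q0.put(f)
    q0.put(SENTINEL)
    results = []
    while (r := q3.get()) is not SENTINEL:
        results.append(r)
    for t in workers:
        t.join()
    return results


print(run_pipeline([[5, -1, 6], [1, 2, -3]]))  # -> ['object', 'background']
```

In the actual chip the stages are dedicated hardware blocks rather than threads, but the throughput argument is the same: with S stages overlapped, steady-state throughput is set by the slowest stage rather than by the sum of all stage latencies.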