scispace - formally typeset
J

Joel Emer

Researcher at Massachusetts Institute of Technology

Publications -  187
Citations -  22934

Joel Emer is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Cache & Cache pollution. The author has an hindex of 59, co-authored 172 publications receiving 18023 citations. Previous affiliations of Joel Emer include Intel & Hewlett-Packard.

Papers
More filters
Journal ArticleDOI

Efficient Processing of Deep Neural Networks: A Tutorial and Survey

TL;DR: In this paper, the authors provide a comprehensive tutorial and survey about the recent advances toward the goal of enabling efficient processing of DNNs, and discuss various hardware platforms and architectures that support DNN, and highlight key trends in reducing the computation cost of deep neural networks either solely via hardware design changes or via joint hardware and DNN algorithm changes.
Journal ArticleDOI

Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks

TL;DR: Eyeriss as mentioned in this paper is an accelerator for state-of-the-art deep convolutional neural networks (CNNs) that optimizes for the energy efficiency of the entire system, including the accelerator chip and off-chip DRAM, by reconfiguring the architecture.
Journal ArticleDOI

Eyeriss: a spatial architecture for energy-efficient dataflow for convolutional neural networks

TL;DR: A novel dataflow, called row-stationary (RS), is presented, that minimizes data movement energy consumption on a spatial architecture and can adapt to different CNN shape configurations and reduces all types of data movement through maximally utilizing the processing engine local storage, direct inter-PE communication and spatial parallelism.
Proceedings ArticleDOI

A systematic methodology to compute the architectural vulnerability factors for a high-performance microprocessor

TL;DR: This paper identifies numerous cases, such as prefetches, dynamicallydead code, and wrong-path instructions, in which a fault will not affect correct execution, and shows AVFs of 28% and 9% for the instruction queue and execution units, respectively,averaged across dynamic sections of the entire CPU2000benchmark suite.
Proceedings ArticleDOI

Exploiting Choice: Instruction Fetch and Issue on an Implementable Simultaneous Multithreading Processor

TL;DR: This paper presents an architecture for simultaneous multithreading that minimizes the architectural impact on the conventional superscalar design, has minimal performance impact on a single thread executing alone, and achieves significant throughput gains when running multiple threads.