scispace - formally typeset
Proceedings ArticleDOI

1.1 Computing's energy problem (and what we can do about it)

Reads0
Chats0
TLDR
If the drive for performance and the end of voltage scaling have made power, and not the number of transistors, the principal factor limiting further improvements in computing performance, a new wave of innovative and efficient computing devices will be created.
Abstract
Our challenge is clear: The drive for performance and the end of voltage scaling have made power, and not the number of transistors, the principal factor limiting further improvements in computing performance. Continuing to scale compute performance will require the creation and effective use of new specialized compute engines, and will require the participation of application experts to be successful. If we play our cards right, and develop the tools that allow our customers to become part of the design process, we will create a new wave of innovative and efficient computing devices.

read more

Citations
More filters
Proceedings ArticleDOI

Moving CNN Accelerator Computations Closer to Data

TL;DR: This work re-purpose the CPU's last level cache to perform in-situ dot-product computations, thus significantly reducing data movement and boosting throughput.
Proceedings ArticleDOI

Fast FPGA-Based Emulation for ReRAM-Enabled Deep Neural Network Accelerator

TL;DR: The proposed emulation helps to build better ReRAM accelerators for large DNNs with much higher speed and flexibility and is co-designed with runtime software stacks to make the hardware emulation more flexible via instruction compilation and scheduling for different DNN needs.
Journal ArticleDOI

Sparse-PE: A Performance-Efficient Processing Engine Core for Sparse Convolutional Neural Networks

TL;DR: The Sparse-PE core as mentioned in this paper uses binary mask representation and actively skips computations involving zeros and favors non-zero computations, thereby, drastically increasing the effective throughput and hardware utilization.
Journal ArticleDOI

Multifunctional computing-in-memory SRAM cells based on two-surface-channel MoS2 transistors

TL;DR: In this paper, two kinds of computing-in-memory designs based on two-surface-channel MoS2 transistors are proposed: symmetrical 4T2R Static Random-Access Memory (SRAM) cell and skewed 3T3R SRAM cell, where the symmetrical SRAM cells can realize in-memory XNOR/XOR computations and the skewed SRAMcell can achieve inmemory NAND/NOR computations.
References
More filters
Journal ArticleDOI

Design of ion-implanted MOSFET's with very small physical dimensions

TL;DR: This paper considers the design, fabrication, and characterization of very small Mosfet switching devices suitable for digital integrated circuits, using dimensions of the order of 1 /spl mu/.
Book

Low Power Digital CMOS Design

TL;DR: The Hierarchy of Limits of Power J.D. Stratakos, et al., and Low Power Programmable Computation coauthored with M.B. Srivastava, provide a review of the main approaches to Voltage Scaling Approaches.
Journal ArticleDOI

Towards energy-proportional datacenter memory with mobile DRAM

TL;DR: This work architects server memory systems using mobile DRAM devices, trading peak bandwidth for lower energy consumption per bit and more efficient idle modes, and demonstrates 3-5× lower memory power, better proportionality, and negligible performance penalties for data-center workloads.
Related Papers (5)