scispace - formally typeset
S

Stephen W. Keckler

Researcher at Nvidia

Publications -  200
Citations -  17541

Stephen W. Keckler is an academic researcher from Nvidia. The author has contributed to research in topics: Cache & Thread (computing). The author has an hindex of 62, co-authored 191 publications receiving 15480 citations. Previous affiliations of Stephen W. Keckler include Massachusetts Institute of Technology & Stanford University.

Papers
More filters
Proceedings ArticleDOI

Modeling the effect of technology trends on the soft error rate of combinational logic

TL;DR: An end-to-end model is described and validated that enables us to compute the soft error rates (SER) for existing and future microprocessor-style designs and predicts that the SER per chip of logic circuits will increase nine orders of magnitude from 1992 to 2011 and at that point will be comparable to the SERper chip of unprotected memory elements.
Proceedings ArticleDOI

SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks

TL;DR: The Sparse CNN (SCNN) accelerator as discussed by the authors employs a dataflow that enables maintaining the sparse weights and activations in a compressed encoding, which eliminates unnecessary data transfers and reduces storage requirements.
Proceedings ArticleDOI

An adaptive, non-uniform cache structure for wire-delay dominated on-chip caches

TL;DR: This paper proposes physical designs for these Non-Uniform Cache Architectures (NUCAs) and extends these physical designs with logical policies that allow important data to migrate toward the processor within the same level of the cache.
Proceedings ArticleDOI

Clock rate versus IPC: the end of the road for conventional microarchitectures

TL;DR: This paper describes technology-driven models for wire capacitance wire delay, and microarchitectural component delay and finds that no scaling strategy permits annual performance improvements of better than 12.5% which is far worse than the annual 50-60% to which the authors have grown accustomed.
Journal ArticleDOI

GPUs and the Future of Parallel Computing

TL;DR: The capabilities of state-of-the art GPU-based high-throughput computing systems are discussed and the challenges to scaling single-chip parallel-computing systems are considered, highlighting high-impact areas that the computing research community can address.