scispace - formally typeset
S

Stephen Richardson

Researcher at Stanford University

Publications -  43
Citations -  1646

Stephen Richardson is an academic researcher from Stanford University. The author has contributed to research in topics: Compiler & Cache. The author has an hindex of 20, co-authored 39 publications receiving 1452 citations. Previous affiliations of Stephen Richardson include Sun Microsystems & Sun Microsystems Laboratories.

Papers
More filters
Proceedings ArticleDOI

Understanding sources of inefficiency in general-purpose chips

TL;DR: The sources of these performance and energy overheads in general-purpose processing systems are explored by quantifying the overheads of a 720p HD H.264 encoder running on a general- Purpose CMP system and exploring methods to eliminate these overheads by transforming the CPU into a specialized system for H. 264 encoding.
Journal ArticleDOI

Dark Memory and Accelerator-Rich System Optimization in the Dark Silicon Era

TL;DR: The dark memory state and present Pareto curves for compute units, accelerators, and on-chip memory, and motivates the need for HW/SW codesign for parallelism and locality are discussed.
Journal ArticleDOI

Rethinking Digital Design: Why Design Must Change

TL;DR: Domain-specific chip generators are templates that codify designer knowledge and design trade-offs to create different application-optimized chips to reduce design costs.

Caching Function Results: Faster Arithmetic by Avoiding Unnecessary Computation

TL;DR: Using two separate benchmark suites, the SPEC benchmarks and the Perfect Club, and concentrating on multiplication, a surprising amount of trivial and redundant operation is found.
Journal ArticleDOI

Programming Heterogeneous Systems from an Image Processing DSL

TL;DR: The image processing language Halide is extended so users can specify which portions of their applications should become hardware accelerators, and a compiler is provided that uses this code to automatically create the accelerator along with the “glue” code needed for the user’s application to access this hardware.