scispace - formally typeset
S

Shoaib Kamil

Researcher at Adobe Systems

Publications -  84
Citations -  4429

Shoaib Kamil is an academic researcher from Adobe Systems. The author has contributed to research in topics: Compiler & Code generation. The author has an hindex of 29, co-authored 80 publications receiving 3773 citations. Previous affiliations of Shoaib Kamil include Lawrence Berkeley National Laboratory & University of California, Berkeley.

Papers
More filters
Proceedings ArticleDOI

OpenTuner: an extensible framework for program autotuning

TL;DR: The efficacy and generality of OpenTuner are demonstrated by building autotuners for 7 distinct projects and 16 total benchmarks, showing speedups over prior techniques of these projects of up to 2.8χ with little programmer effort.

The Potential of the Cell Processor for Scientific Computing

TL;DR: In this article, the authors examined the potential of using the STI Cell processor as a building block for future high-end computing systems and proposed modest microarchitectural modifications that could significantly increase the efficiency of double-precision calculations.
Proceedings ArticleDOI

The potential of the cell processor for scientific computing

TL;DR: This work introduces a performance model for Cell and applies it to several key scientific computing kernels: dense matrix multiply, sparse matrix vector multiply, stencil computations, and 1D/2D FFTs, and proposes modest microarchitectural modifications that could significantly increase the efficiency of double-precision calculations.
Journal ArticleDOI

Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors

TL;DR: Results demonstrate that recent trends in memory system organization have reduced the eficacy of traditional cache- blocking optimizations, and represent one of the most extensive analyses of stencil optimizations and performance modeling to date.
Proceedings ArticleDOI

An auto-tuning framework for parallel multicore stencil computations

TL;DR: In this article, the authors present a stencil auto-tuning framework that significantly advances programmer productivity by automatically converting a straightforward sequential Fortran 95 stencil expression into tuned parallel implementations in Fortran, C, or CUDA.