Open AccessBook
The software vectorization handbook
About:
The article was published on 2004-05-01 and is currently open access. It has received 54 citations till now. The article focuses on the topics: Image tracing.read more
Citations
More filters
Journal ArticleDOI
Polyhedral parallel code generation for CUDA
Sven Verdoolaege,Juan Carlos Juega,Albert Cohen,José Ignacio Gómez,Christian Tenllado,Francky Catthoor +5 more
TL;DR: A novel source-to-source compiler called PPCG is presented, which introduces a multilevel tiling strategy and a code generation scheme for the parallelization and locality optimization of imperfectly nested loops, managing memory and exposing concurrency according to the constraints of modern GPUs.
Journal ArticleDOI
Auto-vectorization of interleaved data for SIMD
Dorit Nuzman,Ira Rosen,Ayal Zaks +2 more
TL;DR: This work demonstrates an automatic compilation scheme that supports effective vectorization in the presence of interleaved data with constant strides that are powers of 2, facilitating data reorganization.
Proceedings ArticleDOI
Outer-loop vectorization: revisited for short SIMD architectures
Dorit Nuzman,Ayal Zaks +1 more
TL;DR: This paper revisit the method of outer loop vectorization, paying special attention to properties of modern short SIMD architectures, and presents an optimization tapping such opportunities, capable of further boosting the performance obtained by outer-loop vectorization to achieve average speedup factors of 5.26 and 3.64.
Proceedings ArticleDOI
Polyhedral-Model Guided Loop-Nest Auto-Vectorization
TL;DR: This paper examines the interactions between loop transformations of the polyhedral compilation framework and subsequent vectorization optimizations targeting fine-grain SIMD data-level parallelism, and demonstrates the feasibility and benefit of tuning thepolyhedral model in the context of vectorization.
Proceedings ArticleDOI
Multi-platform Auto-vectorization
Dorit Nuzman,Richard Henderson +1 more
TL;DR: This paper presents an automatic vectorizer as implemented in GCC, the most multi-targetable compiler available today, and demonstrates how the vectorization scheme is suited to a variety of SIMD architectures.