scispace - formally typeset
P

Peng Zhao

Researcher at IBM

Publications -  10
Citations -  504

Peng Zhao is an academic researcher from IBM. The author has contributed to research in topics: SIMD & Compiler. The author has an hindex of 6, co-authored 10 publications receiving 504 citations.

Papers
More filters
Proceedings ArticleDOI

Optimizing Compiler for the CELL Processor

TL;DR: Several compiler techniques that aim at automatically generating high quality codes over a wide range of heterogeneous parallelism available on the CELL processor are described and results indicate that significant speedup can be achieved with a high level of support from the compiler.
Journal ArticleDOI

Using advanced compiler technology to exploit the performance of the Cell Broadband Engine TM architecture

TL;DR: The goal in developing this compiler has been to enhance programmability while continuing to provide high performance, and the results of the compiler techniques, including SPE optimization, automatic code generation, single source parallelization, and partitioning are presented.
Proceedings ArticleDOI

An integrated simdization framework using virtual vectors

TL;DR: This paper proposes aSimdization framework that addresses several orthogonal aspects of simdization, such as alignment handling, simdized of loops with mixed data lengths, and SIMD parallelism extraction from different program scopes (from basic blocks to inner loops).
Patent

Efficient generation of SIMD code in presence of multi-threading and other false sharing conditions and in machines having memory protection support

TL;DR: In this article, the authors present a method for automatically generating SIMD code from static analysis of data to be accessed by a targeted loop including at least one statement, where each statement has at least a memory reference, to determine if memory accesses are safe.
Patent

Method to analyze and reduce number of data reordering operations in SIMD code

TL;DR: In this paper, a method for analyzing data reordering operations in single issue multiple data source code and generating executable code therefrom is provided, where one or more virtual shuffle trees are collapsed by combining virtual shuffle operations within at least one of the virtual shuffle tree sub-structures.