Xianyi Zhang

Researcher at Chinese Academy of Sciences

Publications -  6
Citations -  242

Xianyi Zhang is an academic researcher from the Chinese Academy of Sciences. The author has contributed to research in topics: Xeon & x86. The author has an h-index of 3 and has co-authored 6 publications receiving 211 citations.

Papers
Proceedings Article

AUGEM: automatically generate high performance dense linear algebra kernels on x86 CPUs

TL;DR: A template-based optimization framework, AUGEM, is presented, which can automatically generate fully optimized assembly code for several dense linear algebra kernels, such as GEMM, GEMV, AXPY and DOT, on various multi-core CPUs without requiring any manual intervention from developers.
Proceedings Article

Optimizing SpMV for Diagonal Sparse Matrices on GPU

TL;DR: This paper proposes a new storage format for diagonal sparse matrices, defined as Compressed Row Segment with Diagonal-pattern (CRSD), and designs diagonal patterns to represent the diagonal distribution in CRSD.
Book Chapter

CRSD: application specific auto-tuning of SpMV for diagonal sparse matrices

TL;DR: A new storage format for diagonal sparse matrices, Compressed Row Segment with Diagonal-pattern (CRSD), is proposed; it achieves a speedup of up to 2.60x over CSR with the same number of threads on two mainstream multi-core platforms.
Journal Article

Memory Efficient Two-Pass 3D FFT Algorithm for Intel® Xeon Phi™ Coprocessor

TL;DR: This paper proposes a two-pass 3D FFT algorithm that aims mainly to reduce the amount of explicit data transfer between memory and the on-chip cache. Evaluated on the Intel® Xeon Phi™ coprocessor 7110P, the algorithm achieves a maximum performance of 136 Gflops with 240 threads in offload mode.
Proceedings Article

QuantWiz: A Parallel Software Package for LC-MS-based Label-Free Protein Quantification

TL;DR: This paper describes the framework design and prototype development of QuantWiz, a new domestic parallel software package for high performance liquid chromatography (LC)-MS-based label-free protein quantification, and implements a parallelized version.