Journal ArticleDOI

GPU-accelerated preconditioned iterative linear solvers

Ruipeng Li, +1 more
01 Feb 2013
Vol. 63, Iss. 2, pp. 443-466
TLDR
This work gives an overview of the authors' preliminary experience in developing a high-performance iterative linear solver accelerated by GPU coprocessors, and discusses techniques for speeding up sparse matrix-vector product (SpMV) kernels and for finding suitable preconditioning methods.
Abstract
This work is an overview of our preliminary experience in developing a high-performance iterative linear solver accelerated by GPU coprocessors. Our goal is to illustrate the advantages and difficulties encountered when deploying GPU technology to perform sparse linear algebra computations. Techniques for speeding up sparse matrix-vector product (SpMV) kernels and finding suitable preconditioning methods are discussed. Our experiments with an NVIDIA TESLA M2070 show that for unstructured matrices SpMV kernels can be up to 8 times faster on the GPU than the Intel MKL implementation on the host Intel Xeon X5675 processor. Overall, the GPU-accelerated incomplete Cholesky (IC) factorization preconditioned CG method can outperform its CPU counterpart by a smaller factor, up to 3, and the GPU-accelerated incomplete LU (ILU) factorization preconditioned GMRES method can achieve a speed-up nearing 4. However, with preconditioning techniques better suited to GPUs, this performance can be further improved.
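The SpMV kernel at the heart of such solvers is an irregular, memory-bound computation. For orientation, here is a minimal CPU-side sketch of SpMV over the common CSR format in Python with numpy (an illustration only, not the paper's CUDA implementation; the function name csr_spmv is ours): each row's dot product is independent of the others, which is exactly the parallelism a GPU kernel exploits by assigning rows, or groups of rows, to threads or warps.

    import numpy as np

    def csr_spmv(indptr, indices, data, x):
        # Computes y = A @ x for a sparse matrix A stored in CSR form.
        # Each iteration of the outer loop is independent, which is the
        # unit of work a GPU kernel distributes across threads/warps.
        n = len(indptr) - 1
        y = np.zeros(n)
        for i in range(n):
            lo, hi = indptr[i], indptr[i + 1]
            y[i] = np.dot(data[lo:hi], x[indices[lo:hi]])
        return y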


Citations
Proceedings ArticleDOI

Multicore bundle adjustment

TL;DR: This paper presents the design and implementation of new inexact Newton-type bundle adjustment algorithms that exploit hardware parallelism to solve large-scale 3D scene reconstruction problems efficiently, and shows that overcoming the severe memory and bandwidth limitations of current-generation GPUs leads not only to more space-efficient algorithms but also to surprising savings in runtime.
Proceedings ArticleDOI

CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication

TL;DR: CSR5 (Compressed Sparse Row 5), a new storage format, is proposed; it offers high-throughput SpMV across platforms including CPUs, GPUs, and Xeon Phi, and its low-overhead format conversion makes it practical for real-world applications such as solvers that run only tens of iterations.
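For context, CSR5 builds on the classic CSR layout, which stores a sparse matrix in three flat arrays. A tiny worked example of plain CSR (shown for orientation only; the CSR5 tile layout itself is more involved than this baseline):

    import numpy as np
    from scipy.sparse import csr_matrix

    # A 4x4 sparse matrix:
    # [[4, 0, 1, 0],
    #  [0, 3, 0, 0],
    #  [2, 0, 5, 1],
    #  [0, 0, 0, 6]]
    A = csr_matrix(np.array([[4, 0, 1, 0],
                             [0, 3, 0, 0],
                             [2, 0, 5, 1],
                             [0, 0, 0, 6]]))
    print(A.indptr)   # [0 2 3 6 7]      row pointers
    print(A.indices)  # [0 2 1 0 2 3 3]  column indices of nonzeros
    print(A.data)     # [4 1 3 2 5 1 6]  nonzero values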
Journal ArticleDOI

Fine-Grained Parallel Incomplete LU Factorization

TL;DR: Numerical tests show that very few sweeps are needed to construct a factorization that is an effective preconditioner; the amount of parallelism is large irrespective of the ordering of the matrix, so matrix ordering can be used to enhance the accuracy of the factorization rather than to increase parallelism.
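To make the "sweeps" idea concrete, here is a minimal dense-storage Python sketch of a Jacobi-style fixed-point iteration for ILU(0) in the spirit of this paper (an illustrative toy under our own naming, not the authors' fine-grained implementation): every entry of L and U on the sparsity pattern is updated from the previous sweep's values only, so all updates within a sweep could run in parallel.

    import numpy as np

    def ilu0_fixed_point(A, nsweeps=5):
        # Jacobi-style fixed-point sweeps for ILU(0): each (i, j) update reads
        # only the previous sweep's L and U, so updates are independent.
        n = A.shape[0]
        U = np.triu(A).astype(float)                 # initial guess: upper part of A
        L = np.eye(n) + np.tril(A, -1) / np.diag(A)  # unit lower triangular guess
        S = list(zip(*np.nonzero(A)))                # sparsity pattern of A
        for _ in range(nsweeps):
            Lnew, Unew = L.copy(), U.copy()
            for i, j in S:
                k = min(i, j)
                s = A[i, j] - L[i, :k] @ U[:k, j]
                if i > j:
                    Lnew[i, j] = s / U[j, j]         # strictly lower part: L entry
                else:
                    Unew[i, j] = s                   # upper part (incl. diagonal): U entry
            L, U = Lnew, Unew
        return L, U

On a GPU, the inner loop over the pattern S is what gets mapped to threads; a handful of sweeps typically suffices to obtain a usable preconditioner.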
Posted Content

CSR5: An Efficient Storage Format for Cross-Platform Sparse Matrix-Vector Multiplication

TL;DR: In this article, the authors propose CSR5 (Compressed Sparse Row 5), a new storage format that offers high-throughput SpMV on various platforms including CPUs, GPUs, and Xeon Phi.
Journal ArticleDOI

Sparse Matrix-Vector Multiplication on GPGPUs

TL;DR: This article provides a review of the techniques for implementing the SpMV kernel on GPGPUs that have appeared in the literature of the last few years, and discusses the issues and tradeoffs that have been encountered by the various researchers.
References
Book

Iterative Methods for Sparse Linear Systems

Yousef Saad
TL;DR: This chapter discusses methods based on the normal equations of linear algebra, building on techniques derived in previous chapters of the book.
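For reference, the normal-equations approach mentioned in this summary replaces a system Ax = b with (standard background, not a quotation from the book)

    A^T A x = A^T b,

whose coefficient matrix is symmetric positive definite whenever A has full column rank, so CG-type methods apply.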
Journal ArticleDOI

GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems

TL;DR: GMRES is an iterative method for solving nonsymmetric linear systems that minimizes, at every step, the norm of the residual vector over a Krylov subspace.
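In symbols, the minimization property reads (standard notation, not quoted from the paper): given an initial guess x_0 with residual r_0 = b - A x_0, the m-th GMRES iterate is

    x_m = x_0 + argmin_{z in K_m} || b - A(x_0 + z) ||_2,
    K_m = span{ r_0, A r_0, ..., A^{m-1} r_0 }.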
Journal ArticleDOI

An iteration method for the solution of the eigenvalue problem of linear differential and integral operators

TL;DR: In this article, a systematic method is proposed for finding the latent roots and principal axes of a matrix without reducing the order of the matrix; the method has a wide field of applicability and great accuracy, since the process of minimized iterations avoids the accumulation of rounding errors.
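The method described here underlies what is now called the Lanczos process. A minimal Python sketch of the resulting three-term recurrence for a symmetric matrix (a modern formulation offered for orientation only; the function name lanczos is ours):

    import numpy as np

    def lanczos(A, v, m):
        # m steps of the Lanczos process: builds an orthonormal Krylov basis V
        # and scalars alpha (diagonal) and beta (off-diagonal) such that
        # V.T @ A @ V is tridiagonal, without ever transforming A itself.
        n = len(v)
        V = np.zeros((n, m + 1))
        alpha, beta = np.zeros(m), np.zeros(m + 1)
        V[:, 0] = v / np.linalg.norm(v)
        for j in range(m):
            w = A @ V[:, j] - beta[j] * V[:, j - 1]  # beta[0] = 0: no-op at j = 0
            alpha[j] = w @ V[:, j]
            w -= alpha[j] * V[:, j]                  # three-term recurrence
            beta[j + 1] = np.linalg.norm(w)
            if beta[j + 1] == 0:                     # invariant subspace found
                break
            V[:, j + 1] = w / beta[j + 1]
        return V[:, :m], alpha, beta[1:m + 1]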
Journal ArticleDOI

The university of Florida sparse matrix collection

TL;DR: The University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications, is described, and a new multilevel coarsening scheme is proposed to facilitate visualizing the matrices in the collection.