scispace - formally typeset
Open AccessJournal ArticleDOI

A block-asynchronous relaxation method for graphics processing units

TLDR
This paper develops asynchronous iteration algorithms in CUDA and compares them with parallel implementations of synchronous relaxation methods on CPU- or GPU-based systems and identifies the high potential of the asynchronous methods for Exascale computing.
About
This article is published in Journal of Parallel and Distributed Computing.The article was published on 2013-12-01 and is currently open access. It has received 28 citations till now. The article focuses on the topics: Asynchronous communication & CUDA.

read more

Citations
More filters
Book ChapterDOI

GPU-accelerated asynchronous error correction for mixed precision iterative refinement

TL;DR: A mixed precision iterative refinement algorithm is derived using a block-asynchronous iteration as an error correction solver, and its performance is compared with a pure implementation of a blocksynchronous iteration and an iteratives refinement method using double precision for the error Correction solver.
Book ChapterDOI

Domain Overlap for Iterative Sparse Triangular Solves on GPUs

TL;DR: It is demonstrated that moderate directed overlap with subdomain scheduling can improve convergence and time-to-solution for sparse triangular factors from incomplete factorizations.
Book ChapterDOI

Weighted block-asynchronous iteration on GPU-Accelerated systems

TL;DR: This paper introduces different weighting techniques similar to those applied in block-smoothers for multigrid methods and observes that using weights may accelerate the convergence rate of block-asynchronous iteration considerably.
Journal ArticleDOI

GPU implementation of Jacobi Method and Gauss-Seidel Method for Data Arrays that Exceed GPU-dedicated Memory Size

TL;DR: A predictive mathematical model for computation duration is proposed that allows to find optimal algorithm parameters and computation experiment results have shown the model to be accurate enough to predict optimal algorithms parameters.
Journal ArticleDOI

Reducing the effect of global synchronization in delayed gradient methods for symmetric linear systems

TL;DR: This paper explores the cyclic formulations of lagged gradient methods and s-dimensional methods for reducing global synchronizations and concludes that these formulations perform better than traditional methods in view of both iteration count and computing time.
References
More filters
Book

Iterative Methods for Sparse Linear Systems

Yousef Saad
TL;DR: This chapter discusses methods related to the normal equations of linear algebra, and some of the techniques used in this chapter were derived from previous chapters of this book.
Journal ArticleDOI

GMRES: a generalized minimal residual algorithm for solving nonsymmetric linear systems

TL;DR: An iterative method for solving linear systems, which has the property of minimizing at every step the norm of the residual vector over a Krylov subspace.
Book

Iterative Methods for Linear and Nonlinear Equations

C. T. Kelley
TL;DR: Preface How to Get the Software How to get the Software Part I.
Journal ArticleDOI

Asynchronous Iterative Methods for Multiprocessors

TL;DR: A class of asynchronous iterative methods is presented for solving a system of equations corresponding to a parallel implementation on a multiprocessor system with no synchronization between cooperating processes to show clearly the advantage of purely asynchronous Iterative methods.