Solving Lattice QCD systems of equations using mixed precision solvers on GPUs
Citations
962 citations
Cites methods from "Solving Lattice QCD systems of equa..."
...Table 3 lists some representative applications along with the runtime speedups obtained for the whole application using CPUþGPU coprocessing over CPU alone, as measured by application developers.(12-22) The speedups using GeForce 8800, Tesla T8, GeForce GTX 280, Tesla T10, and GeForce GTX 285 range from 9 to more than 130 , with the higher speedups reflecting applications where more of the work ran in parallel on the GPU....
[...]
221 citations
Cites methods from "Solving Lattice QCD systems of equa..."
...the single-particle spectrum of charmonium up to 4:5 GeV is a timely contribution to this eort. Acknowledgments We thank our colleagues within the Hadron Spectrum Collaboration. Chroma [46] and QUDA [47,48] were used to perform this work on the Lonsdale cluster maintained by the Trinity Centre for High Performance Computing funded through grants from Science Foundation Ireland (SFI), at the SFI/HEA Iris...
[...]
169 citations
146 citations
Cites methods from "Solving Lattice QCD systems of equa..."
...Part of this work used the CUDA GPU implementation of a mixed-precision iterative linear system solver for the Dirac equation by Michael Clark and Ronald Babich [19]....
[...]
109 citations
References
795 citations
"Solving Lattice QCD systems of equa..." refers background or methods in this paper
...On the GTX 280, the library in [8] could achieve up to 30 Gflops of sustained single precision performance for matrices with similar structure as the one discussed here....
[...]
...) for which sparse matrix–vector GPU libraries are available [8]....
[...]
597 citations
"Solving Lattice QCD systems of equa..." refers methods in this paper
...[19] R....
[...]
...The linear solver developed in this work has become the mainstay of our open source QUDA library [18], which we have interfaced to the common lattice QCD packages (Chroma [19, 20], CPS [21], QDP/C [22]) for easy integration with current QCD calculations....
[...]
457 citations
"Solving Lattice QCD systems of equa..." refers methods in this paper
...On first generation CUDA devices this poses a problem since double precision is not implemented, so schemes such as Kahan summation [13] are required to reduce the accumulation of errors....
[...]
140 citations
108 citations
"Solving Lattice QCD systems of equa..." refers background or methods in this paper
...The cure advocated in [5] is that of reliable updates: here a parameter δ is introduced, and if the magnitude of the iterated residual decreases by δ compared to the magnitude of all previous residuals, the iterated residual is replaced by the true residual....
[...]
...9 Here we have simplified the approach given in [5] such that we perform a reliable residual update whenever the norm of the residual decreases by a factor δ relative to the maximum of the residual since the last update....
[...]
...Residual drift and possible cures have been studied previously in different contexts [5], namely where the drift is caused by the erratic convergence of BiCGstab which induces rounding errors....
[...]
...In this work we introduce a new method for using mixed precision in the context of Krylov solvers, repurposing the reliable updates scheme of [5]....
[...]