Book Chapter

Delivering Faster Results Through Parallelisation and GPU Acceleration

TL;DR: This paper presents methods for parallelising two pieces of scientific software, leveraging multiple GPUs to achieve speedups of up to thirty times.
Abstract
The rate of scientific discovery depends on the speed at which accurate results and analysis can be obtained. The use of parallel co-processors such as Graphics Processing Units (GPUs) is becoming ever more important in meeting this demand as improvements in serial data processing speed become increasingly difficult to sustain. However, parallel data processing requires more complex programming than serial processing. Here we present our methods for parallelising two pieces of scientific software, leveraging multiple GPUs to achieve a speedup of up to thirty times.
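
The abstract's point that parallel data processing demands more programming effort than serial processing can be illustrated with a minimal, hypothetical CUDA sketch (not the chapter's actual software): a one-line serial loop becomes a kernel plus explicit device memory management and a launch configuration. Spreading the work across multiple GPUs would add further partitioning and transfer logic on top of this.

#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical illustration only, not the software described in the chapter.
// Serial version: for (int i = 0; i < n; ++i) x[i] *= a;
// GPU version: each thread scales one element of the array.
__global__ void scale(float *x, float a, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n)
        x[i] *= a;
}

int main(void)
{
    const int n = 1 << 20;
    size_t bytes = n * sizeof(float);

    float *h = (float *)malloc(bytes);
    for (int i = 0; i < n; ++i)
        h[i] = 1.0f;

    float *d;
    cudaMalloc((void **)&d, bytes);                     // allocate on the GPU
    cudaMemcpy(d, h, bytes, cudaMemcpyHostToDevice);    // copy input to the GPU

    int threads = 256;
    int blocks = (n + threads - 1) / threads;
    scale<<<blocks, threads>>>(d, 2.0f, n);             // parallel launch
    cudaDeviceSynchronize();

    cudaMemcpy(h, d, bytes, cudaMemcpyDeviceToHost);    // copy result back
    printf("h[0] = %.1f (expected 2.0)\n", h[0]);

    cudaFree(d);
    free(h);
    return 0;
}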


Citations
Proceedings Article

IMTeract Tool for Monitoring and Profiling HPC Systems and Applications

TL;DR: IMTeract was used for energy-usage profiling of HPC clusters running FLUENT and DL-POLY software and of a GPU cluster running different implementations of an FFT algorithm. The experimental results are encouraging and suggest that the IMTeract tool can measure the CPU, memory, disk I/O and network I/O of an application or process and report on the energy used.
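
As a rough, hypothetical illustration of the kind of per-process accounting such a profiler aggregates (this is not IMTeract's interface; its network and energy figures need other data sources such as OS counters or power meters), POSIX getrusage exposes CPU time, peak memory and block I/O counters for the current process:

#include <stdio.h>
#include <sys/resource.h>

/* Hypothetical sketch: per-process CPU, memory and block-I/O accounting.
   Not the IMTeract tool or its API. */
int main(void)
{
    struct rusage ru;

    /* ... run the workload to be profiled here ... */

    if (getrusage(RUSAGE_SELF, &ru) != 0) {
        perror("getrusage");
        return 1;
    }
    printf("user CPU   : %ld.%06ld s\n", (long)ru.ru_utime.tv_sec, (long)ru.ru_utime.tv_usec);
    printf("system CPU : %ld.%06ld s\n", (long)ru.ru_stime.tv_sec, (long)ru.ru_stime.tv_usec);
    printf("peak RSS   : %ld kB\n", ru.ru_maxrss);
    printf("block reads/writes: %ld / %ld\n", ru.ru_inblock, ru.ru_oublock);
    return 0;
}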
References
Journal Article

NVIDIA Tesla: A Unified Graphics and Computing Architecture

TL;DR: To enable flexible, programmable graphics and high-performance computing, NVIDIA has developed the Tesla scalable unified graphics and parallel computing architecture, which is massively multithreaded and programmable in C or via graphics APIs.
Journal Article

The GPU Computing Era

TL;DR: This paper describes the rapid evolution of GPU architectures from graphics processors to massively parallel many-core multiprocessors, recent developments in GPU computing architectures, and how the enthusiastic adoption of CPU+GPU co-processing is accelerating parallel applications.
Journal Article

Speedup versus efficiency in parallel systems

TL;DR: The tradeoff between speedup and efficiency that is inherent to a software system is investigated, along with the extent to which this tradeoff is determined by the average parallelism of the software system, as contrasted with other, more detailed characterizations; it is shown that for any software system and any number of processors, the sum of the average processor utilization and the attained fraction of the maximum possible speedup must exceed one.
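
The bound quoted in this TL;DR can be written compactly. A sketch in standard notation, assuming n processors, speedup S(n), efficiency E(n) = S(n)/n, and average parallelism A (the maximum possible speedup):

\[
E(n) + \frac{S(n)}{A} \ge 1
\]

For example, if a program with average parallelism A = 8 attains a speedup of S(16) = 6 on 16 processors, then E(16) = 6/16 = 0.375 and S(16)/A = 0.75, and their sum of 1.125 indeed exceeds one.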
Proceedings Article

Accelerator: using data parallelism to program GPUs for general-purpose uses

TL;DR: This work describes Accelerator, a system that uses data parallelism to program GPUs for general-purpose uses, and compares the performance of Accelerator versions of the benchmarks against hand-written pixel shaders.