Supporting GPU sharing in cloud environments with a transparent runtime consolidation framework

A framework is presented that enables applications executing within virtual machines to transparently share one or more GPUs. Even when contention is high, the consolidation algorithm is effective in improving throughput, and the runtime overhead of the framework is low.

Abstract:

Driven by the emergence of GPUs as a major player in high performance computing and the rapidly growing popularity of cloud environments, GPU instances are now being offered by cloud providers. The use of GPUs in a cloud environment, however, is still at an early stage, and the challenge of making the GPU a true shared resource in the cloud has not yet been addressed. This paper presents a framework to enable applications executing within virtual machines to transparently share one or more GPUs. Our contributions are twofold: we extend an open source GPU virtualization software to include efficient GPU sharing, and we propose solutions to the conceptual problem of GPU kernel consolidation. In particular, we introduce a method for computing the affinity score between two or more kernels, which provides an indication of potential performance improvements upon kernel consolidation. In addition, we explore molding as a means to achieve efficient GPU sharing even for kernels with high or conflicting resource requirements. We use these concepts to develop an algorithm to efficiently map a set of kernels on a pair of GPUs. We extensively evaluate our framework using eight popular GPU kernels and two Fermi GPUs. We find that even when contention is high our consolidation algorithm is effective in improving the throughput, and that the runtime overhead of our framework is low.
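The abstract does not give the affinity formula or the mapping algorithm, so the following is only a minimal illustrative sketch of the general idea, not the paper's method. It assumes each kernel can be summarized by two hypothetical numbers (the fractions of GPU compute and memory bandwidth it uses when run alone), uses a made-up affinity score that drops as the combined demand oversubscribes the device, and places kernels greedily on two GPUs. The names `Kernel`, `affinity`, and `map_to_gpus` are invented for this example, and molding is not modeled.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Kernel:
    name: str
    compute: float    # assumed fraction of GPU compute used when run alone (0..1)
    bandwidth: float  # assumed fraction of memory bandwidth used when run alone (0..1)


def affinity(kernels: Sequence[Kernel]) -> float:
    """Hypothetical score: 1.0 if combined demand fits on one GPU, lower otherwise."""
    compute = sum(k.compute for k in kernels)
    bandwidth = sum(k.bandwidth for k in kernels)
    oversubscription = max(0.0, compute - 1.0) + max(0.0, bandwidth - 1.0)
    return 1.0 / (1.0 + oversubscription)


def map_to_gpus(kernels: Sequence[Kernel], num_gpus: int = 2) -> List[List[Kernel]]:
    """Greedy placement: heaviest kernels first, each on the GPU where the
    resulting group has the highest affinity (ties go to the emptier GPU)."""
    groups: List[List[Kernel]] = [[] for _ in range(num_gpus)]
    by_demand = sorted(kernels, key=lambda k: k.compute + k.bandwidth, reverse=True)
    for kernel in by_demand:
        best = max(
            range(num_gpus),
            key=lambda g: (affinity(groups[g] + [kernel]), -len(groups[g])),
        )
        groups[best].append(kernel)
    return groups


# Hypothetical workload: two compute-heavy and two bandwidth-heavy kernels.
workload = [
    Kernel("A", compute=0.9, bandwidth=0.2),
    Kernel("B", compute=0.2, bandwidth=0.9),
    Kernel("C", compute=0.8, bandwidth=0.1),
    Kernel("D", compute=0.1, bandwidth=0.7),
]
placement = map_to_gpus(workload)
for gpu_id, group in enumerate(placement):
    print(gpu_id, [k.name for k in group], round(affinity(group), 3))
```

Under these assumptions the greedy pass tends to pair a compute-heavy kernel with a bandwidth-heavy one, which is the intuition behind consolidating kernels with complementary resource needs.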

Citations

Book Chapter

Confidentiality issues on a GPU in a virtualized environment

Clémentine Maurice, Christoph Neumann, Olivier Heen, Aurélien Francillon, +3 more (Institut Eurécom)

TL;DR: The objective is to highlight possible information leakage due to GPUs in virtualized and cloud computing environments, and to provide insight into the different GPU virtualization techniques, along with their security implications.

Proceedings Article

Constructing and characterizing covert channels on GPGPUs

Hoda Naghibijouybari, Khaled N. Khasawneh, Nael Abu-Ghazaleh, +2 more (University of California, Riverside)

TL;DR: A first study of covert channel attacks on GPGPUs is presented, obtaining error-free bandwidth of over 4 Mbps, making it the fastest known microarchitectural covert channel under realistic conditions.

Journal Article

Parallel map projection of vector-based big spatial data: Coupling cloud computing with graphics processing units

Wenwu Tang, Wenpeng Feng, +1 more (University of North Carolina at Charlotte)

- 01 Jan 2017 -

Computers, Environment and Urban Systems

TL;DR: The parallel map projection framework presented in this study is based on a layered architecture that couples capabilities of cloud computing and high-performance computing accelerated by Graphics Processing Units and provides considerable acceleration for re-projecting vector-based big spatial data.

Proceedings Article

Interference-driven resource management for GPU-based heterogeneous clusters

Rajat Phull, Cheng-Hong Li, Kunal Rao, Hari Cadambi, Srimat Chakradhar, +4 more (Princeton University)

TL;DR: A framework to predict and handle interference when two or more jobs time-share GPUs in HPC clusters is presented, which consists of an analysis model, and a dynamic interference detection and response mechanism to detect excessive interference and restart the interfering jobs on different nodes.

Journal Article

Multimedia Processing Pricing Strategy in GPU-Accelerated Cloud Computing

He Li, Kaoru Ota, Mianxiong Dong, Athanasios V. Vasilakos, Koji Nagano, +4 more (Muroran Institute of Technology)

- 01 Oct 2020 -

IEEE Transactions on Cloud Computing

TL;DR: This paper proposes an optimal pricing strategy for GPU-accelerated multimedia processing services that maximizes the profits of both the cloud provider and users, and finds the optimal solutions of both the cloud provider's and users' profit functions.

Accelerator: using data parallelism to program GPUs for general-purpose uses

David Tarditi, Sidd Puri, Jose M. Oglesby, +2 more (Microsoft)

TL;DR: This work describes Accelerator, a system that uses data parallelism to program GPUs for general-purpose uses instead of C, and compares the performance of Accelerator versions of the benchmarks against hand-written pixel shaders.

IEEE Transactions on Computers

References

The cost of doing science on the cloud: the Montage example

Qilin: exploiting parallelism on heterogeneous multiprocessors with adaptive mapping

Automated control of multiple virtualized resources

Cost-benefit analysis of Cloud Computing versus desktop grids

Accelerator: using data parallelism to program GPUs for general-purpose uses

Related Papers (5)

GViM: GPU-accelerated virtual machines

A GPGPU transparent virtualization component for high performance computing clouds

rCUDA: Reducing the number of GPU-based accelerators in high performance clusters

Rodinia: A benchmark suite for heterogeneous computing

vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines