scispace - formally typeset
Open AccessProceedings ArticleDOI

Exploiting the parallelism of large-scale application-layer networks by adaptive GPU-based simulation

Reads0
Chats0
TLDR
A GPU-based simulator engine that performs all steps of large-scale network simulations on a commodity many-core GPU and adapts its configuration at runtime in order to balance parallelism and overheads to achieve high performance for a given network model and scenario is presented.
Abstract
We present a GPU-based simulator engine that performs all steps of large-scale network simulations on a commodity many-core GPU. Overhead is reduced by avoiding unnecessary data transfers between graphics memory and main memory. On the example of a widely deployed peer-to-peer network, we analyze the parallelism in large-scale application-layer networks, which suggests the use of thousands of concurrent processor cores for simulation. The proposed simulator employs the vast number of parallel cores in modern GPUs to exploit the identified parallelism and enables substantial simulation speedup. The simulator adapts its configuration at runtime in order to balance parallelism and overheads to achieve high performance for a given network model and scenario. A performance evaluation for simulations of networks comprising up to one million peers demonstrates a speedup of up to 19.5 compared with an efficient sequential implementation and shows the effectiveness of the runtime adaptation to different network conditions.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

A Survey on Agent-based Simulation Using Hardware Accelerators

TL;DR: In this paper, the authors provide an overview and categorisation of the literature according to the applied techniques for agent-based simulations on hardware accelerators, and sketch directions for future research towards automating the hardware mapping and execution.
Proceedings ArticleDOI

Time Warp on the GPU: Design and Assessment

TL;DR: This work presents the design and implementation of an optimistic fully GPU-based parallel discrete-event simulator based on the Time Warp synchronization algorithm, and shows that in most cases, the increase in parallelism when using optimistic synchronization significantly outweighs the increased overhead for state keeping and rollbacks.
Posted Content

A Survey on Agent-based Simulation using Hardware Accelerators

TL;DR: This survey targets modellers seeking an overview of suitable hardware platforms and execution techniques for a specific simulation model, as well as methodology researchers interested in potential research gaps requiring further exploration.
Proceedings ArticleDOI

Performance Evaluation of Priority Queues for Fine-Grained Parallel Tasks on GPUs

TL;DR: This work performs a performance evaluation of GPU-based priority queue implementations for two applications: discrete-event simulation and parallel A* path searches on grids and presents performance measurements covering linear queue designs, implicit binary heaps, splay trees, and a GPU-specific proposal from the literature.
Proceedings ArticleDOI

Transitioning Spiking Neural Network Simulators to Heterogeneous Hardware

TL;DR: This paper proposes a transition approach for CPU-based SNN simulators to enable the execution on heterogeneous hardware with only limited modifications to an existing simulator code base, and without changes to model code.
References
More filters
Proceedings ArticleDOI

Large Scale Distributed Simulation of p2p Networks

TL;DR: This paper presents a tool for executing large scale simulation of p2p systems which scale effectively, only limited by the amount of computational resource available (memory and CPU) through the application of parallel discrete event simulation techniques to an existing, already scalable simulator, peersim.
Proceedings ArticleDOI

GPU accelerated three-stage execution model for event-parallel simulation

TL;DR: This paper introduces the concept of event-parallel discrete event simulation (DES) and its corresponding implementation on the GPU platform and a three-stage execution model composing of generating events, sorting events and processing events in parallel that achieves good speedup.
Proceedings ArticleDOI

Towards a Basic DHT Service: Analyzing Network Characteristics of a Widely Deployed DHT

TL;DR: Whether a basic DHT service could suit the needs of DHT- based applications in terms of stability, number of participating peers, the peers'' session lengths, geographical distribution and peer connectivity when deployed similar to DHTs driven by file sharing is evaluated.
Proceedings ArticleDOI

An expansion-aided synchronous conservative time management algorithm on GPU

TL;DR: An expansion-aided synchronous conservative time management algorithm that helps achieve stable performance under different conditions by finding a balance between low parallelism and unnecessary expansions is proposed.
Related Papers (5)