GoldRush: resource efficient in situ scientific data analytics using fine-grained interference aware execution

doi:10.1145/2503210.2503279

Proceedings Article

Fang Zheng, +7 more

- pp 78

Abstract:

Severe I/O bottlenecks on High End Computing platforms call for running data analytics in situ. Demonstrating that there exist considerable resources in compute nodes unused by typical high-end scientific simulations, we leverage this fact by creating an agile runtime, termed GoldRush, that can harvest those otherwise wasted, idle resources to efficiently run in situ data analytics. GoldRush uses fine-grained scheduling to "steal" idle resources in ways that minimize interference between the simulation and in situ analytics. This involves recognizing the potential causes of on-node resource contention and then using scheduling methods that prevent them. Experiments with representative science applications at large scales show that resources harvested on compute nodes can be leveraged to perform useful analytics, significantly improving resource efficiency, reducing data movement costs incurred by alternate solutions, and posing negligible impact on scientific simulations.

Citations

Fast parallel algorithms for short-range molecular dynamics

Steven J. Plimpton

TL;DR: Comparing the results to the fastest reported vectorized Cray Y-MP and C90 algorithm shows that the current generation of parallel machines is competitive with conventional vector supercomputers even for small problems.

Proceedings Article

In-situ feature extraction of large scale combustion simulations using segmented merge trees

Aaditya G. Landge, +6 more

TL;DR: This paper presents two variants of in-situ feature extraction techniques using segmented merge trees, which encode a wide range of threshold based features, and presents a fast, low communication cost technique that generates an exact solution but has limited scalability.

Proceedings Article

Performance analysis, design considerations, and applications of extreme-scale in situ infrastructures

Utkarsh Ayachit, +20 more

TL;DR: This paper examines several key design and performance issues related to the idea of in situ processing at extreme scale on modern platforms: scalability, overhead, performance measurement and analysis, comparison and contrast with a traditional post hoc approach, and interfacing with simulation codes.

Proceedings Article

Smart: a MapReduce-like framework for in-situ scientific analytics

Yi Wang, +3 more

TL;DR: A novel MapReduce-like framework that supports efficient in-situ scientific analytics, and is designed so that analytics can be launched from the parallel code region of a simulation program.

Proceedings Article

Achieving Performance Isolation with Lightweight Co-Kernels

Jiannan Ouyang, +3 more

TL;DR: Pisces is presented, a system software architecture that enables the co-existence of multiple independent and fully isolated OS/Rs, or enclaves, that can be customized to address the disparate requirements of next generation HPC workloads.

References

Journal Article

Fast parallel algorithms for short-range molecular dynamics

Steven J. Plimpton

01 Mar 1995 - Journal of Computational Physics

TL;DR: In this article, three parallel algorithms for classical molecular dynamics are presented, which can be implemented on any distributed-memory parallel machine which allows for message-passing of data between independently executing processors.

Proceedings Article

Condor - a hunter of idle workstations

M. Litzkow, +2 more

TL;DR: The design, implementation, and performance of the Condor scheduling system, which operates in a workstation environment, are presented, along with a performance profile of the system based on data accumulated from 23 stations during one month.

Journal Article

Addressing shared resource contention in multicore processors via scheduling

Sergey Zhuravlev, +2 more

TL;DR: This study is the first to provide a comprehensive analysis of contention-mitigating techniques that use only scheduling, and finds a classification scheme that addresses not only contention for cache space, but contention for other shared resources, such as the memory controller, memory bus and prefetching hardware.

Proceedings Article

Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

Rolf Rabenseifner, +2 more

TL;DR: It is shown that machine topology has a significant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future.
