Topic

SPMD

About: SPMD is a research topic. Over the lifetime, 928 publications have been published within this topic receiving 12882 citations. The topic is also known as: single program, multiple data.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

[...]

Rolf Rabenseifner, Georg Hager, Gabriele Jost

18 Feb 2009

TL;DR: It is shown that machine topology has a signi¿cant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future.

...read moreread less

Abstract: Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: Shared memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed memory parallelization on the node interconnect with shared memory parallelization inside each node. We describe potentials and challenges of the dominant programming models on hierarchically structured hardware: Pure MPI (Message Passing Interface), pure OpenMP (with distributed shared memory extensions) and hybrid MPI+OpenMP in several ?avors. We pinpoint cases where a hybrid programming model can indeed be the superior solution because of reduced communication needs and memory consumption, or improved load balance. Furthermore we show that machine topology has a signi?cant impact on performance for all parallelization strategies and that topology awareness should be built into all applications in the future. Finally we give an outlook on possible standardization goals and extensions that could make hybrid programming easier to do with performance in mind.

...read moreread less

446 citations

Journal Article•DOI•

Titanium: a high-performance Java dialect

[...]

Katherine Yelick¹, Katherine Yelick², Luigi Semenzato², Luigi Semenzato¹, Geoff Pike¹, Geoff Pike², Carleton Miyamoto², Carleton Miyamoto¹, Ben Liblit¹, Ben Liblit², Arvind Krishnamurthy¹, Arvind Krishnamurthy², Paul N. Hilfinger², Paul N. Hilfinger¹, Susan L. Graham¹, Susan L. Graham², Phillip Colella¹, Phillip Colella², Alex Aiken², Alex Aiken¹ - Show less +16 more•Institutions (2)

University of California, Berkeley¹, Lawrence Berkeley National Laboratory²

01 Sep 1998-Concurrency and Computation: Practice and Experience

TL;DR: This work discusses the main additions to Java are immutable classes, multidimensional arrays, an explicitly parallel SPMD model of computation with a global address space, and zone-based memory management, and reports progress on the development of Titanium.

...read moreread less

Abstract: Titanium is a language and system for high-performance parallel scientific computing. Titanium uses Java as its base, thereby leveraging the advantages of that language and allowing us to focus attention on parallel computing issues. The main additions to Java are immutable classes, multidimensional arrays, an explicitly parallel SPMD model of computation with a global address space, and zone-based memory management. We discuss these features and our design approach, and report progress on the development of Titanium, including our current driving application: a three-dimensional adaptive mesh refinement parallel Poisson solver. © 1998 John Wiley & Sons, Ltd.

...read moreread less

433 citations

Proceedings Article•DOI•

MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes

[...]

George Bosilca¹, Aurelien Bouteiller¹, Franck Cappello¹, Samir Djilali¹, Gilles Fedak¹, Cécile Germain¹, Thomas Herault¹, Pierre Lemarinier¹, Oleg Lodygensky¹, Frédéric Magniette¹, Vincent Neri¹, Anton Selikhov¹ - Show less +8 more•Institutions (1)

University of Paris¹

16 Nov 2002

TL;DR: This work presents MPICH-V, an automatic Volatility tolerant MPI environment based on uncoordinated checkpoint/roll-back and distributed message logging, and presents a detailed performance evaluation of every component and its global performance for non-trivial parallel applications.

...read moreread less

Abstract: Global Computing platforms, large scale clusters and future TeraGRID systems gather thousands of nodes for computing parallel scientific applications. At this scale, node failures or disconnections are frequent events. This Volatility reduces the MTBF of the whole system in the range of hours or minutes. We present MPICH-V, an automatic Volatility tolerant MPI environment based on uncoordinated checkpoint/roll-back and distributed message logging. MPICH-V architecture relies on Channel Memories, Checkpoint servers and theoretically proven protocols to execute existing or new, SPMD and Master-Worker MPI applications on volatile nodes. To evaluate its capabilities, we run MPICH-V within a framework for which the number of nodes, Channels Memories and Checkpoint Servers can be completely configured as well as the node Volatility. We present a detailed performance evaluation of every component of MPICH-V and its global performance for non-trivial parallel applications. Experimental results demonstrate good scalability and high tolerance to node volatility.

...read moreread less

323 citations

Journal Article•DOI•

Supporting dynamic data structures on distributed-memory machines

[...]

Anne Rogers¹, Martin C. Carlisle¹, John Reppy², Laurie Hendren³•Institutions (3)

Princeton University¹, Bell Labs², McGill University³

01 Mar 1995-ACM Transactions on Programming Languages and Systems

TL;DR: An execution model for supporting programs that use pointer-based dynamic data structures is described that uses a simple mechanism for migrating a thread of control based on the layout of heap-allocated data and introduces parallelism using a technique based on futures and lazy task creation.

...read moreread less

Abstract: Compiling for distributed-memory machines has been a very active research area in recent years. Much of this work has concentrated on programs that use arrays as their primary data structures. To date, little work has been done to address the problem of supporting programs that use pointer-based dynamic data structures. The techniques developed for supporting SPMD execution of array-based programs rely on the fact that arrays are statically defined and directly addressable. Recursive data structures do not have these properties, so new techniques must be developed. In this article, we describe an execution model for supporting programs that use pointer-based dynamic data structures. This model uses a simple mechanism for migrating a thread of control based on the layout of heap-allocated data and introduces parallelism using a technique based on futures and lazy task creation. We intend to exploit this execution model using compiler analyses and automatic parallelization techniques. We have implemented a prototype system, which we call Olden, that runs on the Intel iPSC/860 and the Thinking Machines CM-5. We discuss our implementation and report on experiments with five benchmarks.

...read moreread less

265 citations

Proceedings Article•DOI•

Communication optimization and code generation for distributed memory machines

[...]

Saman Amarasinghe, Monica S. Lam

01 Jun 1993

TL;DR: It is shown that the problems of communication code generation, local memory management, message aggregation and redundant data communication elimination can all be solved by projecting polyhedra represented by sets of inequalities onto lower dimensional spaces.

...read moreread less

Abstract: This paper presents several algorithms to solve code generation and optimization problems specific to machines with distributed address spaces. Given a description of how the computation is to be partitioned across the processors in a machine, our algorithms produce an SPMD (single program multiple data) program to be run on each processor. Our compiler generated the necessary receive and send instructions, optimizes the communication by eliminating redundant communication and aggregating small messages into large messages, allocates space locally on each processor, and translates global data addresses to local addresses.Our techniques are based on an exact data-flow analysis on individual array element accesses. Unlike data dependence analysis, this analysis determines if two dynamic instances refer to the same value, and not just to the same location. Using this information, our compiler can handle more flexible data decompositions and find more opportunities for communication optimization than systems based on data dependence analysis.Our technique is based on a uniform framework, where data decompositions, computation decompositions and the data flow information are all represented as systems of linear inequalities. We show that the problems of communication code generation, local memory management, message aggregation and redundant data communication elimination can all be solved by projecting polyhedra represented by sets of inequalities onto lower dimensional spaces.

...read moreread less

241 citations

Collapse

Network Information

Performance

Metrics

958

Papers

13,500

Citations

No. of papers in the topic in previous years
Year	Papers
2023	7
2022	24
2021	7
2020	4
2019	9
2018	20

SPMD

Papers published on a yearly basis

Papers

Trending Questions (1)

Network Information

Related Topics (5)

Performance

Metrics