Implicitly parallel programming models for thousand-core microprocessors

doi:10.1145/1278480.1278669

Proceedings ArticleDOI

Implicitly parallel programming models for thousand-core microprocessors

- pp 754-759

TLDR

It is argued that implicitly parallel programming models are critical for addressing the software development crises and software scalability challenges for many-core microprocessors.

Abstract:

This paper argues for an implicitly parallel programming model for many-core microprocessors, and provides initial technical approaches towards this goal. In an implicitly parallel programming model, programmers maximize algorithm- level parallelism, express their parallel algorithms by asserting high-level properties on top of a traditional sequential programming language, and rely on parallelizing compilers and hardware support to perform parallel execution under the hood. In such a model, compilers and related tools require much more advanced program analysis capabilities and programmer assertions than what are currently available so that a comprehensive understanding of the input program's concurrency can be derived. Such an understanding is then used to drive automatic or interactive parallel code generation tools for a diverse set of parallel hardware organizations. The chip-level architecture and hardware should maintain parallel execution state in such a way that a strictly sequential execution state can always be derived for the purpose of verifying and debugging the program. We argue that implicitly parallel programming models are critical for addressing the software development crises and software scalability challenges for many-core microprocessors.

Implicitly parallel programming models for thousand-core microprocessors

Citations

A performance study of general-purpose applications on graphics processors using CUDA

DMP: deterministic shared memory multiprocessing

Extending Amdahl's Law for Energy-Efficient Computing in the Many-Core Era

CHIPPER: A low-complexity bufferless deflection router

Auto-generation and auto-tuning of 3D stencil codes on GPU clusters

References

Cramming More Components Onto Integrated Circuits

Cramming More Components onto Integrated Circuits

Parallel Computer Architecture: A Hardware/Software Approach

StreamIt: A Language for Streaming Applications

The spec# programming system: an overview

Related Papers (5)

LLVM: a compilation framework for lifelong program analysis & transformation

The Landscape of Parallel Computing Research: A View from Berkeley

Multiscalar processors

Optimizing Compilers for Modern Architectures: A Dependence-based Approach

Pin: building customized program analysis tools with dynamic instrumentation