Managing performance vs. accuracy trade-offs with loop perforation

doi:10.1145/2025113.2025133

Open AccessProceedings ArticleDOI

Managing performance vs. accuracy trade-offs with loop perforation

Stelios Sidiroglou-Douskos, +3 more

- pp 124-134

Chats0

TLDR

The results indicate that, for a range of applications, this approach typically delivers performance increases of over a factor of two (and up to a factors of seven) while changing the result that the application produces by less than 10%.

Abstract:

Many modern computations (such as video and audio encoders, Monte Carlo simulations, and machine learning algorithms) are designed to trade off accuracy in return for increased performance. To date, such computations typically use ad-hoc, domain-specific techniques developed specifically for the computation at hand. Loop perforation provides a general technique to trade accuracy for performance by transforming loops to execute a subset of their iterations. A criticality testing phase filters out critical loops (whose perforation produces unacceptable behavior) to identify tunable loops (whose perforation produces more efficient and still acceptably accurate computations). A perforation space exploration algorithm perforates combinations of tunable loops to find Pareto-optimal perforation policies. Our results indicate that, for a range of applications, this approach typically delivers performance increases of over a factor of two (and up to a factor of seven) while changing the result that the application produces by less than 10%.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Approximating Beyond the Processor: Exploring Full-System Energy-Accuracy Tradeoffs in a Smart Camera System

Arnab Raha, +1 more

- 02 Nov 2018 -

IEEE Transactions on Very Large Scale In...

TL;DR: A systematic methodology to perform joint approximations across different subsystems, leading to significant energy benefits compared to approximating individual subsystems in isolation is proposed.

...read moreread less

Proceedings ArticleDOI

Coupling proofs are probabilistic product programs

Gilles Barthe, +3 more

- 12 Jul 2016 -

arXiv: Programming Languages

TL;DR: An extension of pRHL is defined, called x-pRHL, which explicitly constructs the coupling in a pRH derivation in the form of a probabilistic product program that simulates two correlated runs of the original program.

...read moreread less

Proceedings ArticleDOI

Crayon: saving power through shape and color approximation on next-generation displays

Phillip Stanley-Marbell, +2 more

TL;DR: The results show that Crayon's color transforms can reduce display power dissipation by over 66% while producing images that remain visually acceptable to users, and the measured whole-system power reduction is approximately 50%.

...read moreread less

Proceedings Article

MEANTIME: achieving both minimal energy and timeliness with approximate computing

Anne F. Farrell, +1 more

TL;DR: This paper proposes MEANTIME: a runtime system that delivers hard latency guarantees and energy-minimal resource usage through small accuracy reductions and finds that MEantIME never violates real-time deadlines and sacrifices a small amount of accuracy while reducing energy to 54% of a conservative, full accuracy approach.

...read moreread less

Proceedings ArticleDOI

Conditionally correct superoptimization

Rahul Sharma, +3 more

TL;DR: This work combines abstract interpretation, decision procedures, and testing to yield a verification strategy that yields a superoptimizer for x86 that in the experiments produces binaries that are often multiple times faster than those produced by production compilers.

...read moreread less

Collapse

References

PDF

Open Access

More filters

UCI Machine Learning Repository

A. Asuncion

Journal ArticleDOI

A K-Means Clustering Algorithm

J. A. Hartigan, +1 more

- 01 Mar 1979 -

Journal of The Royal Statistical Society...

Proceedings ArticleDOI

LLVM: a compilation framework for lifelong program analysis & transformation

Chris Lattner, +1 more

TL;DR: The design of the LLVM representation and compiler framework is evaluated in three ways: the size and effectiveness of the representation, including the type information it provides; compiler performance for several interprocedural problems; and illustrative examples of the benefits LLVM provides for several challenging compiler problems.

...read moreread less

Journal ArticleDOI

The JPEG still picture compression standard

Gregory K. Wallace

- 01 Apr 1991 -

Communications of The ACM

TL;DR: The Baseline method has been by far the most widely implemented JPEG method to date, and is sufficient in its own right for a large number of applications.

...read moreread less

Proceedings ArticleDOI

The PARSEC benchmark suite: characterization and architectural implications

Christian Bienia, +3 more

TL;DR: This paper presents and characterizes the Princeton Application Repository for Shared-Memory Computers (PARSEC), a benchmark suite for studies of Chip-Multiprocessors (CMPs), and shows that the benchmark suite covers a wide spectrum of working sets, locality, data sharing, synchronization and off-chip traffic.

...read moreread less

Collapse

Managing performance vs. accuracy trade-offs with loop perforation

Citations

Approximating Beyond the Processor: Exploring Full-System Energy-Accuracy Tradeoffs in a Smart Camera System

Coupling proofs are probabilistic product programs

Crayon: saving power through shape and color approximation on next-generation displays

MEANTIME: achieving both minimal energy and timeliness with approximate computing

Conditionally correct superoptimization

References

UCI Machine Learning Repository

A K-Means Clustering Algorithm

LLVM: a compilation framework for lifelong program analysis & transformation

The JPEG still picture compression standard

The PARSEC benchmark suite: characterization and architectural implications

Related Papers (5)

EnerJ: approximate data types for safe and general low-power computation

Green: a framework for supporting energy-conscious programming using controlled approximation

Neural Acceleration for General-Purpose Approximate Programs

Architecture support for disciplined approximate programming

Dynamic knobs for responsive power-aware computing