Performance and power evaluation of an in-line accelerator

doi:10.1145/1787275.1787293

Proceedings ArticleDOI

Performance and power evaluation of an in-line accelerator

Alejandro Rico, +5 more

- pp 81-82

Chats0

TLDR

In this paper, a processor-attached in-line accelerator provides high-performance SIMD computing and power efficiency by means of a very large register file and a set of vector multimedia extensions based on IBM's PowerPC VMX.

Abstract:

In this paper we evaluate the performance and power of a processor-attached in-line accelerator. The accelerator provides high-performance SIMD computing and power efficiency by means of a very large register file and a set of vector multimedia extensions based on IBM's PowerPC VMX. Our experiments show significant performance improvements and power reduction, compared to a baseline vector execution unit, mainly due to the drastic decrease of memory accesses caused by the software-managed locality of the very large register file. Total execution time is, on average, reduced by 61%, while consuming 55% less energy.

Citations

PDF

Open Access

More filters

Proceedings ArticleDOI

Architectural perspectives of future wireless base stations based on the IBM PowerEN™ processor

Augusto Vega, +6 more

TL;DR: The applicability and potential benefits of the IBM PowerEN processor in the realm of base stations for the 3G and 4G standards are studied, and the in-line universal accelerator and the PIR strategy focusing on two specific applications for base stations are evaluated.

...read moreread less

Dissertation

Performance and power optimizations in chip multiprocessors for throughput-aware computation

Augusto Javier Vega

TL;DR: This thesis presents innovations to improve bandwidth and power consumption in chip multiprocessors (CMPs) for throughput-aware computation: a bandwidth-optimized last-level cache (LLC), an bandwidth- Optimized vector register file, and a power/performance-aware thread placement heuristic.

...read moreread less

Dissertation

Raising the level of abstraction : simulation of large chip multiprocessors running multithreaded applications

Alejandro Rico Carro

TL;DR: This thesis proposes a simulation methodology that employs a trace-driven simulator together with a runtime sytem that allows the proper simulation of multithreaded applications by reproducing the timing-dependent dynamic behavior at simulation time.

...read moreread less

Raising the Level of Abstraction: Simulation of Large Chip

Multiprocessors Running

References

PDF

Open Access

More filters

Proceedings ArticleDOI

A High-Performance SIMD Floating Point Unit for BlueGene/L: Architecture, Compilation, and Algorithm Design

Leonardo Bachega, +10 more

TL;DR: Preliminary performance data shows that the algorithm-compiler-hardware combination delivers a significant fraction of peak floating-point performance for compute-bound kernels such as matrix multiplication, and delivery of peak memory bandwidth for memory-bound kernel such as daxpy, while being largely insensitive to data alignment.

...read moreread less

Proceedings ArticleDOI

VICTORIA: VMX indirect compute technology oriented towards in-line acceleration

Jeff H. Derby, +2 more

TL;DR: The VICTORIA PowerPC architecture is described, which is based on the iVMX accelerator technology, which extends the existing VMX architecture with indirect register addressing and opens the door for highly optimized vector algorithms that can sustain very high processing rates.

...read moreread less

Evaluation of power consumption at execution of multiple automatically parallelized and power controlled media applications on the RP2 low-power multicore

Hiroki Mikami, +7 more

Performance and power evaluation of an in-line accelerator

Citations

Architectural perspectives of future wireless base stations based on the IBM PowerEN™ processor

Performance and power optimizations in chip multiprocessors for throughput-aware computation

Raising the level of abstraction : simulation of large chip multiprocessors running multithreaded applications

Raising the Level of Abstraction: Simulation of Large Chip

References

A High-Performance SIMD Floating Point Unit for BlueGene/L: Architecture, Compilation, and Algorithm Design

VICTORIA: VMX indirect compute technology oriented towards in-line acceleration

Related Papers (5)

Power optimizations for transport triggered SIMD processors

Data Compression Accelerator on IBM POWER9 and z15 Processors : Industrial Product

Design of the PowerPC 604e microprocessor

Evaluation of power consumption at execution of multiple automatically parallelized and power controlled media applications on the RP2 low-power multicore

IBM POWER8 processor core microarchitecture