A Tool Suite for Simulation Based Analysis of Memory Access Behavior

doi:10.1007/978-3-540-24688-6_58

Open AccessBook ChapterDOI

A Tool Suite for Simulation Based Analysis of Memory Access Behavior

- pp 440-447

TLDR

An execution driven cache simulator which relates event metrics to a dynamically built-up call-graph, and a graphical front end able to visualize the generated data in various ways are presented.

Abstract:

In this paper, two tools are presented: an execution driven cache simulator which relates event metrics to a dynamically built-up call-graph, and a graphical front end able to visualize the generated data in various ways. To get a general purpose, easy-to-use tool suite, the simulation approach allows us to take advantage of runtime instrumentation, i.e. no preparation of application code is needed, and enables for sophisticated preprocessing of the data already in the simulation phase. In an ongoing project, research on advanced cache analysis is based on these tools. Taking a multigrid solver as an example, we present the results obtained from the cache simulation together with real data measured by hardware performance counters.

Citations

PDF

Open Access

More filters

Journal ArticleDOI

Fast parallel image registration on CPU and GPU for diagnostic classification of Alzheimer's disease.

Denis P Shamonin, +5 more

- 01 Jan 2013 -

Frontiers in Neuroinformatics

TL;DR: The accelerated registration tool elastix is employed in a study on diagnostic classification of Alzheimer's disease and cognitively normal controls based on T1-weighted MRI and has nearly identical results to the non-optimized version.

...read moreread less

Journal ArticleDOI

DynamO : a free O(N) general event-driven molecular dynamics simulator

Marcus N. Bannerman, +2 more

- 30 Nov 2011 -

Journal of Computational Chemistry

TL;DR: DynamO is presented, a general event‐driven simulation package, which displays the optimal ${\cal O}$(N) asymptotic scaling of the computational cost with the number of particles N, rather than the standard scaling found in most standard algorithms.

...read moreread less

Proceedings ArticleDOI

State of the Art of Performance Visualization

Katherine E. Isaacs, +7 more

TL;DR: This work discusses performance as it relates to visualization and survey existing approaches in performance visualization and develops a taxonomy for the contexts in which different performance visualizations reside and describes the state of the art research pertaining to each.

...read moreread less

Journal ArticleDOI

A Cache-Aware Algorithm for PDEs on Hierarchical Data Structures Based on Space-Filling Curves

Frank Gu¨nther, +3 more

- 01 Sep 2006 -

SIAM Journal on Scientific Computing

TL;DR: Data access becomes very fast—even faster than the common access to nonhierarchical data stored in matrices—and, in particular, cache misses are reduced considerably.

...read moreread less

Proceedings ArticleDOI

Input-sensitive profiling

Emilio Coppa, +2 more

TL;DR: A building block technique and a toolkit towards automatic discovery of workload-dependent performance bottlenecks that other profilers may fail to detect and can provide useful characterizations of the workload and behavior of individual routines in the context of mainstream applications are presented.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings ArticleDOI

Gprof: A call graph execution profiler

Susan L. Graham, +2 more

TL;DR: The gprof profiler accounts for the running time of called routines in therunning time of the routines that call them, and the design and use of this profiler is described.

...read moreread less

Proceedings ArticleDOI

ATOM: a system for building customized program analysis tools

Amitabh Srivastava, +1 more

TL;DR: ATOM as mentioned in this paper is a single framework for building a wide range of customized program analysis tools, including block counting, profiling, dynamic memory recording, instruction and data cache simulation, pipeline simulation, evaluating branch prediction, and instruction scheduling.

...read moreread less

Journal ArticleDOI

The Paradyn parallel performance measurement tool

Barton P. Miller, +7 more

- 01 Nov 1995 -

IEEE Computer

TL;DR: Dynamic instrumentation lets us defer insertion until the moment it is needed (and remove it when it is no longer needed); Paradyn's Performance Consultant decides when and where to insert instrumentation.

...read moreread less

Proceedings ArticleDOI

Shade: a fast instruction-set simulator for execution profiling

Bob Cmelik, +1 more

TL;DR: A tool called Shade is described which combines efficient instruction-set simulation with a flexible, extensible trace generation capability and discusses instruction set emulation in general.

...read moreread less

Journal ArticleDOI

A Portable Programming Interface for Performance Evaluation on Modern Processors

Shirley Browne, +4 more

TL;DR: The purpose of the PAPI project is to specify a standard application programming interface for accessing hardware performance counters available on most modern microprocessors, which exist as a small set of registers that count events.

...read moreread less

A Tool Suite for Simulation Based Analysis of Memory Access Behavior

Citations

Fast parallel image registration on CPU and GPU for diagnostic classification of Alzheimer's disease.

DynamO : a free O(N) general event-driven molecular dynamics simulator

State of the Art of Performance Visualization

A Cache-Aware Algorithm for PDEs on Hierarchical Data Structures Based on Space-Filling Curves

Input-sensitive profiling

References

Gprof: A call graph execution profiler

ATOM: a system for building customized program analysis tools

The Paradyn parallel performance measurement tool

Shade: a fast instruction-set simulator for execution profiling

A Portable Programming Interface for Performance Evaluation on Modern Processors

Related Papers (5)

Valgrind: a framework for heavyweight dynamic binary instrumentation

Pin: building customized program analysis tools with dynamic instrumentation

Using Valgrind to detect undefined value errors with bit-precision

Gprof: A call graph execution profiler

Exploiting hardware performance counters with flow and context sensitive profiling