N
Nathan R. Tallent
Researcher at Pacific Northwest National Laboratory
Publications - 58
Citations - 1961
Nathan R. Tallent is an academic researcher from Pacific Northwest National Laboratory. The author has contributed to research in topics: Computer science & Performance tuning. The author has an hindex of 19, co-authored 52 publications receiving 1742 citations. Previous affiliations of Nathan R. Tallent include Rice University.
Papers
More filters
Journal ArticleDOI
HPCTOOLKIT: tools for performance analysis of optimized parallel programs
Laksono Adhianto,S. Banerjee,Michael Fagan,Mark W. Krentel,Gabriel Marin,John Mellor-Crummey,Nathan R. Tallent +6 more
TL;DR: An overview of HPCTOOLKIT is provided and its utility for performance analysis of parallel applications is illustrated.
Journal ArticleDOI
OpenAD/F: A Modular Open-Source Tool for Automatic Differentiation of Fortran Codes
Jean Utke,Uwe Naumann,Mike Fagan,Nathan R. Tallent,Michelle Mills Strout,Patrick Heimbach,Chris Hill,Carl Wunsch +7 more
TL;DR: The Open/ADF tool allows the evaluation of derivatives of functions defined by a Fortran program, and supports various code reversal schemes with hierarchical checkpointing at the subroutine level for the generation of adjoint codes.
Proceedings ArticleDOI
Effective performance measurement and analysis of multithreaded applications
TL;DR: This paper describes how to measure and attribute parallel idleness, namely, where threads are stalled and unable to work, and arbitrary performance metrics for high-level multithreaded programming models, such as Cilk.
Journal ArticleDOI
HPCVIEW: A Tool for Top-down Analysis of Node Performance
TL;DR: It is argued that HPCVIEW addresses many of the issues that have limited the usability and the utility of most existing performance tools.
Journal ArticleDOI
Evaluating Modern GPU Interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
TL;DR: A thorough evaluation on five latest types of modern GPU interconnects from six high-end servers and HPC platforms shows that, for an application running in a multi-GPU node, choosing the right GPU combination can impose considerable impact on GPU communication efficiency, as well as the application's overall performance.