D
Derek R. Hower
Researcher at Qualcomm
Publications - 34
Citations - 4677
Derek R. Hower is an academic researcher from Qualcomm. The author has contributed to research in topics: Cache & Shared memory. The author has an hindex of 12, co-authored 34 publications receiving 3985 citations. Previous affiliations of Derek R. Hower include Advanced Micro Devices & Duke University.
Papers
More filters
Journal ArticleDOI
The gem5 simulator
Nathan Binkert,Bradford M. Beckmann,Gabriel Black,Steven K. Reinhardt,Ali G. Saidi,Arkaprava Basu,Joel Hestness,Derek R. Hower,Tushar Krishna,Somayeh Sardashti,Rathijit Sen,Korey Sewell,Muhammad Shoaib,Nilay Vaish,Mark D. Hill,Darien Wood +15 more
TL;DR: The high level of collaboration on the gem5 project, combined with the previous success of the component parts and a liberal BSD-like license, make gem5 a valuable full-system simulation tool.
Journal ArticleDOI
Rerun: Exploiting Episodes for Lightweight Memory Race Recording
Derek R. Hower,Mark D. Hill +1 more
TL;DR: A memory race recording mechanism, called Rerun, that uses small hardware state, writes a small race log, and operates well as the number of cores per system scales (e.g., to 16 cores).
Proceedings ArticleDOI
Heterogeneous-race-free memory models
Derek R. Hower,Blake A. Hechtman,Bradford M. Beckmann,Benedict R. Gaster,Mark D. Hill,Steven K. Reinhardt,Darien Wood +6 more
TL;DR: A new class of memory consistency models that add scoped synchronization to data-race-free models like those of C++ and Java, and quanti-tatively shows that HRF-indirect encourages forward-looking programs with irregular parallelism by showing up to a 10% performance increase in a task runtime for GPUs.
Proceedings ArticleDOI
QuickRelease: A throughput-oriented approach to release consistency on GPUs
Blake A. Hechtman,Shuai Che,Derek R. Hower,Yingying Tian,Bradford M. Beckmann,Mark D. Hill,Steven K. Reinhardt,Darien Wood +7 more
TL;DR: QuickRelease (QR), which improves on conventional GPU memory systems in two ways, uses a FIFO to enforce the partial order of writes so that synchronization operations can complete without frequent cache flushes, and provides a throughput-oriented solution to provide fine-grain synchronization on GPUs.
Proceedings ArticleDOI
Calvin: Deterministic or not? Free will to choose
TL;DR: This work develops a Calvin-MIST implementation that uses an unordered coalescing write cache, multiple-write coherence protocol, and delayed (timebomb) invalidations while maintaining TSO compatibility, and demonstrates the flexibility of a memory consistency model weaker than sequential consistency.