scispace - formally typeset
P

Per Stenström

Researcher at Chalmers University of Technology

Publications -  251
Citations -  8514

Per Stenström is an academic researcher from Chalmers University of Technology. The author has contributed to research in topics: Cache & Cache coherence. The author has an hindex of 43, co-authored 245 publications receiving 8193 citations. Previous affiliations of Per Stenström include Stanford University & Ericsson.

Papers
More filters
Proceedings ArticleDOI

Eager Meets Lazy: The Impact of Write-Buffering on Hardware Transactional Memory

TL;DR: The insights, related to the interplay between buffering mechanisms, system policies and workload characteristics, contained in this paper clearly distinguish gains in performance to be had from write-buffering from those that can be ascribed to HTM policy.
Proceedings ArticleDOI

Reducing misspeculation overhead for module-level speculative execution

TL;DR: History-based prediction is used in an attempt to prevent speculative threads from being spawned when they are expected to cause misspeculations and it is found that the overhead can be reduced with a factor of six on average compared to indiscriminate speculation.
Proceedings ArticleDOI

Timing-anomaly free dynamic scheduling of task-based parallel applications

TL;DR: A new timing-anomaly-free dynamic scheduling algorithm, called the Out-of-(priority)-order Lazy (O-Lazy) that offers safe and tighter estimation of the makespan of parallel applications.
Proceedings ArticleDOI

Classification and Elimination of Conflicts in Hardware Transactional Memory Systems

TL;DR: This paper analyzes the sources of performance losses in hardware transactional memory and investigates techniques to reduce the losses and proposes the 5C cache-miss classification model that extends the well-established 4C model with a new class of cache misses known as contamination misses.
Journal ArticleDOI

Eager Beats Lazy: Improving Store Management in Eager Hardware Transactional Memory

TL;DR: This work presents two simple ways to improve handling of speculative stores--a way to effectively manage lines that exhibit migratory sharing and a way to hide store latency, particularly for those stores that target contended cache lines owned by other concurrent transactions.