scispace - formally typeset
P

Per Stenström

Researcher at Chalmers University of Technology

Publications -  251
Citations -  8514

Per Stenström is an academic researcher from Chalmers University of Technology. The author has contributed to research in topics: Cache & Cache coherence. The author has an hindex of 43, co-authored 245 publications receiving 8193 citations. Previous affiliations of Per Stenström include Stanford University & Ericsson.

Papers
More filters
Proceedings ArticleDOI

QoS-Driven Coordinated Management of Resources to Save Energy in Multi-core Systems

TL;DR: It is shown that the proposed coordinated RMA is capable of saving, on average, 20% energy as compared to 15% for DVFS alone and 7% for cache partitioning alone, when the performance target is set to 70% of the baseline system performance.
Proceedings ArticleDOI

Performance evaluation of link-based cache coherence schemes

TL;DR: It was found that tree-based and linear-list protocols performed almost as well as full-map protocols but with a considerably lower implementation cost, however, if the sharing set is large, linear- list schemes may suffer because of the large write latency while tree- based protocols still perform well.
Journal ArticleDOI

PATer: A Hardware Prefetching Automatic Tuner on IBM POWER8 Processor

TL;DR: This paper develops a dynamic prefetching tuning scheme, named prefetch automatic tuner (PATer), which uses a prediction model based on machine learning to dynamically tune the prefetch configuration based on the values of hardware performance monitoring counters (PMCs).
Proceedings ArticleDOI

The FAB predictor: using Fourier analysis to predict the outcome of conditional branches

TL;DR: Some key properties of the FAB predictor are shown and some possible implementation approaches are presented, which could cut the misprediction rate of integer applications in the SPEC95 suite by between 5 and 50% with an average of 20%.
Journal ArticleDOI

Coordinated management of DVFS and cache partitioning under QoS constraints to save energy in multi-core systems

TL;DR: In this paper, a QoS-driven coordinated resource management algorithm (RMA) is proposed to dynamically adjust the size of the last-level cache partitions and the per-core voltage-frequency settings to save energy while respecting QoS requirements of every application.