P
Per Stenström
Researcher at Chalmers University of Technology
Publications - 251
Citations - 8514
Per Stenström is an academic researcher from Chalmers University of Technology. The author has contributed to research in topics: Cache & Cache coherence. The author has an hindex of 43, co-authored 245 publications receiving 8193 citations. Previous affiliations of Per Stenström include Stanford University & Ericsson.
Papers
More filters
Journal ArticleDOI
Improving power efficiency of D-NUCA caches
Alessandro Bardine,Pierfrancesco Foglia,Giacomo Gabrielli,Cosimo Antonio Prete,Per Stenström +4 more
TL;DR: This paper proposes a new D-NUCA structure, called Way Adaptable D- NUCA cache, in which the number of active ways is dynamically adapted to the need of the running application, and shows how such a behavior can be utilized to improve D-nUCA power efficiency as well as to decrease its access latencies.
Multicore computing--the state of the art
Karl-Filip Faxén,Christer Bengtsson,Mats Brorsson,Håkan Grahn,Erik Hagersten,Bengt Jönsson,Christoph Kessler,Björn Lisper,Per Stenström,Bertil Svensson +9 more
TL;DR: This document presents the current state of the art in multicore computing, in hardware and software, as well as ongoing activities, especially in Sweden.
Book ChapterDOI
A Flexible Code Compression Scheme Using Partitioned Look-Up Tables
TL;DR: A code compression scheme that allows the compiler to select what subset of a wide instruction set to use in each program phase at the granularity of basic blocks based on a profiling methodology is presented.
Patent
System and method for coherence prediction
TL;DR: In this paper, a coherence prediction mechanism includes a synchronization manager and a plurality of access predictors, each of which maintains one or more sequence entries indicating a sequence in which a corresponding data block is accessed by two or more processing elements of a multiprocessor system.
Proceedings Article
Visualising Sharing Behaviour in relation to Shared Memory Management
Mats Brorsson,Per Stenström +1 more
TL;DR: This paper presents a methodology to capture and visu-alise the sharing behaviour of a parallel program with respect to the choice coherence mechanisms, and identifies four conceptual workload parameters: Spatial granularity, Degree of sharing, Access mode, and the Temporal Granu-larity.