Topic

Cache pollution

About: Cache pollution is a research topic. Over its lifetime, 11,353 publications have been published within this topic, receiving 262,139 citations.


Papers
Proceedings ArticleDOI
22 Jan 1995
TL;DR: This paper characterizes in detail the locality patterns of the operating system code and shows that there is substantial locality, and proposes an algorithm to expose these localities and reduce interference.
Abstract: High instruction cache hit rates are key to high performance. One known technique to improve the hit rate of caches is to use an optimizing compiler to minimize cache interference via an improved layout of the code. This technique, however, has been applied to application code only, even though there is evidence that the operating system often uses the cache heavily and with less uniform patterns than applications. Therefore, it is unknown how well existing optimizations perform for systems code and whether better optimizations can be found. We address this problem in this paper. This paper characterizes in detail the locality patterns of the operating system code and shows that there is substantial locality. Unfortunately, caches are not able to extract much of it: rarely-executed special-case code disrupts spatial locality, loops with few iterations that call routines make loop locality hard to exploit, and plenty of loop-less code hampers temporal locality. As a result, interference within popular execution paths dominates instruction cache misses. Based on our observations, we propose an algorithm to expose these localities and reduce interference. For a range of cache sizes, associativities, line sizes, and other organizations we show that we reduce total instruction miss rates by 31-86% (up to 2.9 absolute points). Using a simple model, this corresponds to execution time reductions on the order of 12-26%. In addition, our optimized operating system combines well with optimized or unoptimized applications.

63 citations
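
To illustrate the kind of compiler layout optimization this line of work builds on, here is a minimal Python sketch of greedy, profile-guided basic-block chaining in the spirit of classic Pettis-Hansen chain merging. The paper's own algorithm differs in its specifics, and the profile data and names below are hypothetical:

    # Hypothetical profile: (source block, destination block) -> execution count.
    def layout_blocks(edges):
        # Each basic block starts in its own chain.
        blocks = {b for edge in edges for b in edge}
        chain_of = {b: [b] for b in blocks}
        # Visit control-flow edges from hottest to coldest.
        for (src, dst), _count in sorted(edges.items(), key=lambda kv: -kv[1]):
            c1, c2 = chain_of[src], chain_of[dst]
            # Merge only if src ends one chain and dst begins another, so the
            # hot edge becomes a fall-through between adjacent blocks in memory.
            if c1 is not c2 and c1[-1] == src and c2[0] == dst:
                c1.extend(c2)
                for b in c2:
                    chain_of[b] = c1
        # Emit each distinct chain once: hot paths are laid out contiguously,
        # while rarely-executed special-case code ends up in its own chains.
        seen, order = set(), []
        for b in blocks:
            chain = chain_of[b]
            if id(chain) not in seen:
                seen.add(id(chain))
                order.extend(chain)
        return order

    profile = {("A", "B"): 1000, ("B", "C"): 990, ("A", "error_path"): 3}
    print(layout_blocks(profile))  # hot path A -> B -> C laid out contiguously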

Patent
25 Jul 2011
TL;DR: In this patent, the controller's main memory is not blocked by a complete address mapping table covering the entire memory device; instead, the table is stored in the memory device itself, and only selected portions of the address mapping information are buffered in a read cache (311) and a write cache (312). This enables an address mapping entry to be evicted from the read cache without updating the flash memory page that stores that entry in the flash memory device.
Abstract: The present idea provides high read and write performance from/to a solid state memory device. The main memory (31) of the controller (1) is not blocked by a complete address mapping table covering the entire memory device (2). Instead, such a table is stored in the memory device (2) itself, and only selected portions of the address mapping information are buffered in the main memory (31), in a read cache (311) and a write cache (312). Separating the read cache (311) from the write cache (312) enables an address mapping entry to be evicted from the read cache (311) without updating the related flash memory page that stores the entry in the flash memory device (2). By this design, the read cache (311) may advantageously be stored in DRAM even without power-down protection, while the write cache (312) may preferably be implemented in nonvolatile or other fail-safe memory. This leads to a reduction in the overall provisioning of nonvolatile or fail-safe memory and to improved scalability and performance.

63 citations
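
The split between a freely evictable read cache and a write-back write cache can be sketched in a few lines of Python. This is a minimal illustration under assumed LRU eviction; the class and method names (MappingCache, lookup, update) are hypothetical and not taken from the patent:

    from collections import OrderedDict

    class MappingCache:
        def __init__(self, read_slots, write_slots, flash_table):
            self.read = OrderedDict()   # clean logical->physical entries, LRU order
            self.write = OrderedDict()  # dirty entries; assumed to live in
                                        # nonvolatile or otherwise fail-safe memory
            self.read_slots, self.write_slots = read_slots, write_slots
            self.flash = flash_table    # authoritative mapping table in flash

        def lookup(self, lpn):
            for cache in (self.write, self.read):
                if lpn in cache:
                    cache.move_to_end(lpn)     # refresh LRU position
                    return cache[lpn]
            ppn = self.flash[lpn]              # simulated flash page read
            self.read[lpn] = ppn
            if len(self.read) > self.read_slots:
                self.read.popitem(last=False)  # clean entry: drop it, no flash write
            return ppn

        def update(self, lpn, new_ppn):
            self.read.pop(lpn, None)           # the cached entry is no longer clean
            self.write[lpn] = new_ppn
            self.write.move_to_end(lpn)
            if len(self.write) > self.write_slots:
                victim, ppn = self.write.popitem(last=False)
                self.flash[victim] = ppn       # dirty entry: write back to flash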

Proceedings ArticleDOI
08 Aug 2005
TL;DR: In this article, the authors use a commercially common unified second level, a seemingly minor difference that expands the configuration space from 500 to about 20,000; additive way tuning over this space yields 62% energy savings and 35% performance improvements over a non-configurable cache.
Abstract: Tuning a configurable cache subsystem to an application can greatly reduce memory hierarchy energy consumption. Previous tuning methods use a level-one configurable cache only, or a second level with separate instruction and data configurable caches. The authors instead use a commercially common unified second level, a seemingly minor difference that expands the configuration space from 500 to about 20,000 configurations. For this large space they developed additive way tuning, which yields 62% energy savings and 35% performance improvements over a non-configurable cache, greatly outperforming an extension of a previous method.

63 citations
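
The size of the configuration space is what makes a heuristic necessary. The toy tuner below is not the paper's additive way tuning; it is a hypothetical one-parameter-at-a-time greedy search with a placeholder cost model, shown only to contrast a handful of heuristic evaluations against the full product space. In a real tuner, energy() would come from simulating the application on each candidate configuration:

    from itertools import product

    # Hypothetical parameter ranges for a private L1 and a unified L2.
    L1_SIZE, L1_WAYS, L1_LINE = [2048, 4096, 8192], [1, 2, 4], [16, 32, 64]
    L2_SIZE, L2_WAYS, L2_LINE = [16384, 32768, 65536], [4, 8, 16], [32, 64, 128]
    DIMENSIONS = [L1_SIZE, L1_WAYS, L1_LINE, L2_SIZE, L2_WAYS, L2_LINE]

    def energy(cfg):
        # Placeholder cost model: bigger structures cost more per access,
        # smaller ones pay more for misses. A real tuner simulates a trace.
        l1s, l1w, _l1l, l2s, l2w, _l2l = cfg
        return l1s * l1w * 0.01 + l2s * l2w * 0.001 + 1e9 / (l1s * l2s)

    def greedy_tune():
        cfg = [dim[0] for dim in DIMENSIONS]  # start from the smallest config
        for i, options in enumerate(DIMENSIONS):
            # Fix all other parameters and pick the best value for this one.
            cfg[i] = min(options, key=lambda v: energy(cfg[:i] + [v] + cfg[i+1:]))
        return cfg

    exhaustive = len(list(product(*DIMENSIONS)))
    heuristic = sum(len(dim) for dim in DIMENSIONS)
    print(f"{exhaustive} exhaustive candidates vs {heuristic} greedy evaluations")
    print(greedy_tune())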

Patent
31 Oct 2006
TL;DR: In this patent, a cache is provided for operatively coupling a processor with a main memory; the cache includes a cache memory and a cache controller operatively coupled with the cache memory.
Abstract: A cache is provided for operatively coupling a processor with a main memory. The cache includes a cache memory and a cache controller operatively coupled with the cache memory. The cache controller is configured to receive memory requests to be satisfied by the cache memory or the main memory. In addition, the cache controller is configured to process cache activity information to cause at least one of the memory requests to bypass the cache memory.

63 citations
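
One concrete way a controller could act on cache activity information is a per-block reuse counter: blocks that have not shown reuse are served from main memory without being allocated, so streaming data does not pollute the cache. The threshold policy below is a hypothetical illustration, not the mechanism claimed in the patent:

    class BypassingCache:
        def __init__(self, num_sets, reuse_threshold=2):
            self.tags = [None] * num_sets   # direct-mapped: one tag per set
            self.reuse = {}                 # activity information per block address
            self.threshold = reuse_threshold

        def access(self, addr):
            idx = addr % len(self.tags)
            self.reuse[addr] = self.reuse.get(addr, 0) + 1
            if self.tags[idx] == addr:
                return "hit"
            # Miss: allocate only if the block has shown enough reuse;
            # otherwise the request bypasses the cache memory entirely.
            if self.reuse[addr] >= self.threshold:
                self.tags[idx] = addr
                return "miss, allocated"
            return "miss, bypassed"

    cache = BypassingCache(num_sets=4)
    for addr in [0, 8, 0, 0, 8]:            # 0 and 8 conflict in set 0
        print(addr, "->", cache.access(addr))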

Proceedings ArticleDOI
01 Aug 1997
TL;DR: In this paper, the authors note and analyze a key trade-off: as the complexity of caches increases (higher set-associativity, larger block size and larger overall size), the power consumed by a cache access increases.
Abstract: In this paper, we note and analyze a key trade-off: as the complexity of caches increases (higher set-associativity, larger block size, and larger overall size), the power consumed by a cache access increases. However, because the hit rate also increases, the number of main memory accesses decreases, and thus the power consumed by main memory accesses decreases. Recent papers that consider the power consumption of caches tend to ignore hit rates. This is unfortunate, because it is undesirable to have energy-efficient caches that are also very slow. Hit rates also play a key role in truly evaluating the energy efficiency of a cache, because low hit rates lead to more frequent main memory accesses, which consume more power than cache accesses.

63 citations
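
The trade-off reduces to a simple first-order energy model: every access pays the cache's per-access energy, and misses additionally pay the much larger main-memory energy. A back-of-the-envelope calculation with made-up numbers shows how a larger cache that costs twice as much per access can still win on total energy by cutting memory traffic:

    def total_energy_nj(accesses, miss_rate, e_cache_nj, e_mem_nj):
        # Every access probes the cache; only misses go out to main memory.
        return accesses * e_cache_nj + accesses * miss_rate * e_mem_nj

    N = 1_000_000
    small = total_energy_nj(N, miss_rate=0.10, e_cache_nj=0.5, e_mem_nj=20.0)
    large = total_energy_nj(N, miss_rate=0.02, e_cache_nj=1.0, e_mem_nj=20.0)
    print(f"small cache: {small / 1e6:.1f} mJ, large cache: {large / 1e6:.1f} mJ")
    # small cache: 2.5 mJ, large cache: 1.4 mJ -- judging by per-access
    # energy alone (ignoring hit rates) would have favored the small cache.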


Network Information
Related Topics (5)

Topic            Papers   Citations   Related
Cache            59.1K    976.6K      93%
Compiler         26.3K    578.5K      89%
Scalability      50.9K    931.6K      87%
Server           79.5K    1.4M        86%
Static routing   25.7K    576.7K      84%
Performance Metrics
No. of papers in the topic in previous years

Year   Papers
2023   42
2022   110
2021   12
2020   20
2019   15
2018   30