
Cache invalidation

About: Cache invalidation is a research topic. Over the lifetime, 10539 publications have been published within this topic receiving 245409 citations.


Papers
Proceedings ArticleDOI
09 Jun 1992
TL;DR: In this article, a trace-driven simulation of a dynamic hierarchical file system is presented, and a reduction in server traffic of a factor of more than two for shared files compared with a flat scheme is obtained, without a large increase in client access time.
Abstract: A simple method for constructing dynamic hierarchies on a file-by-file basis is described. The results of a trace-driven simulation of a dynamic hierarchical file system are presented. A reduction in server traffic of a factor of more than two for shared files, compared with a flat scheme, is obtained without a large increase in client access time. Low-overhead techniques for maintaining cache consistency by detecting missed cache invalidations are discussed.

58 citations
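The missed-invalidation detection mentioned above can be illustrated with a minimal sketch (not code from the paper; the class names, the per-file version counter, and the `version()` metadata check are all invented assumptions): the client tags each cached copy with the server's version number and, on each open, compares it against the server's current version, so a missed invalidation shows up as a version mismatch.

```python
# Illustrative sketch of detecting a missed invalidation via version
# numbers. Server/ClientFileCache are invented names, not the paper's API.

class Server:
    def __init__(self):
        self.files = {}      # path -> data
        self.versions = {}   # path -> monotonically increasing version
        self.reads = 0       # count of full data reads served

    def write(self, path, data):
        self.files[path] = data
        self.versions[path] = self.versions.get(path, 0) + 1

    def version(self, path):
        # Cheap metadata check, much lighter than a full read.
        return self.versions.get(path, 0)

    def read(self, path):
        self.reads += 1
        return self.files[path]

class ClientFileCache:
    def __init__(self, server):
        self.server = server
        self.cache = {}      # path -> (data, version at fetch time)

    def open(self, path):
        server_version = self.server.version(path)
        entry = self.cache.get(path)
        if entry is not None and entry[1] == server_version:
            return entry[0]              # cached copy still consistent
        # Version mismatch: an invalidation was missed, so refetch.
        data = self.server.read(path)
        self.cache[path] = (data, server_version)
        return data
```

The design trade-off is that consistency is checked lazily at open time with one metadata round trip, rather than relying on every invalidation message arriving.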

Proceedings ArticleDOI
01 May 1997
TL;DR: The results show that a large dual-ported multi-cycle pipelined SRAM cache with a line buffer maximizes processor performance.
Abstract: In this paper we evaluate the performance of high bandwidth caches that employ multiple ports, multiple cycle hit times, on-chip DRAM, and a line buffer to find the organization that provides the best processor performance. Processor performance is measured in execution time using a dynamic superscalar processor running realistic benchmarks that include operating system references. The results show that a large dual-ported multi-cycle pipelined SRAM cache with a line buffer maximizes processor performance. A large pipelined cache provides both a low miss rate and a high CPU clock frequency. Dual-porting the cache and the use of a line buffer provide the bandwidth needed by a dynamic superscalar processor. In addition, the line buffer makes the pipelined dual-ported cache the best option by increasing cache port bandwidth and hiding cache latency.

58 citations

Patent
12 Jul 2007
TL;DR: In this paper, a step value and a step-interval cache coherency protocol are used to update and invalidate data stored within cache memory, where the step value may be an integer value and may be stored within a cache directory entry associated with data in the memory cache.
Abstract: According to embodiments of the invention, a step value and a step-interval cache coherency protocol may be used to update and invalidate data stored within cache memory. A step value may be an integer value and may be stored within a cache directory entry associated with data in the memory cache. Upon reception of a cache read request, in addition to the normal address comparison that determines whether the data is located within the cache, a current step value may be compared with the stored step value to determine whether the data is current. If the step values match, the data may be current and a cache hit may occur. However, if the step values do not match, the requested data may be provided from another source. Furthermore, an application may update the current step value to invalidate old data stored within the cache and associated with a different step value.

58 citations
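The step-value scheme described above can be sketched as follows. This is a minimal illustration of the idea, not code from the patent; the `StepCache` name and its methods are invented for the example.

```python
# Sketch of step-value invalidation: each entry is tagged with the step
# value current at store time, and bumping the global step value
# invalidates every older entry at once without touching them.

class StepCache:
    def __init__(self):
        self.current_step = 0   # global step value owned by the application
        self.entries = {}       # address -> (data, step value at store time)

    def store(self, address, data):
        self.entries[address] = (data, self.current_step)

    def load(self, address, fetch_from_memory):
        entry = self.entries.get(address)
        if entry is not None:
            data, stored_step = entry
            # A hit requires both an address match and a step match;
            # a step mismatch means the entry was bulk-invalidated.
            if stored_step == self.current_step:
                return data
        # Miss (or stale): fetch from another source and re-tag.
        data = fetch_from_memory(address)
        self.store(address, data)
        return data

    def invalidate_all(self):
        # Incrementing the step value logically invalidates all entries
        # tagged with an older step, in constant time.
        self.current_step += 1
```

The appeal of the scheme is that invalidation cost is independent of how many entries are stale: one integer increment retires them all.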

Patent
17 Feb 1998
TL;DR: In this paper, a cache coherency protocol has been proposed for multi-processor computer systems with clustered processing units, where a cache block containing a valid copy of a value (instruction or data) was the most recently accessed block out of a group of cache blocks in different caches that share valid copies of the value.
Abstract: A multi-processor computer system with clustered processing units uses a cache coherency protocol having a "recent" coherency state to indicate that a particular cache block containing a valid copy of a value (instruction or data) was the most recently accessed block out of a group of cache blocks in different caches (but at the same cache level) that share valid copies of the value. The "recent" state can advantageously be used to implement optimized memory operations such as intervention, by sourcing the value from the cache block in the "recent" state, as opposed to sourcing the value from system memory (RAM), which would be a slower operation. In an exemplary implementation, the hierarchy has two cache levels supporting a given processing unit cluster; the "recent" state can be applied to a plurality of caches at the first level (each associated with a different processing unit cluster), and the "recent" state can further be applied to one of the caches at the second level.

58 citations
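A toy model of the "recent" state idea above (names and structure are illustrative assumptions, not the patent's design): among caches sharing a valid copy at the same level, the most recently accessing cache is marked RECENT and sources the value on the next request, avoiding a slower read from system memory.

```python
# Toy sketch of "recent"-state intervention among peer caches.

RECENT, SHARED = "R", "S"

class CacheLevel:
    def __init__(self, n_caches):
        self.state = [None] * n_caches   # per-cache: None, SHARED, or RECENT
        self.value = None                # the shared value, if any cache holds it

    def access(self, cache_id, read_from_memory):
        if self.value is None:
            # No cached copy anywhere: fall back to slow system memory.
            self.value = read_from_memory()
        # Otherwise the RECENT holder sources the value (intervention),
        # so no memory read occurs. Demote the previous RECENT holder...
        self.state = [SHARED if s == RECENT else s for s in self.state]
        # ...and mark the requester as the new most-recent holder.
        self.state[cache_id] = RECENT
        return self.value
```

The invariant the sketch maintains is the one the abstract describes: at most one cache in the group is in the RECENT state, and it is the designated source for intervention.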

Proceedings ArticleDOI
27 Oct 2003
TL;DR: The cache pollution filters can reduce the number of ineffective prefetches by over 90%, alleviating the excessive memory bandwidth they induce, and IPC is improved by up to 9% as a result of reduced cache pollution and less competition for the limited number of cache ports.
Abstract: Aggressive hardware-based and software-based prefetch algorithms for hiding memory access latencies have been proposed to bridge the expanding speed disparity between processors and memory subsystems. As smaller L1 caches prevail in deep-submicron processor designs in order to maintain short cache access cycles, cache pollution caused by ineffective prefetches is becoming a major challenge. When overly aggressive prefetching is applied, ineffective prefetches can not only offset the benefits of benign prefetches through pollution but also throttle bus bandwidth, leading to overall performance degradation. A hardware-based cache pollution filtering mechanism is proposed to differentiate good and bad prefetches dynamically using a history table. Two schemes for triggering prefetches, per-address (PA) based and program counter (PC) based, are proposed and evaluated. Our cache pollution filters work in tandem with both hardware and software prefetchers. As shown in the analysis of our simulated results, the cache pollution filters can reduce the number of ineffective prefetches by over 90%, alleviating the excessive memory bandwidth they induce. The IPC is improved by up to 9% as a result of reduced cache pollution and less competition for the limited number of cache ports.

58 citations
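The history-table filter above can be sketched in a few lines. This is a simplified PC-indexed variant under invented assumptions (table size, saturation bounds, and the `PrefetchFilter` name are not from the paper): a saturating counter per program counter records whether prefetches issued from that PC tend to be useful, and negative history suppresses further prefetches.

```python
# Sketch of a PC-indexed prefetch pollution filter with saturating
# usefulness counters. Sizes and thresholds are illustrative choices.

class PrefetchFilter:
    def __init__(self, table_size=256):
        self.table_size = table_size
        self.counters = [0] * table_size   # saturating counters, range [-3, 3]

    def _index(self, pc):
        return pc % self.table_size

    def should_prefetch(self, pc):
        # Allow the prefetch only while this PC's history is not negative.
        return self.counters[self._index(pc)] >= 0

    def record_outcome(self, pc, was_useful):
        # Train the counter: a prefetched line that was referenced pushes
        # it up; one evicted unused (pollution) pushes it down.
        i = self._index(pc)
        delta = 1 if was_useful else -1
        self.counters[i] = max(-3, min(3, self.counters[i] + delta))
```

A per-address (PA) variant would index the table by prefetch target address instead of program counter; the training loop is otherwise the same.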


Network Information
Related Topics (5)
Cache: 59.1K papers, 976.6K citations (93% related)
Scalability: 50.9K papers, 931.6K citations (88% related)
Server: 79.5K papers, 1.4M citations (88% related)
Network packet: 159.7K papers, 2.2M citations (83% related)
Dynamic Source Routing: 32.2K papers, 695.7K citations (83% related)
Performance Metrics
No. of papers in the topic in previous years:

Year    Papers
2023    44
2022    117
2021    4
2020    8
2019    7
2018    20