Topic

Smart Cache

About: Smart Cache is a research topic. Over the lifetime, 7,680 publications have been published within this topic, receiving 180,618 citations.


Papers
Patent
15 Dec 1993
TL;DR: In this paper, a split level cache memory system for a data processor includes a single chip integer unit, an array processor such as a floating point unit, an external main memory and a split level cache.
Abstract: A split level cache memory system for a data processor includes a single chip integer unit, an array processor such as a floating point unit, an external main memory and a split level cache. The split level cache includes an on-chip, fast local cache with low latency for use by the integer unit for loads and stores of integer and address data and an off-chip, pipelined global cache for storing arrays of data such as floating point data for use by the array processor and integer and address data for refilling the local cache. Coherence between the local cache and global cache is maintained by writing through to the global cache during integer stores. Local cache words are invalidated when data is written to the global cache during an array processor store.

52 citations
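The coherence policy described in the abstract above can be illustrated with a short sketch. This is a simplified reading of the patent, assuming single-word accesses and dictionary-backed caches; the class and method names (SplitLevelCache, integer_store, array_store) are illustrative, not taken from the patent.

```python
# Illustrative sketch of the split-level cache coherence policy described above.
# Main memory is not modelled; all names are hypothetical, not from the patent.

class SplitLevelCache:
    def __init__(self):
        self.local = {}    # on-chip local cache: address -> word (integer/address data)
        self.global_ = {}  # off-chip global cache: address -> word (arrays + refill data)

    def integer_store(self, addr, word):
        # Integer-unit store: update the local cache and write through to the
        # global cache so the two levels stay coherent.
        self.local[addr] = word
        self.global_[addr] = word

    def array_store(self, addr, word):
        # Array-processor store: data goes to the global cache; any stale copy
        # of the word in the local cache is invalidated.
        self.global_[addr] = word
        self.local.pop(addr, None)

    def integer_load(self, addr):
        # Loads hit the fast local cache first and refill it from the global
        # cache on a miss (a real system would fall back to main memory).
        if addr not in self.local:
            self.local[addr] = self.global_.get(addr)
        return self.local[addr]
```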

Proceedings ArticleDOI
01 Sep 2015
TL;DR: The theory shows that the problem of partition-sharing is reducible to the problem of partitioning, and the technique uses dynamic programming to optimize partitioning for overall miss ratio and for two different kinds of fairness.
Abstract: When a cache is shared by multiple cores, its space may be allocated by sharing, partitioning, or both. We call the last case partition-sharing. This paper studies partition-sharing as a general solution and presents a theory and a technique for optimizing it. The theory shows that the problem of partition-sharing is reducible to the problem of partitioning. The technique uses dynamic programming to optimize partitioning for overall miss ratio and for two different kinds of fairness. Finally, the paper evaluates the effect of optimal cache sharing and compares it with conventional solutions for thousands of 4-program co-run groups, with nearly 180 million different ways to share the cache in each co-run group. Optimal partition-sharing is on average 26% better than free-for-all sharing and 98% better than equal partitioning. We also demonstrate the trade-off between optimal partitioning and fair partitioning.

52 citations
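The dynamic-programming step the abstract refers to can be sketched as follows. This is a minimal illustration, assuming each program's miss count is already known for every possible number of allocated ways (e.g., from miss-ratio-curve profiling) and that miss curves are non-increasing; the function and variable names are hypothetical, not the paper's code.

```python
# Sketch of dynamic-programming cache partitioning: allocate all cache ways
# among co-running programs so that total misses are minimized.
# misses[p][w] = misses of program p when given w ways (assumed known in advance).

def optimal_partition(misses, total_ways):
    num_progs = len(misses)
    INF = float("inf")
    # best[p][w]: minimal total misses using the first p programs and exactly w ways.
    best = [[INF] * (total_ways + 1) for _ in range(num_progs + 1)]
    choice = [[0] * (total_ways + 1) for _ in range(num_progs + 1)]
    best[0][0] = 0
    for p in range(1, num_progs + 1):
        for w in range(total_ways + 1):
            for give in range(w + 1):
                cand = best[p - 1][w - give] + misses[p - 1][give]
                if cand < best[p][w]:
                    best[p][w] = cand
                    choice[p][w] = give
    # Walk the choice table backwards to recover the per-program allocation.
    alloc, w = [], total_ways
    for p in range(num_progs, 0, -1):
        alloc.append(choice[p][w])
        w -= choice[p][w]
    return best[num_progs][total_ways], list(reversed(alloc))

# Example: two programs sharing 4 ways; the second benefits more from extra ways.
total, split = optimal_partition(
    [[100, 80, 70, 65, 63], [100, 60, 35, 20, 10]], total_ways=4)
# total == 100 and split == [1, 3]: one way to the first program, three to the second.
```

The same table-filling structure can be rerun with a fairness objective (e.g., minimizing the worst per-program slowdown) by changing the quantity being minimized, which is why optimizing for overall miss ratio and for fairness fit the same framework.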

Proceedings Article
22 Feb 2016
TL;DR: The paper presents a rigorous analysis of the algorithms to prove that they do not waste valuable cache space and that they are efficient in time and space usage, and shows that they can be extended to support the combination of compression and deduplication for flash caching and improve its performance and endurance.
Abstract: Flash caching has emerged as a promising solution to the scalability problems of storage systems by using fast flash memory devices as the cache for slower primary storage. But its adoption faces serious obstacles due to the limited capacity and endurance of flash devices. This paper presents CacheDedup, a solution that addresses these limitations using in-line deduplication. First, it proposes a novel architecture that integrates the caching of data and deduplication metadata (source addresses and fingerprints of the data) and efficiently manages these two components. Second, it proposes duplication-aware cache replacement algorithms (D-LRU and D-ARC) to optimize both cache performance and endurance. The paper presents a rigorous analysis of the algorithms to prove that they do not waste valuable cache space and that they are efficient in time and space usage. The paper also includes an experimental evaluation using real-world traces, which confirms that CacheDedup substantially improves I/O performance (up to 20% reduction in miss ratio and 51% in latency) and flash endurance (up to 89% reduction in writes sent to the cache device) compared to traditional cache management. It also shows that the proposed architecture and algorithms can be extended to support the combination of compression and deduplication for flash caching and improve its performance and endurance.

52 citations
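A rough sketch of the duplication-aware LRU (D-LRU) idea follows, assuming a metadata cache keyed by source address and a data cache keyed by content fingerprint so that duplicate blocks are stored only once. This is a simplified reading of the design, not the CacheDedup implementation, and all names are illustrative.

```python
from collections import OrderedDict
import hashlib

# Simplified sketch of duplication-aware LRU: the metadata cache maps source
# addresses to content fingerprints, and the data cache stores one copy per
# fingerprint, so duplicate blocks do not occupy extra flash space or cause
# extra flash writes. Names and structure are illustrative only.

class DLRUCache:
    def __init__(self, meta_capacity, data_capacity):
        self.meta = OrderedDict()  # source address -> fingerprint (LRU order)
        self.data = OrderedDict()  # fingerprint -> block contents (LRU order)
        self.meta_capacity = meta_capacity
        self.data_capacity = data_capacity

    def put(self, addr, block):
        # block: bytes. Duplicate content maps to an existing fingerprint and
        # is not written again.
        fp = hashlib.sha1(block).hexdigest()
        self.meta[addr] = fp
        self.meta.move_to_end(addr)
        if fp not in self.data:
            self.data[fp] = block
        self.data.move_to_end(fp)
        while len(self.meta) > self.meta_capacity:
            self.meta.popitem(last=False)  # evict least-recently-used address
        while len(self.data) > self.data_capacity:
            self.data.popitem(last=False)  # evict least-recently-used block

    def get(self, addr):
        fp = self.meta.get(addr)
        if fp is None or fp not in self.data:
            return None                    # cache miss
        self.meta.move_to_end(addr)
        self.data.move_to_end(fp)
        return self.data[fp]
```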

Proceedings Article
26 Mar 2001
TL;DR: The extent of the age penalty for content served by content delivery networks and large caches is measured, and it is demonstrated that the penalty can be significant and, moreover, can vary widely under different practices.
Abstract: Web content caching is recognized as an effective mechanism to decrease server load, network traffic, and user-perceived latency. An HTTP-compliant cache associates with each cached object an expiration time calculated according to directives set by the object's origin server. The cache incurs a miss when it has no cached copy of a requested object or when the existing copy has expired (is not fresh). Upon a miss, the cache needs to fetch or validate a copy through exchanges with another cache holding a fresh copy or with the origin server. Thus, misses generate traffic and prolong service times. Caches are deployed as proxies, reverse proxies, and in hierarchies, and as a result, caches often serve other caches. As this happens, content age at higher-level caches, in addition to availability and freshness, emerges as a performance factor. The age of a cached copy of an object is the elapsed time since it was fetched from the respective origin. Fresh cached copies of the same object can have different ages, and older copies typically expire sooner. Therefore, a proxy cache would suffer a higher miss rate if it receives older objects (e.g., from a reverse-proxy cache). Similarly, reverse-proxy caches that serve proxy caches receive more requests than an origin server would have received. We refer to the increase in miss rate due to age as the age penalty. We use trace-based simulations to measure the extent of the age penalty for content served by content delivery networks and large caches. Even though the age penalty had not been considered previously, we demonstrate that it can be significant and, moreover, can vary widely under different practices.

52 citations
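The freshness logic behind the age penalty can be made concrete with a small sketch: a copy is fresh only while its age (time elapsed since it left the origin) is below its freshness lifetime, so a copy that arrives already aged from an upstream cache expires sooner at the downstream cache. The field names below are illustrative, not taken from the paper.

```python
import time

# Minimal sketch of the freshness check behind the "age penalty": a copy that
# arrives already aged (e.g. from a reverse proxy) has less remaining lifetime,
# so the downstream cache starts missing on it sooner.

class CachedCopy:
    def __init__(self, fetched_at, initial_age, freshness_lifetime):
        self.fetched_at = fetched_at                  # when this cache received the copy
        self.initial_age = initial_age                # age the copy already had on arrival
        self.freshness_lifetime = freshness_lifetime  # derived from origin directives

    def age(self, now=None):
        now = time.time() if now is None else now
        return self.initial_age + (now - self.fetched_at)

    def is_fresh(self, now=None):
        # An expired copy must be re-fetched or revalidated, i.e. it counts as a miss.
        return self.age(now) < self.freshness_lifetime

# Two copies of the same object with identical lifetimes but different arrival ages:
# the pre-aged copy expires 300 seconds earlier at the downstream cache.
t0 = 0.0
direct = CachedCopy(fetched_at=t0, initial_age=0, freshness_lifetime=600)
via_proxy = CachedCopy(fetched_at=t0, initial_age=300, freshness_lifetime=600)
assert direct.is_fresh(now=t0 + 400) and not via_proxy.is_fresh(now=t0 + 400)
```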

Proceedings ArticleDOI
01 Aug 2011
TL;DR: An adaptive hybrid cache is proposed to dynamically remap SPM blocks from high-demand cache sets to low-demand cache sets, achieving 19%, 25%, 18% and 18% energy-runtime-product reductions over four previous representative techniques on a wide range of benchmarks.
Abstract: By reconfiguring part of the cache as software-managed scratchpad memory (SPM), hybrid caches manage to handle both unknown and predictable memory access patterns. However, existing hybrid caches provide a flexible partitioning of cache and SPM without considering adaptation to the run-time cache behavior. Previous cache set balancing techniques are either energy-inefficient or require serial tag and data array access. In this paper, an adaptive hybrid cache is proposed to dynamically remap SPM blocks from high-demand cache sets to low-demand cache sets. This achieves 19%, 25%, 18% and 18% energy-runtime-product reductions over four previous representative techniques on a wide range of benchmarks.

52 citations
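The remapping idea can be sketched with a simple software model that uses per-set access counters to decide which sets are high-demand and moves SPM blocks out of them into lightly used sets. The counter-based policy and the names below are assumptions made for illustration, not the paper's hardware mechanism.

```python
# Illustrative sketch of SPM-block remapping in a hybrid cache: blocks locked
# down as scratchpad are moved out of heavily accessed cache sets and into
# lightly accessed ones, restoring associativity where demand is highest.

def remap_spm_blocks(set_access_counts, spm_blocks_per_set, max_moves=1):
    """set_access_counts[i]: recent accesses to cache set i.
    spm_blocks_per_set[i]: SPM blocks currently mapped into set i (mutated).
    Returns a list of (from_set, to_set) moves."""
    # Sets ordered from coldest to hottest by their recent access counts.
    order = sorted(range(len(set_access_counts)), key=lambda i: set_access_counts[i])
    moves = []
    lo, hi = 0, len(order) - 1
    while len(moves) < max_moves and lo < hi:
        cold, hot = order[lo], order[hi]
        if spm_blocks_per_set[hot] == 0:
            hi -= 1                      # hottest remaining set holds no SPM block
            continue
        spm_blocks_per_set[hot] -= 1     # free a way in the high-demand set
        spm_blocks_per_set[cold] += 1    # park the SPM block in a low-demand set
        moves.append((hot, cold))
    return moves
```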


Network Information
Related Topics (5)
Cache: 59.1K papers, 976.6K citations, 92% related
Server: 79.5K papers, 1.4M citations, 88% related
Scalability: 50.9K papers, 931.6K citations, 88% related
Network packet: 159.7K papers, 2.2M citations, 85% related
Quality of service: 77.1K papers, 996.6K citations, 84% related
Performance Metrics
No. of papers in the topic in previous years

Year    Papers
2023    50
2022    114
2021    5
2020    1
2019    8
2018    18