Proceedings Article•DOI•

A quantitative analysis of cache policies for scalable network file systems

01 May 1994 - Vol. 22, Iss: 1, pp 150-160
TL;DR: In this paper, the authors investigated the quantitative performance effect of moving as many of the server responsibilities as possible to client workstations to reduce the need for high-performance server machines.
Abstract: Current network file system protocols rely heavily on a central server to coordinate file activity among client workstations. This central server can become a bottleneck that limits scalability for environments with large numbers of clients. In central server systems such as NFS and AFS, all client writes, cache misses, and coherence messages are handled by the server. To keep up with this workload, expensive server machines are needed, configured with high-performance CPUs, memory systems, and I/O channels. Since the server stores all data, it must be physically capable of connecting to many disks. This reliance on a central server also makes current systems inappropriate for wide area network use, where the network bandwidth to the server may be limited. In this paper, we investigate the quantitative performance effect of moving as many of the server responsibilities as possible to client workstations to reduce the need for high-performance server machines. We have devised a cache protocol in which all data reside on clients and all data transfers proceed directly from client to client. The server is used only to coordinate these data transfers. This protocol is being incorporated as part of our experimental file system, xFS. We present results from a trace-driven simulation study of the protocol using traces from a 237-client NFS installation. We find that the xFS protocol reduces server load by more than a factor of six compared to AFS without significantly affecting response time or file availability.
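As a concrete illustration of the coordination idea in the abstract, the sketch below keeps all data in client caches and uses the server only to answer "which client holds this block", so transfers go client to client. The class and method names are illustrative assumptions for this sketch, not the actual xFS protocol.

```python
# Minimal sketch of the coordination idea described above: the server keeps
# only metadata (which client currently caches each block) and redirects read
# requests so that data moves client-to-client. Names are illustrative; this
# is not the real xFS protocol implementation.

class CoordinatorServer:
    def __init__(self):
        self.block_location = {}   # block_id -> client_id holding the data

    def register_write(self, client_id, block_id):
        """A client that writes a block becomes its storage site."""
        self.block_location[block_id] = client_id

    def locate(self, block_id):
        """Return the client to fetch from; None means no cached copy exists."""
        return self.block_location.get(block_id)


class Client:
    def __init__(self, client_id, server, peers):
        self.client_id = client_id
        self.server = server
        self.peers = peers            # client_id -> Client, for peer transfers
        self.cache = {}               # block_id -> data

    def write(self, block_id, data):
        self.cache[block_id] = data
        self.server.register_write(self.client_id, block_id)

    def read(self, block_id):
        if block_id in self.cache:            # local hit: no server traffic
            return self.cache[block_id]
        owner = self.server.locate(block_id)  # server only answers "who has it"
        if owner is None:
            raise KeyError(f"{block_id} is not cached anywhere")
        data = self.peers[owner].cache[block_id]  # direct client-to-client copy
        self.cache[block_id] = data
        return data


server = CoordinatorServer()
clients = {}
for cid in ("c1", "c2"):
    clients[cid] = Client(cid, server, clients)
clients["c1"].write("blk0", b"hello")
print(clients["c2"].read("blk0"))   # fetched directly from c1's cache
```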


Citations
Book•
01 Dec 1995
TL;DR: A new paradigm for network file system design, serverless network file systems, in which workstations cooperating as peers provide all file system services and deliver better performance and scalability than traditional file systems.
Abstract: In this paper, we propose a new paradigm for network file system design, serverless network file systems. While traditional network file systems rely on a central server machine, a serverless system utilizes workstations cooperating as peers to provide all file system services. Any machine in the system can store, cache, or control any block of data. Our approach uses this location independence, in combination with fast local area networks, to provide better performance and scalability than traditional file systems. Further, because any machine in the system can assume the responsibilities of a failed component, our serverless design also provides high availability via redundant data storage. To demonstrate our approach, we have implemented a prototype serverless network file system called xFS. Preliminary performance measurements suggest that our architecture achieves its goal of scalability. For instance, in a 32-node xFS system with 32 active clients, each client receives nearly as much read or write throughput as it would see if it were the only active client.
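The "any machine can store, cache, or control any block" idea can be pictured as spreading control (metadata) responsibility across all nodes, for example by hashing block identifiers. The sketch below is a minimal illustration under that assumption; it is not the paper's actual manager-map design, and the node names are hypothetical.

```python
# Illustrative sketch of location-independent control: responsibility for
# tracking a block is spread over all nodes by hashing its identifier, so no
# single machine becomes a metadata bottleneck. The hashing scheme is an
# assumption for illustration only.

import hashlib

def manager_for(block_id: str, nodes: list[str]) -> str:
    """Pick the node that tracks the location of this block."""
    digest = hashlib.sha256(block_id.encode()).digest()
    index = int.from_bytes(digest[:8], "big") % len(nodes)
    return nodes[index]

nodes = [f"workstation-{i}" for i in range(32)]
print(manager_for("/home/alice/paper.tex#block42", nodes))
```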

626 citations

Proceedings Article•
18 Jun 2000
TL;DR: This paper describes the collection and analysis of file system traces from a variety of different environments, including both UNIX and NT systems, clients and servers, and instructional and production systems, and develops a new metric for measuring file lifetime that accounts for files that are never deleted.
Abstract: In this paper, we describe the collection and analysis of file system traces from a variety of different environments, including both UNIX and NT systems, clients and servers, and instructional and production systems. Our goal is to understand how modern workloads affect the ability of file systems to provide high performance to users. Because of the increasing gap between processor speed and disk latency, file system performance is largely determined by its disk behavior. Therefore we primarily focus on the disk I/O aspects of the traces. We find that more processes access files via the memory-map interface than through the read interface. However, because many processes memory-map a small set of files, these files are likely to be cached. We also find that file access has a bimodal distribution pattern: some files are written repeatedly without being read; other files are almost exclusively read. We develop a new metric for measuring file lifetime that accounts for files that are never deleted. Using this metric, we find that the average block lifetime for some workloads is significantly longer than the 30-second write delay used by many file systems. However, all workloads show lifetime locality: the same files tend to be overwritten multiple times.
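One way to read the "metric that accounts for files that are never deleted" is to keep never-deleted blocks as censored observations (lifetime at least the time remaining in the trace) rather than discarding them. The sketch below computes block lifetimes from a trace under that assumption; the event format and the censoring treatment are illustrative, not the paper's exact definition.

```python
# Hedged sketch of a block-lifetime computation: a block's lifetime runs from
# the write that created it until it is overwritten or deleted; blocks that
# survive to the end of the trace are kept as censored observations instead
# of being dropped.

def block_lifetimes(events, trace_end):
    """events: iterable of (time, op, block_id) with op in {'write', 'delete'}."""
    born_at = {}          # block_id -> time of the write that created it
    exact = []            # completed lifetimes (block was overwritten/deleted)
    censored = []         # lower bounds for blocks still live at trace end
    for time, op, block in events:
        if op == "write":
            if block in born_at:                  # overwrite ends old lifetime
                exact.append(time - born_at[block])
            born_at[block] = time
        elif op == "delete" and block in born_at:
            exact.append(time - born_at.pop(block))
    for block, t in born_at.items():              # never deleted: censor at end
        censored.append(trace_end - t)
    return exact, censored

exact, censored = block_lifetimes(
    [(0, "write", "a"), (10, "write", "a"), (12, "write", "b"), (40, "delete", "a")],
    trace_end=60,
)
print(exact)     # [10, 30]
print(censored)  # [48]  -- block 'b' still live, lifetime at least 48
```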

507 citations

Proceedings Article•DOI•
14 Nov 1994
TL;DR: In this article, the authors examine four cooperative caching algorithms using a trace-driven simulation study and conclude that cooperative caching can significantly improve file system read response time and that relatively simple cooperative caching methods are sufficient to realize most of the potential performance gain.
Abstract: Emerging high-speed networks will allow machines to access remote data nearly as quickly as they can access local data. This trend motivates the use of cooperative caching: coordinating the file caches of many machines distributed on a LAN to form a more effective overall file cache. In this paper we examine four cooperative caching algorithms using a trace-driven simulation study. These simulations indicate that for the systems studied cooperative caching can halve the number of disk accesses, improving file system read response time by as much as 73%. Based on these simulations we conclude that cooperative caching can significantly improve file system read response time and that relatively simple cooperative caching algorithms are sufficient to realize most of the potential performance gain.
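The lookup order implied by the abstract, local memory first, then another client's memory over the fast LAN, and only then the server disk, can be sketched as follows. The manager interface here is a toy assumption; the paper itself compares four specific cooperative-caching algorithms that differ in how the peer caches are coordinated and are not reproduced here.

```python
# Simplified sketch of a cooperative-caching read path: local cache, then a
# peer's cache located via a directory ("manager"), then the server disk.
# The Manager class and cost-free dictionaries are illustrative assumptions.

class Manager:
    """Toy directory of which client's cache holds each block."""
    def __init__(self):
        self.location = {}            # block_id -> a client's cache dict

    def peer_with(self, block_id):
        return self.location.get(block_id)


def read_block(block_id, local_cache, manager, server_disk):
    if block_id in local_cache:                    # 1. local memory hit
        return local_cache[block_id], "local"
    peer_cache = manager.peer_with(block_id)       # 2. another client's memory
    if peer_cache is not None and block_id in peer_cache:
        data = peer_cache[block_id]
        local_cache[block_id] = data
        return data, "peer"
    data = server_disk[block_id]                   # 3. fall back to server disk
    local_cache[block_id] = data
    return data, "disk"


manager = Manager()
peer = {"blk7": b"data"}
manager.location["blk7"] = peer
local, disk = {}, {"blk7": b"data", "blk8": b"other"}
print(read_block("blk7", local, manager, disk))    # (b'data', 'peer')
print(read_block("blk8", local, manager, disk))    # (b'other', 'disk')
```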

491 citations

Proceedings Article•
25 Jun 2001

344 citations

Proceedings Article•
22 Jan 1996
TL;DR: Using trace-driven simulation, it is shown that a weak cache consistency protocol (the one used in the Alex ftp cache) reduces network bandwidth consumption and server load more than either time-to-live fields or an invalidation protocol and can be tuned to return stale data less than 5% of the time.
Abstract: The bandwidth demands of the World Wide Web continue to grow at a hyper-exponential rate. Given this rocketing growth, caching of web objects as a means to reduce network bandwidth consumption is likely to be a necessity in the very near future. Unfortunately, many Web caches do not satisfactorily maintain cache consistency. This paper presents a survey of contemporary cache consistency mechanisms in use on the Internet today and examines recent research in Web cache consistency. Using trace-driven simulation, we show that a weak cache consistency protocol (the one used in the Alex ftp cache) reduces network bandwidth consumption and server load more than either time-to-live fields or an invalidation protocol and can be tuned to return stale data less than 5% of the time.
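The weak-consistency scheme attributed to the Alex ftp cache is commonly described as an adaptive time-to-live: a cached object is trusted for a period proportional to how old the object already was when it was fetched, since files that have not changed for a long time tend to keep not changing. The sketch below illustrates that check; the 10% factor and the field names are assumptions, not the exact parameters studied in the paper.

```python
# Hedged sketch of an adaptive-TTL freshness check in the style of the Alex
# ftp cache: trust the cached copy for a fraction of the object's age at
# fetch time, then revalidate with the server.

def is_fresh(now, fetched_at, last_modified, age_factor=0.10):
    """Return True if the cached copy can be served without contacting the server."""
    age_when_fetched = fetched_at - last_modified   # older objects change less often
    time_to_live = age_factor * age_when_fetched
    return (now - fetched_at) < time_to_live

# An object unchanged for 100 days when cached is trusted for about 10 days.
print(is_fresh(now=105.0, fetched_at=100.0, last_modified=0.0))   # True
print(is_fresh(now=115.0, fetched_at=100.0, last_modified=0.0))   # False
```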

342 citations

References
Book•
01 Dec 1989
TL;DR: This best-selling title, considered for over a decade to be essential reading for every serious student and practitioner of computer design, has been updated throughout to address the most important trends facing computer designers today.
Abstract: This best-selling title, considered for over a decade to be essential reading for every serious student and practitioner of computer design, has been updated throughout to address the most important trends facing computer designers today. In this edition, the authors bring their trademark method of quantitative analysis not only to high-performance desktop machine design, but also to the design of embedded and server systems. They have illustrated their principles with designs from all three of these domains, including examples from consumer electronics, multimedia and Web technologies, and high-performance computing.

11,671 citations

Journal Article•DOI•
TL;DR: Observations of a prototype implementation are presented, changes in the areas of cache validation, server process structure, name translation, and low-level storage representation are motivated, and Andrew's ability to scale gracefully is quantitatively demonstrated.
Abstract: The Andrew File System is a location-transparent distributed file system that will eventually span more than 5000 workstations at Carnegie Mellon University. Large scale affects performance and complicates system operation. In this paper we present observations of a prototype implementation, motivate changes in the areas of cache validation, server process structure, name translation, and low-level storage representation, and quantitatively demonstrate Andrew's ability to scale gracefully. We establish the importance of whole-file transfer and caching in Andrew by comparing its performance with that of Sun Microsystems' NFS file system. We also show how the aggregation of files into volumes improves the operability of the system.
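The cache-validation change this work motivated is usually described as callback-based: the server promises to notify clients before a cached file changes, so whole cached files can be reused without validating on every open. The sketch below is a minimal illustration of that idea under assumed names and structure, not the AFS implementation.

```python
# Illustrative sketch of callback-style cache validation: the server grants a
# callback promise on each whole-file fetch and "breaks" it before the file
# changes, so clients can trust cached copies until notified.

class FileServer:
    def __init__(self):
        self.files = {}               # path -> contents
        self.callbacks = {}           # path -> set of clients holding a callback

    def fetch(self, client, path):
        """Whole-file transfer; the client also receives a callback promise."""
        self.callbacks.setdefault(path, set()).add(client)
        return self.files[path]

    def store(self, path, contents):
        """An update breaks every outstanding callback for the file."""
        self.files[path] = contents
        for client in self.callbacks.pop(path, set()):
            client.break_callback(path)


class AfsStyleClient:
    def __init__(self, server):
        self.server = server
        self.cache = {}               # path -> contents, valid while callback held

    def break_callback(self, path):
        self.cache.pop(path, None)    # next open must refetch

    def open(self, path):
        if path not in self.cache:    # callback broken or file never fetched
            self.cache[path] = self.server.fetch(self, path)
        return self.cache[path]


server = FileServer()
server.files["/afs/doc.txt"] = "v1"
c = AfsStyleClient(server)
print(c.open("/afs/doc.txt"))         # fetched, callback granted
server.store("/afs/doc.txt", "v2")    # callback broken
print(c.open("/afs/doc.txt"))         # refetched: "v2"
```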

1,604 citations

Journal Article•DOI•
TL;DR: This paper shows that disconnected operation is feasible, efficient, and usable by describing its design and implementation in the Coda File System, where caching of data, now widely used for performance, is also exploited to improve availability.
Abstract: Disconnected operation is a mode of operation that enables a client to continue accessing critical data during temporary failures of a shared data repository. An important, though not exclusive, application of disconnected operation is in supporting portable computers. In this paper, we show that disconnected operation is feasible, efficient and usable by describing its design and implementation in the Coda File System. The central idea behind our work is that caching of data, now widely used for performance, can also be exploited to improve availability.
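The core mechanism, caching exploited for availability, can be pictured as: serve reads from the local cache while the shared repository is unreachable, log updates made in the meantime, and replay the log on reconnection. The sketch below illustrates that cycle; the log format and the absence of conflict handling are simplifying assumptions, not Coda's actual design.

```python
# Hedged sketch of disconnected operation: reads hit the local cache while
# offline, writes are appended to a replay log, and the log is reintegrated
# with the server when connectivity returns.

class DisconnectableClient:
    def __init__(self, server):
        self.server = server          # dict-like stand-in for the shared repository
        self.cache = {}               # path -> data cached while connected
        self.replay_log = []          # updates made while disconnected
        self.connected = True

    def read(self, path):
        if self.connected:
            self.cache[path] = self.server[path]
        return self.cache[path]       # disconnected reads are served from the cache

    def write(self, path, data):
        self.cache[path] = data
        if self.connected:
            self.server[path] = data
        else:
            self.replay_log.append((path, data))

    def reconnect(self):
        self.connected = True
        for path, data in self.replay_log:   # reintegration: replay the log
            self.server[path] = data
        self.replay_log.clear()


shared = {"/coda/notes": "v1"}
client = DisconnectableClient(shared)
print(client.read("/coda/notes"))     # "v1", cached locally
client.connected = False              # repository becomes unreachable
client.write("/coda/notes", "v2")     # logged locally, still usable
print(client.read("/coda/notes"))     # "v2" served from the cache
client.reconnect()                    # log replayed to the server
print(shared["/coda/notes"])          # "v2"
```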

1,214 citations

Proceedings Article•
01 May 1999
TL;DR: The authors describe the design and implementation of disconnected operation in the Coda File System and show that it is feasible, efficient, and usable.
Abstract: Disconnected operation is a mode of operation that enables a client to continue accessing critical data during temporary failures of a shared data repository. An important, though not exclusive, application of disconnected operation is in supporting portable computers. In this paper, we show that disconnected operation is feasible, efficient and usable by describing its design and implementation in the Coda File System. The central idea behind our work is that caching of data, now widely used for performance, can also be exploited to improve availability.

1,144 citations

Book•
R. Sandberg, D. Goldberg, Steve Kleiman, D. Walsh, B. Lyon •
01 Dec 1988
TL;DR: The Sun Network Filesystem provides transparent, remote access to filesystems and uses an External Data Representation (XDR) specification to describe protocols in a machine- and system-independent way.
Abstract: The Sun Network Filesystem (NFS) provides transparent, remote access to filesystems. Unlike many other remote filesystem implementations under UNIX, the NFS is designed to be easily portable to other operating systems and machine architectures. It uses an External Data Representation (XDR) specification to describe protocols in a machine- and system-independent way. The NFS is implemented on top of a Remote Procedure Call package (RPC) to help simplify protocol definition, implementation, and maintenance.
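To make the "machine- and system-independent" encoding concrete, the sketch below writes fields in big-endian (network) byte order with 4-byte alignment, which is the general style XDR follows. It uses Python's struct module purely for illustration and is not Sun's XDR library; the "lookup" procedure number shown is hypothetical.

```python
# Illustrative sketch of an XDR-style encoding: fixed-size fields in
# big-endian order, variable-length data length-prefixed and padded to a
# 4-byte boundary, so any architecture decodes the bytes the same way.

import struct

def xdr_uint(value: int) -> bytes:
    return struct.pack(">I", value)               # 4-byte big-endian unsigned

def xdr_opaque(data: bytes) -> bytes:
    pad = (4 - len(data) % 4) % 4                 # pad to a 4-byte boundary
    return struct.pack(">I", len(data)) + data + b"\x00" * pad

# A toy request: procedure number 4 (a hypothetical "lookup") plus a file name.
message = xdr_uint(4) + xdr_opaque(b"paper.tex")
print(message.hex())
```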

814 citations
