scispace - formally typeset
Book ChapterDOI

Identifying Dynamic Replication Strategies for a High-Performance Data Grid

Reads0
Chats0
TLDR
A simulation framework that is developed to enable comparative studies of alternative dynamic replication strategies and preliminary results obtained with this simulator show that significant savings in latency and bandwidth can be obtained if the access patterns contain a small degree of geographical locality.
Abstract
Dynamic replication can be used to reduce bandwidth consumption and access latency in high performance "data grids" where users require remote access to large files. Different replication strategies can be defined depending on when, where, and how replicas are created and destroyed. We describe a simulation framework that we have developed to enable comparative studies of alternative dynamic replication strategies. We present preliminary results obtained with this simulator, in which we evaluate the performance of five different replication strategies for three different kinds of access patterns. The data in ths scenario is read-only and so there are no consistency issues involved. The simulation results show that significant savings in latency and bandwidth can be obtained if the access patterns contain a small degree of geographical locality.

read more

Citations
More filters
Journal ArticleDOI

Mapping Abstract Complex Workflows onto Grid Environments

TL;DR: The current ACWG based on AI planning technologies is described and it is outlined how these technologies can play a crucial role in developing complex application workflows in Grid environments.
Proceedings ArticleDOI

Decoupling computation and data scheduling in distributed data-intensive applications

TL;DR: This work develops a family of algorithms and uses simulation studies to evaluate various combinations of these algorithms to suggest that while it is necessary to consider the impact of replication, it is not always necessary to couple data movement and computation scheduling.

Scheduling Algorithms for Grid Computing: State of the Art and Open Problems

TL;DR: This survey provides a review of the subject of Grid scheduling mainly from the perspective of scheduling algorithms, and identifies the challenges and state of the art of current research.
Book

Grid resource management: state of the art and future trends

TL;DR: An introduction to Grid applications and technologies is provided and the important role that resource management will play in future developments is discussed.
Journal ArticleDOI

Optorsim: A Grid Simulator for Studying Dynamic Data Replication Strategies

TL;DR: This paper details the design and implementation of OptorSim and analyze various replication algorithms based on different Grid workloads, and provides a modular framework within which optimization strategies can be studied under different Grid configurations.
References
More filters
Book

The Grid 2: Blueprint for a New Computing Infrastructure

TL;DR: The Globus Toolkit as discussed by the authors is a toolkit for high-throughput resource management for distributed supercomputing applications, focusing on real-time wide-distributed instrumentation systems.
Journal ArticleDOI

The Anatomy of the Grid: Enabling Scalable Virtual Organizations

TL;DR: The authors present an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing.
Journal ArticleDOI

The GRID: Blueprint for a New Computing Infrastructure

TL;DR: The main purpose is to update the designers and users of parallel numerical algorithms with the latest research in the field and present the novel ideas, results and work in progress and advancing state-of-the-art techniques in the area of parallel and distributed computing for numerical and computational optimization problems in scientific and engineering application.
Posted Content

The Anatomy of the Grid - Enabling Scalable Virtual Organizations

TL;DR: This article reviews the "Grid problem," and presents an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing.
Journal ArticleDOI

Summary cache: a scalable wide-area web cache sharing protocol

TL;DR: This paper demonstrates the benefits of cache sharing, measures the overhead of the existing protocols, and proposes a new protocol called "summary cache", which reduces the number of intercache protocol messages, reduces the bandwidth consumption, and eliminates 30% to 95% of the protocol CPU overhead, all while maintaining almost the same cache hit ratios as ICP.
Related Papers (5)