
Showing papers by "Carl Kesselman published in 2001"


Journal ArticleDOI
01 Aug 2001
TL;DR: The authors present an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing.
Abstract: "Grid" computing has emerged as an important new field, distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high performance orientation. In this article, the authors define this new field. First, they review the "Grid problem," which is defined as flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources--what is referred to as virtual organizations. In such settings, unique authentication, authorization, resource access, resource discovery, and other challenges are encountered. It is this class of problem that is addressed by Grid technologies. Next, the authors present an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing. The authors describe requirements that they believe any such mechanisms must satisfy and discuss the importance of defining a compact set of intergrid protocols to enable interoperability among different Grid systems. Finally, the authors discuss how Grid technologies relate to other contemporary technologies, including enterprise integration, application service provider, storage service provider, and peer-to-peer computing. They maintain that Grid concepts and technologies complement and have much to contribute to these other approaches.

6,716 citations


Posted Content
TL;DR: This article reviews the "Grid problem," and presents an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing.
Abstract: "Grid" computing has emerged as an important new field, distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high-performance orientation. In this article, we define this new field. First, we review the "Grid problem," which we define as flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources-what we refer to as virtual organizations. In such settings, we encounter unique authentication, authorization, resource access, resource discovery, and other challenges. It is this class of problem that is addressed by Grid technologies. Next, we present an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing. We describe requirements that we believe any such mechanisms must satisfy, and we discuss the central role played by the intergrid protocols that enable interoperability among different Grid systems. Finally, we discuss how Grid technologies relate to other contemporary technologies, including enterprise integration, application service provider, storage service provider, and peer-to-peer computing. We maintain that Grid concepts and technologies complement and have much to contribute to these other approaches.

3,595 citations


Proceedings ArticleDOI
07 Aug 2001
TL;DR: This work presents an information services architecture that addresses performance, security, scalability, and robustness requirements of Grid software infrastructure and has been implemented as MDS-2, which forms part of the Globus Grid toolkit and has been widely deployed and applied.
Abstract: Grid technologies enable large-scale sharing of resources within formal or informal consortia of individuals and/or institutions: what are sometimes called virtual organizations. In these settings, the discovery, characterization, and monitoring of resources, services, and computations are challenging problems due to the considerable diversity, large numbers, dynamic behavior, and geographical distribution of the entities in which a user might be interested. Consequently, information services are a vital part of any Grid software infrastructure, providing fundamental mechanisms for discovery and monitoring, and hence for planning and adapting application behavior. We present an information services architecture that addresses performance, security, scalability, and robustness requirements. Our architecture defines simple low-level enquiry and registration protocols that make it easy to incorporate individual entities into various information structures, such as aggregate directories that support a variety of different query languages and discovery strategies. These protocols can also be combined with other Grid protocols to construct additional higher-level services and capabilities such as brokering, monitoring, fault detection, and troubleshooting. Our architecture has been implemented as MDS-2, which forms part of the Globus Grid toolkit and has been widely deployed and applied.

1,707 citations
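
The enquiry and registration protocols described above (implemented over LDAP in MDS-2) lend themselves to a compact illustration. The following Python sketch models soft-state registration with an aggregate directory; it is a toy stand-in under assumed names (AggregateDirectory, register, query), not the MDS-2 API.

```python
import time

class AggregateDirectory:
    """Toy stand-in for an MDS-2 aggregate directory (illustrative only)."""

    def __init__(self, ttl_seconds=60):
        self.ttl = ttl_seconds
        self.entries = {}  # resource name -> (attributes, expiry time)

    def register(self, name, attributes):
        # Soft-state registration: entries expire unless refreshed,
        # which lets the directory tolerate failed or departed providers.
        self.entries[name] = (attributes, time.time() + self.ttl)

    def query(self, **constraints):
        # Return live entries whose attributes match all constraints.
        now = time.time()
        return [
            name
            for name, (attrs, expiry) in self.entries.items()
            if expiry > now
            and all(attrs.get(k) == v for k, v in constraints.items())
        ]

# A compute resource registers itself, then a client discovers it.
directory = AggregateDirectory(ttl_seconds=120)
directory.register("cluster-a.example.org", {"arch": "ia64", "free_nodes": 48})
directory.register("cluster-b.example.org", {"arch": "sparc", "free_nodes": 12})
print(directory.query(arch="ia64"))  # ['cluster-a.example.org']
```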


Journal ArticleDOI
01 Nov 2001
TL;DR: The goal of the Grid Application Development Software (GrADS) project is to simplify distributed heterogeneous computing in the same way that the World Wide Web simplified information sharing over the Internet.
Abstract: Advances in networking technologies will soon make it possible to use the global information infrastructure in a qualitatively different way--as a computational as well as an information resource. As described in the recent book The Grid: Blueprint for a New Computing Infrastructure, this Grid will connect the nation's computers, databases, instruments, and people in a seamless web of computing and distributed intelligence, which can be used in an on-demand fashion as a problem-solving resource in many fields of human endeavor--and, in particular, science and engineering. The availability of grid resources will give rise to dramatically new classes of applications, in which computing resources are no longer localized but, rather, distributed, heterogeneous, and dynamic; computation is increasingly sophisticated and multidisciplinary; and computation is integrated into our daily lives and, hence, subject to stricter time constraints than at present. The impact of these new applications will be pervasive, ranging from new systems for scientific inquiry, through computing support for crisis management, to the use of ambient computing to enhance personal mobile computing environments. To realize this vision, significant scientific and technical obstacles must be overcome. Principal among these is usability. The goal of the Grid Application Development Software (GrADS) project is to simplify distributed heterogeneous computing in the same way that the World Wide Web simplified information sharing over the Internet. To that end, the project is exploring the scientific and technical problems that must be solved to make it easier for ordinary scientific users to develop, execute, and tune applications on the Grid. In this paper, the authors describe the vision and strategies underlying the GrADS project, including the base software architecture for grid execution and performance monitoring, strategies and tools for construction of applications from libraries of grid-aware components, and development of innovative new science and engineering applications that can exploit these new technologies to run effectively in grid environments.

366 citations


Proceedings ArticleDOI
17 Apr 2001
TL;DR: The high-speed transport service, GridFTP, extends the popular FTP protocol with new features required for Data Grid applications, such as striping and partial file access, and the replica management service integrates a replica catalog with GridFTP transfers to provide for the creation, registration, location, and management of dataset replicas.
Abstract: An emerging class of data-intensive applications involve the geographically dispersed extraction of complex scientific information from very large collections of measured or computed data. Such applications arise, for example, in experimental physics, where the data in question is generated by accelerators, and in simulation science, where the data is generated by supercomputers. So-called Data Grids provide essential infrastructure for such applications, much as the Internet provides essential services for applications such as e-mail and the Web. We describe here two services that we believe are fundamental to any Data Grid: reliable, high-speed transport and replica management. Our high-speed transport service, GridFTP, extends the popular FTP protocol with new features required for Data Grid applications, such as striping and partial file access. Our replica management service integrates a replica catalog with GridFTP transfers to provide for the creation, registration, location, and management of dataset replicas. We present the design of both services and also preliminary performance results. Our implementations exploit security and other services provided by the Globus Toolkit.

321 citations
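
Two of the GridFTP features named above, partial file access and replica-catalog integration, can be sketched compactly. The Python below is a minimal illustration using local file I/O; ReplicaCatalog, partial_read, and the file paths are hypothetical, not the Globus Toolkit interfaces.

```python
import os

class ReplicaCatalog:
    """Toy mapping from logical file names to physical replica locations
    (hypothetical; the real service is part of the Globus Toolkit)."""

    def __init__(self):
        self.replicas = {}  # logical name -> list of physical locations

    def register(self, logical_name, physical_location):
        self.replicas.setdefault(logical_name, []).append(physical_location)

    def locate(self, logical_name):
        return self.replicas.get(logical_name, [])

def partial_read(path, offset, length):
    # Partial file access: fetch only the byte range an analysis needs,
    # rather than transferring the whole (possibly huge) dataset.
    with open(path, "rb") as f:
        f.seek(offset)
        return f.read(length)

catalog = ReplicaCatalog()
catalog.register("run42/events.dat", "/data/replica1/events.dat")
for location in catalog.locate("run42/events.dat"):
    if os.path.exists(location):
        chunk = partial_read(location, offset=1024, length=4096)
        break
```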


Proceedings ArticleDOI
10 Nov 2001
TL;DR: The Earth System Grid prototype is described, which brings together advanced analysis, replica management, data transfer, request management, and other technologies to support high-performance, interactive analysis of replicated data.
Abstract: In numerous scientific disciplines, terabyte and soon petabyte-scale data collections are emerging as critical community resources. A new class of Data Grid infrastructure is required to support management, transport, distributed access to, and analysis of these datasets by potentially thousands of users. Researchers who face this challenge include the Climate Modeling community, which performs long-duration computations accompanied by frequent output of very large files that must be further analyzed. We describe the Earth System Grid prototype, which brings together advanced analysis, replica management, data transfer, request management, and other technologies to support high-performance, interactive analysis of replicated data. We present performance results that demonstrate our ability to manage the location and movement of large datasets from the user’s desktop. We report on experiments conducted over SciNET at SC’2000, where we achieved peak performance of 1.55 Gb/s and sustained performance of 512.9 Mb/s for data transfers between Texas and California.

160 citations
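
For a sense of scale, a back-of-the-envelope calculation (decimal units assumed) shows what the sustained rate reported above means for terabyte-class climate datasets:

```python
# Time to move 1 TB (decimal) at the paper's sustained rate of 512.9 Mb/s.
sustained_bps = 512.9e6          # bits per second
dataset_bits = 1e12 * 8          # one terabyte expressed in bits
seconds = dataset_bits / sustained_bps
print(f"{seconds / 3600:.1f} hours")   # roughly 4.3 hours
```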


Journal ArticleDOI
TL;DR: A pipelined data acquisition and reconstruction system that combines a fast detector system, high-speed data networks, and massively parallel computers to rapidly acquire the projection data and perform the reconstruction and rendering calculations.
Abstract: A third-generation synchrotron radiation source provides enough brilliance to acquire complete tomographic data sets at 100 nm or better resolution in a few minutes. To take advantage of such high-brilliance sources at the Advanced Photon Source, we have constructed a pipelined data acquisition and reconstruction system that combines a fast detector system, high-speed data networks, and massively parallel computers to rapidly acquire the projection data and perform the reconstruction and rendering calculations. With the current setup, a data set can be obtained and reconstructed in tens of minutes. A specialized visualization computer makes rendered three-dimensional (3D) images available to the beamline users minutes after the data acquisition is completed. This system is capable of examining a large number of samples at sub-μm 3D resolution or studying the full 3D structure of a dynamically evolving sample on a 10 min temporal scale. In the near future, we expect to increase the spatial resolution to below 100 nm by using zone-plate x-ray focusing optics and to improve the time resolution by the use of a broadband x-ray monochromator and a faster detector system.

146 citations
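
The pipelining described above overlaps acquisition, reconstruction, and rendering so that later stages process one data set while earlier stages work on the next. A minimal Python sketch of that structure, with threads and queues standing in for the detector, parallel computer, and visualization host:

```python
import queue
import threading

acquired = queue.Queue()
reconstructed = queue.Queue()

def acquire(num_projections):
    # Stand-in for the fast detector streaming projection data.
    for i in range(num_projections):
        acquired.put(f"projection-{i}")
    acquired.put(None)  # end-of-stream sentinel

def reconstruct():
    # Stand-in for the massively parallel reconstruction step.
    while (item := acquired.get()) is not None:
        reconstructed.put(f"slice from {item}")
    reconstructed.put(None)

def render():
    # Stand-in for the visualization computer producing 3D images.
    while (item := reconstructed.get()) is not None:
        pass  # render the reconstructed slice

stages = [
    threading.Thread(target=acquire, args=(100,)),
    threading.Thread(target=reconstruct),
    threading.Thread(target=render),
]
for t in stages:
    t.start()
for t in stages:
    t.join()
```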


01 Jan 2001
TL;DR: An extensible and open Grid architecture is presented in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing, and it is maintained that Grid concepts and technologies complement and have much to contribute to other contemporary approaches.
Abstract: “Grid” computing has emerged as an important new field, distinguished from conventional distributed computing by its focus on large-scale resource sharing, innovative applications, and, in some cases, high-performance orientation. In this article, we define this new field. First, we review the “Grid problem,” which we define as flexible, secure, coordinated resource sharing among dynamic collections of individuals, institutions, and resources—what we refer to as virtual organizations. In such settings, we encounter unique authentication, authorization, resource access, resource discovery, and other challenges. It is this class of problem that is addressed by Grid technologies. Next, we present an extensible and open Grid architecture, in which protocols, services, application programming interfaces, and software development kits are categorized according to their roles in enabling resource sharing. We describe requirements that we believe any such mechanisms must satisfy and we discuss the importance of defining a compact set of intergrid protocols to enable interoperability among different Grid systems. Finally, we discuss how Grid technologies relate to other contemporary technologies, including enterprise integration, application service provider, storage service provider, and peer-to-peer computing. We maintain that Grid concepts and technologies complement and have much to contribute to these other approaches.

105 citations



01 Jan 2001
TL;DR: The U.S. Department of Energy (DOE) is supporting an advanced climate simulation program aimed at accelerating the execution of climate models one hundred-fold by 2005 relative to today's execution rate.
Abstract: Global coupled Earth System models are vital tools for understanding potential future changes in our climate. As we move towards mid-decade, we will see new model realizations with higher grid resolution and the integration of many additional complex processes. The U.S. Department of Energy (DOE) is supporting an advanced climate simulation program that is aimed at accelerating the execution of climate models one hundred-fold by 2005 relative to the execution rate of today.

31 citations


Proceedings ArticleDOI
07 Aug 2001
TL;DR: A new advance-reservation system for graphics resources and an application of existing grid technology to create general-purpose active storage systems for grid computing environments are described.
Abstract: Computational grids are enabling collaboration between scientists and organizations to generate and archive extremely large datasets across shared, distributed resources. There is a need to visually explore such data throughout the life-cycle of projects. Practical exploration of large datasets requires visualization tools that can function in the same grid environment in which the data is created and stored. Resource management interfaces are an important structural component of grid computing environments because they enable uniform access to the wide variety of resources necessary for scientific work. We describe a new advance-reservation system for graphics resources, and an application of existing grid technology to create general-purpose active storage systems. We report our experience with prototype infrastructure and application components, involving experiments coupling end-to-end resources for interactive visual exploration of large data in representative distributed environments.
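
The advance-reservation idea mentioned above can be sketched as a booking service that grants a future time window on a graphics resource only if it does not overlap an existing booking. The interface below is hypothetical, not the system's actual API:

```python
from dataclasses import dataclass

@dataclass
class Reservation:
    resource: str
    start: float   # seconds since epoch
    end: float

class GraphicsReservationSystem:
    """Hypothetical advance-reservation manager for graphics resources."""

    def __init__(self):
        self.reservations = []

    def reserve(self, resource, start, end):
        # Grant only if the requested window overlaps no existing booking
        # on the same resource.
        for r in self.reservations:
            if r.resource == resource and start < r.end and end > r.start:
                return None  # conflict: caller may retry another window
        booking = Reservation(resource, start, end)
        self.reservations.append(booking)
        return booking

# Reserve a graphics pipe for a one-hour window starting at t=1000.
system = GraphicsReservationSystem()
granted = system.reserve("graphics-pipe-0", start=1000.0, end=4600.0)
```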

Journal ArticleDOI
TL;DR: Extensions to the message passing interface (MPI) are proposed that generalize the MPI communicator concept to allow multiple communication endpoints per process, dynamic creation of endpoints, and the transfer of endpoints between processes.
Abstract: We propose extensions to the message passing interface (MPI) that generalize the MPI communicator concept to allow multiple communication endpoints per process, dynamic creation of endpoints, and the transfer of endpoints between processes. The generalized communicator construct can be used to express a wide range of interesting communication structures, including collective communication operations involving multiple threads per process, communications between dynamically created threads or processes, and object-oriented applications in which communications are directed to specific objects. Furthermore, this enriched functionality can be provided in a manner that preserves backward compatibility with MPI. We describe the proposed extensions, illustrate their use with examples, and describe a prototype implementation in the popular MPI implementation MPICH.
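
Because the proposed extensions are not part of standard MPI, the sketch below models only the concept in plain Python: a process owns several named endpoints, each an independently addressable message target, so two threads in one process can receive separately. All names are illustrative.

```python
import queue

class Endpoint:
    """Models one communication endpoint; a process may own several."""

    def __init__(self, name):
        self.name = name
        self.inbox = queue.Queue()

    def recv(self):
        return self.inbox.get()

def send(destination, message):
    # In the proposal, messages address endpoints rather than processes,
    # so threads and objects become first-class communication targets.
    destination.inbox.put(message)

# One process, two distinct endpoints (e.g., one per worker thread).
worker_a = Endpoint("worker-a")
worker_b = Endpoint("worker-b")
send(worker_a, "task 1")
send(worker_b, "task 2")
assert worker_a.recv() == "task 1"
assert worker_b.recv() == "task 2"
```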

Journal ArticleDOI
TL;DR: A Web-based, grid-enabled application that processes, analyzes, and delivers remote-sensing images provides an example of the technology at work.
Abstract: Theoretically, computational and data grids are the computing paradigm of choice, but they will not gain wide acceptance until users have seamless access to them. The authors describe how to provide comfortable, intuitive, yet powerful Web access to supercomputing. A Web-based, grid-enabled application that processes, analyzes, and delivers remote-sensing images provides an example of the technology at work.

01 Jan 2001
TL;DR: This document aims to specify a schema for a transformation catalog, which holds information regarding the transformation files and their location, in the context of the LIGO prototype.
Abstract: As part of the GriPhyN project (www.griphyn.org) we are developing a transformation catalog, which will be used to store information about the transformations that need to be applied in order to process data as requested by the user. The catalog can be used to process raw or derived data products. Data derived using the transformations may be placed in the Metadata Catalog or the Replica Catalog [1]. The transformation catalog will be evaluated in the context of the LIGO prototype. The existing prototype is able to take as input a set of channel names and a starting and ending time, and can then query the replica catalog for the location of the requested data. If the files are not found, the computation needed to produce them is scheduled on LIGO’s computational resources. The granularity of LIGO data frames is set to 50 sec time frames. Frames can be concatenated to form larger time frames. The results are provided to the user as a URL. The newly produced 50 sec frames are then entered into the replica catalog for future reference. The concatenated file returned to the user is not reflected in the replica catalog. Currently, the prototype uses only two transformations. By using the transformation catalog, the user will be able to ask for a rich set of derived data. This will be another step towards the realization of the concept of Virtual Data, where the user/application can request data whether it is materialized or not. If the materialized data is not available, or the cost of accessing the processed data is greater than the cost of applying the transformation, then we need to obtain information about where the appropriate transformation is located. This document aims to specify a schema for a transformation catalog, which holds information regarding the transformation files and their location.
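
A concrete (and purely hypothetical) rendering of such a schema may help: each catalog entry records which transformation produces a derived product, its version, and where the executable lives. The field names below are assumptions for illustration, not the GriPhyN schema itself.

```python
from dataclasses import dataclass, field

@dataclass
class TransformationEntry:
    """Illustrative transformation-catalog record (fields are assumptions)."""
    name: str                 # logical transformation name
    version: str
    location: str             # where the executable or script resides
    input_types: list = field(default_factory=list)
    output_types: list = field(default_factory=list)

# Keyed by (name, version); the location URL is an invented example.
catalog = {
    ("concat_frames", "1.0"): TransformationEntry(
        name="concat_frames",
        version="1.0",
        location="gsiftp://ligo.example.org/bin/concat_frames",
        input_types=["frame-50s"],
        output_types=["frame-concatenated"],
    ),
}

def lookup(name, version):
    # Resolve a requested transformation to its catalog record, if any.
    return catalog.get((name, version))
```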


Book ChapterDOI
25 Jun 2001
TL;DR: The physics and computing challenges of LIGO are explained, and the tools that GriPhyN will build to address them are explained.
Abstract: GriPhyN (Grid Physics Network) is a large US collaboration to build grid services for large physics experiments, one of which is LIGO, a gravitational-wave observatory. This paper explains the physics and computing challenges of LIGO, and the tools that GriPhyN will build to address them. A key component needed to implement the data pipeline is a virtual data service: a system to dynamically create data products requested during the various stages. The requested data may already have been processed in the desired way, it may reside in a file on a storage system, it may be cached, or it may need to be created through computation. The full elaboration of this system will allow complex data pipelines to be set up as virtual data objects, with existing data being transformed in diverse ways.

Book ChapterDOI
09 Sep 2001
TL;DR: Some of the challenges associated with building Virtual Data Grids for experiments such as LIGO are described.
Abstract: Many Physics experiments today generate large volumes of data. That data is then processed in many ways in order to achieve the understanding of fundamental physical phenomena. Virtual Data is a concept that unifies the view of the data whether it is raw or derived. It provides a new degree of transparency in how data-handling and processing capabilities are integrated to deliver data products to end-users or applications, so that requests for such products are easily mapped into computation and/or data access at multiple locations. GriPhyN (Grid Physics Network) is an NSF-funded project, which aims to realize the concepts of Virtual Data. Among the physics applications participating in the project is the Laser Interferometer Gravitational-wave Observatory (LIGO), which is being built to observe the gravitational waves predicted by general relativity. LIGO will produce large amounts of data, which are expected to reach hundreds of petabytes over the next decade. Large communities of scientists, distributed around the world, need to access parts of these datasets and perform efficient analysis on them. It is expected that the raw and processed data will be distributed among various national centers, university computing centers, and individual workstations. In this paper we describe some of the challenges associated with building Virtual Data Grids for experiments such as LIGO.
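
The core of the Virtual Data idea is a resolver that returns a replica if the product is already materialized, and otherwise schedules the producing transformation and registers the result. A minimal Python sketch of that decision, with all names hypothetical:

```python
def resolve(logical_name, replica_catalog, transformation_catalog, run):
    """Return a physical location for logical_name, deriving it if needed."""
    # 1. Already materialized? Use an existing replica.
    replicas = replica_catalog.get(logical_name, [])
    if replicas:
        return replicas[0]

    # 2. Not materialized: find the transformation that produces it and
    #    schedule the computation ('run' stands in for a grid scheduler,
    #    as in the LIGO prototype described above).
    transform = transformation_catalog[logical_name]
    produced = run(transform, logical_name)

    # 3. Register the new replica so future requests find it directly.
    replica_catalog.setdefault(logical_name, []).append(produced)
    return produced
```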

Journal Article
TL;DR: This introductory chapter lays the groundwork for the rest of the book by providing a more detailed picture of the expected purpose, shape, and architecture of future grid systems.
Abstract: In this introductory chapter, we lay the groundwork for the rest of the book by providing a more detailed picture of the expected purpose, shape, and architecture of future grid systems. We structure the chapter in terms of six questions that we believe are central to this discussion: Why do we need computational grids? What types of applications will grids be used for? Who will use grids? How will grids be used? What is involved in building a grid? And, what problems must be solved to make grids commonplace? We provide an overview of each of these issues here, referring to subsequent chapters for more detailed discussion.

01 Jan 2001
TL;DR: In this article, a pipelined data acquisition and reconstruction system that combines a fast detector system, high-speed data networks, and massively parallel computers to rapidly acquire the projection data and perform the reconstruction and rendering calculations is presented.
Abstract: (Received 14 November 2000; accepted for publication 23 January 2001) A third-generation synchrotron radiation source provides enough brilliance to acquire complete tomographic data sets at 100 nm or better resolution in a few minutes. To take advantage of such high-brilliance sources at the Advanced Photon Source, we have constructed a pipelined data acquisition and reconstruction system that combines a fast detector system, high-speed data networks, and massively parallel computers to rapidly acquire the projection data and perform the reconstruction and rendering calculations. With the current setup, a data set can be obtained and reconstructed in tens of minutes. A specialized visualization computer makes rendered three-dimensional (3D) images available to the beamline users minutes after the data acquisition is completed. This system is capable of examining a large number of samples at sub-μm 3D resolution or studying the full 3D structure of a dynamically evolving sample on a 10 min temporal scale. In the near future, we expect to increase the spatial resolution to below 100 nm by using zone-plate x-ray focusing optics and to improve the time resolution by the use of a broadband x-ray monochromator and a faster detector system. © 2001 American Institute of Physics. [DOI: 10.1063/1.1355270]

Posted Content
TL;DR: In this paper, the authors describe two services that are fundamental to any Data Grid: reliable, high-speed transport and replica management, which are implemented using the Globus Toolkit.
Abstract: An emerging class of data-intensive applications involve the geographically dispersed extraction of complex scientific information from very large collections of measured or computed data. Such applications arise, for example, in experimental physics, where the data in question is generated by accelerators, and in simulation science, where the data is generated by supercomputers. So-called Data Grids provide essential infrastructure for such applications, much as the Internet provides essential services for applications such as e-mail and the Web. We describe here two services that we believe are fundamental to any Data Grid: reliable, high-speed transport and replica management. Our high-speed transport service, GridFTP, extends the popular FTP protocol with new features required for Data Grid applications, such as striping and partial file access. Our replica management service integrates a replica catalog with GridFTP transfers to provide for the creation, registration, location, and management of dataset replicas. We present the design of both services and also preliminary performance results. Our implementations exploit security and other services provided by the Globus Toolkit.