Home
/
Authors
/
Ben Clifford

Author

Ben Clifford

Other affiliations: National Center for Supercomputing Applications, Argonne National Laboratory, University of Southern California

Bio: Ben Clifford is an academic researcher from University of Chicago. The author has contributed to research in topics: Scripting language & Petascale computing. The author has an hindex of 15, co-authored 22 publications receiving 2475 citations. Previous affiliations of Ben Clifford include National Center for Supercomputing Applications & Argonne National Laboratory.

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The Open Provenance Model core specification (v1.1)

[...]

Luc Moreau, Ben Clifford¹, Juliana Freire, Joe Futrelle¹, Yolanda Gil², Paul Groth³, Natalia Kwasnikowska, Simon Miles⁴, Paolo Missier, James D. Myers¹, Beth Plale⁵, Yogesh Simmhan⁶, Eric G. Stephan⁷, Jan Van den Bussche - Show less +10 more•Institutions (7)

National Center for Supercomputing Applications¹, Information Sciences Institute², VU University Amsterdam³, King's College London⁴, Indiana University⁵, Microsoft⁶, Pacific Northwest National Laboratory⁷

01 Jun 2011-Future Generation Computer Systems

TL;DR: This document contains the specification of the Open Provenance Model (v1.1) resulting from a community effort to achieve inter-operability in the Provenances Challenge series.

...read moreread less

762 citations

Journal Article•DOI•

Swift: A language for distributed parallel scripting

[...]

Michael Wilde¹, Mihael Hategan¹, Justin M. Wozniak¹, Ben Clifford², Daniel S. Katz¹, Ian Foster¹ - Show less +2 more•Institutions (2)

Argonne National Laboratory¹, University of Chicago²

01 Sep 2011

TL;DR: This work presents Swift's implicitly parallel and deterministic programming model, which applies external applications to file collections using a functional style that abstracts and simplifies distributed parallel execution.

...read moreread less

Abstract: Scientists, engineers, and statisticians must execute domain-specific application programs many times on large collections of file-based data. This activity requires complex orchestration and data management as data is passed to, from, and among application invocations. Distributed and parallel computing resources can accelerate such processing, but their use further increases programming complexity. The Swift parallel scripting language reduces these complexities by making file system structures accessible via language constructs and by allowing ordinary application programs to be composed into powerful parallel scripts that can efficiently utilize parallel and distributed resources. We present Swift's implicitly parallel and deterministic programming model, which applies external applications to file collections using a functional style that abstracts and simplifies distributed parallel execution.

...read moreread less

421 citations

Proceedings Article•DOI•

Swift: Fast, Reliable, Loosely Coupled Parallel Computation

[...]

Yong Zhao¹, Mihael Hategan¹, Ben Clifford², Ian Foster², G. von Laszewski², Veronika Nefedova², Ioan Raicu², T. Stef-Praun², Michael Wilde² - Show less +5 more•Institutions (2)

University of Chicago¹, Argonne National Laboratory²

09 Jul 2007

TL;DR: Swift adopts and adapts ideas first explored in the GriPhyN virtual data system, improving on that system in many regards and describes application experiences and performance experiments that quantify the cost of Swift operations.

...read moreread less

Abstract: We present Swift, a system that combines a novel scripting language called SwiftScript with a powerful runtime system based on CoG Karajan, Falkon, and Globus to allow for the concise specification, and reliable and efficient execution, of large loosely coupled computations. Swift adopts and adapts ideas first explored in the GriPhyN virtual data system, improving on that system in many regards. We describe the SwiftScript language and its use of XDTM to describe the logical structure of complex file system structures. We also present the Swift runtime system and its use of CoG Karajan, Falkon, and Globus services to dispatch and manage the execution of many tasks in parallel and grid environments. We describe application experiences and performance experiments that quantify the cost of Swift operations.

...read moreread less

387 citations

Proceedings Article•DOI•

The Grid2003 production grid: principles and practice

[...]

Ian Foster¹, J. Gieraltowski¹, S. Gose¹, Natalia Maltsev¹, E. May¹, Alex Rodriguez¹, Dinanath Sulakhe¹, Alexandre Vaniachine¹, James Shank², S. Youssef², D. Adams³, R. Baker³, W. Deng³, J. Smith³, Dantong Yu³, I. Legrand⁴, S. Singh⁴, Conrad Steenberg⁴, Y. Xia⁴, A. Afaq, E. Berman, James Annis, Lothar At Bauerdick, Michael Ernst, Ian Fisk, L. Giacchetti, G. Graham, A. Heavey, Jozef Kaiser, N. Kuropatkin, Ruth Pordes, V. Sekhri, J. Weigand, Y. Wu, K. Baker⁵, L. Sorrillo⁵, John Huth⁶, M. Allen⁷, L. Grundhoefer⁷, J. Hicks⁷, F. Luehring⁷, S. Peck⁷, Rob Quick⁷, Stephen C. Simms⁷, G. Fekete⁸, J. VandenBerg⁸, K. Cho, K. Kwon, D. Son, H. Park, Shane Canon⁹, Keith Jackson⁹, David E. Konerding⁹, Jason Lee⁹, Doug Olson⁹, I. Sakrejda⁹, Brian Tierney⁹, Mark L. Green, Russ Miller, James Letts, Tim Martin, D. Bury¹⁰, Catalin Dumitrescu¹⁰, D. Engh¹⁰, Robert Gardner¹⁰, M. Mambelli¹⁰, Y. Smirnov¹⁰, Jens Voeckler¹⁰, Michael Wilde¹⁰, Yong Zhao¹⁰, X. Zhao¹⁰, Paul Avery, Richard Cavanaugh, B. Kim, C.Y. Prescott, Jorge Luis Rodriguez, A. Zahn, Shawn McKee¹¹, Chris Jordan, J. Prewett, T. L. Thomas, Horst Severini, Ben Clifford, Ewa Deelman, L. Flon, Carl Kesselman, Gaurang Mehta, N. Olomu, Karan Vahi, K. De, P McGuigan, M. Sosebee, D. Bradley¹², Peter Couvares¹², A A De Smet¹², C. Kireyev¹², E. Paulson¹², Alain Roy¹², Scott Koranda, B. Moe, B. Brown¹³, Paul Sheldon¹³ - Show less +98 more•Institutions (13)

Argonne National Laboratory¹, Boston University², Brookhaven College³, California Institute of Technology⁴, Hampton University⁵, Harvard University⁶, Indiana University⁷, Johns Hopkins University⁸, Lawrence Berkeley National Laboratory⁹, University of Chicago¹⁰, University of Michigan¹¹, University of Wisconsin-Madison¹², Vanderbilt University¹³

04 Jun 2004

TL;DR: The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory that has sustained for several months the production-level services required by physics experiments of the Large Hadron Collider at CERN, the Sloan Digital Sky Survey project, the gravitational wave search experiment LIGO, the BTeV experiment at Fermilab, as well as applications in molecular structure analysis and genome analysis, and computer science research projects in such areas as job and data scheduling.

...read moreread less

Abstract: The Grid2003 Project has deployed a multivirtual organization, application-driven grid laboratory ("Grid3") that has sustained for several months the production-level services required by physics experiments of the Large Hadron Collider at CERN (ATLAS and CMS), the Sloan Digital Sky Survey project, the gravitational wave search experiment LIGO, the BTeV experiment at Fermilab, as well as applications in molecular structure analysis and genome analysis, and computer science research projects in such areas as job and data scheduling. The deployed infrastructure has been operating since November 2003 with 27 sites, a peak of 2800 processors, work loads from 10 different applications exceeding 1300 simultaneous jobs, and data transfers among sites of greater than 2 TB/day. We describe the principles that have guided the development of this unique infrastructure and the practical experiences that have resulted from its creation and use. We discuss application requirements for grid services deployment and configuration, monitoring infrastructure, application performance, metrics, and operational experiences. We also summarize lessons learned.

...read moreread less

138 citations

Journal Issue•DOI•

Special Issue: The First Provenance Challenge

[...]

Luc Moreau, Bertram Ludäscher, Ilkay Altintas, Roger Barga, Shawn Bowers, Steven P. Callahan, George Chin, Ben Clifford, Shirley Cohen, Sarah Cohen-Boulakia, Susan B. Davidson, Ewa Deelman, Luciano Antonio Digiampietri, Ian Foster, Juliana Freire, James Frew, Joe Futrelle, Tara Gibson, Yolanda Gil, Carole Goble, Jennifer Golbeck, Paul Groth, David A. Holland, Sheng Jiang, Jihie Kim, David Koop, Ales Krenek, Timothy M. McPhillips, Gaurang Mehta, Simon Miles, Dominic Metzger, Steve Munroe, James D. Myers, Beth Plale, Norbert Podhorszki, Varun Ratnakar, Emanuele Santos, Carlos Scheidegger, Karen Schuchardt, Margo Seltzer, Yogesh Simmhan, Cláudio T. Silva, Peter Slaughter, Eric G. Stephan, Robert Stevens, Daniele Turi, Huy T. Vo, Michael Wilde, Jun Zhao, Yong Zhao - Show less +46 more

01 Apr 2008-Concurrency and Computation: Practice and Experience

TL;DR: A functional magnetic resonance imaging workflow was defined, which participants had to either simulate or run in order to produce some provenance representation, from which a set of identified queries had to be implemented and executed.

...read moreread less

Abstract: The first Provenance Challenge was set up in order to provide a forum for the community to understand the capabilities of different provenance systems and the expressiveness of their provenance representations. To this end, a functional magnetic resonance imaging workflow was defined, which participants had to either simulate or run in order to produce some provenance representation, from which a set of identified queries had to be implemented and executed. Sixteen teams responded to the challenge, and submitted their inputs. In this paper, we present the challenge workflow and queries, and summarize the participants' contributions. Copyright © 2007 John Wiley & Sons, Ltd.

...read moreread less

119 citations

1
2
3
4
…
5

Cited by

PDF

Open Access

More filters

Journal Article•

다중혈관 관상동맥 환자에서 y-문합을 이용하여 양쪽 내흉동맥만을 사용한 우회술의 조기 성적

[...]

성기익, 이영탁, 박계현, 전태국, 박표원, 한일용, 장윤희 - Show less +3 more

01 Mar 2003-The Korean Journal of Thoracic and Cardiovascular Surgery

28,685 citations

Proceedings Article•DOI•

Cloud Computing and Grid Computing 360-Degree Compared

[...]

Ian Foster¹, Yong Zhao², Ioan Raicu¹, Shiyong Lu³•Institutions (3)

University of Chicago¹, Microsoft², Wayne State University³

01 Nov 2008

TL;DR: In this article, the authors compare and contrast cloud computing with grid computing from various angles and give insights into the essential characteristics of both the two technologies, and compare the advantages of grid computing and cloud computing.

...read moreread less

Abstract: Cloud computing has become another buzzword after Web 2.0. However, there are dozens of different definitions for cloud computing and there seems to be no consensus on what a cloud is. On the other hand, cloud computing is not a completely new concept; it has intricate connection to the relatively new but thirteen-year established grid computing paradigm, and other relevant technologies such as utility computing, cluster computing, and distributed systems in general. This paper strives to compare and contrast cloud computing with grid computing from various angles and give insights into the essential characteristics of both.

...read moreread less

3,132 citations

Book Chapter•DOI•

Globus toolkit version 4: software for service-oriented systems

[...]

Ian Foster¹•Institutions (1)

Argonne National Laboratory¹

30 Nov 2005

TL;DR: The principal characteristics of the latest release, the Web services-based GT4, which provides significant improvements over previous releases in terms of robustness, performance, usability, documentation, standards compliance, and functionality are summarized.

...read moreread less

Abstract: The Globus Toolkit (GT) has been developed since the late 1990s to support the development of service-oriented distributed computing applications and infrastructures. Core GT components address, within a common framework, basic issues relating to security, resource access, resource management, data movement, resource discovery, and so forth. These components enable a broader “Globus ecosystem” of tools and components that build on, or interoperate with, core GT functionality to provide a wide range of useful application-level functions. These tools have in turn been used to develop a wide range of both “Grid” infrastructures and distributed applications. I summarize here the principal characteristics of the latest release, the Web services-based GT4, which provides significant improvements over previous releases in terms of robustness, performance, usability, documentation, standards compliance, and functionality.

...read moreread less

1,509 citations

Proceedings Article•

The Grid 2: Blueprint for a New Computing Infrastructure

[...]

R.V. van Nieuwpoort

01 Jan 2003

1,212 citations

Journal Article•DOI•

The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments.

[...]

Krzysztof J. Gorgolewski¹, Tibor Auer², Vince D. Calhoun³, R. Cameron Craddock⁴, Samir Das⁵, Eugene P. Duff⁶, Guillaume Flandin⁷, Satrajit S. Ghosh, Tristan Glatard, Yaroslav O. Halchenko⁸, Daniel A. Handwerker⁹, Michael Hanke¹⁰, David Keator¹¹, Xiangrui Li¹², Zachary Michael, Camille Maumet¹³, B. Nolan Nichols¹, Thomas E. Nichols¹³, John Pellman¹⁴, Jean-Baptiste Poline¹⁵, Jean-Baptiste Poline¹⁶, Ariel Rokem¹⁷, Gunnar Schaefer¹⁸, Vanessa Sochat¹, William Triplett¹, Jessica A. Turner¹⁸, Gaël Varoquaux, Russell A. Poldrack¹⁹ - Show less +24 more•Institutions (19)

Stanford University¹, Cognition and Brain Sciences Unit², The Mind Research Network³, Nathan Kline Institute for Psychiatric Research⁴, Montreal Neurological Institute and Hospital⁵, University of Oxford⁶, Wellcome Trust Centre for Neuroimaging⁷, Dartmouth College⁸, National Institutes of Health⁹, Otto-von-Guericke University Magdeburg¹⁰, University of California, Irvine¹¹, Shandong University¹², University of Warwick¹³, MIND Institute¹⁴, Helen Wills Neuroscience Institute¹⁵, Lawrence Berkeley National Laboratory¹⁶, University of Washington¹⁷, Georgia State University¹⁸, California Institute of Technology¹⁹

21 Jun 2016-Scientific Data

TL;DR: The Brain Imaging Data Structure (BIDS) is developed, a standard for organizing and describing MRI datasets that uses file formats compatible with existing software, unifies the majority of practices already common in the field, and captures the metadata necessary for most common data processing operations.

...read moreread less

Abstract: The development of magnetic resonance imaging (MRI) techniques has defined modern neuroimaging. Since its inception, tens of thousands of studies using techniques such as functional MRI and diffusion weighted imaging have allowed for the non-invasive study of the brain. Despite the fact that MRI is routinely used to obtain data for neuroscience research, there has been no widely adopted standard for organizing and describing the data collected in an imaging experiment. This renders sharing and reusing data (within or between labs) difficult if not impossible and unnecessarily complicates the application of automatic pipelines and quality assurance protocols. To solve this problem, we have developed the Brain Imaging Data Structure (BIDS), a standard for organizing and describing MRI datasets. The BIDS standard uses file formats compatible with existing software, unifies the majority of practices already common in the field, and captures the metadata necessary for most common data processing operations.

...read moreread less

1,037 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse