Author

Galen M. Shipman

Bio: Galen M. Shipman is an academic researcher from Los Alamos National Laboratory. The author has contributed to research on topics including file systems and Lustre (file system). The author has an h-index of 27 and has co-authored 83 publications receiving 2,103 citations. Previous affiliations of Galen M. Shipman include the National Center for Computational Sciences and Oak Ridge National Laboratory.


Papers
Proceedings ArticleDOI
25 Sep 2006
TL;DR: This work describes Open MPI's architecture for heterogeneous network and processor support, and demonstrates the transparency to the application developer while maintaining very high levels of performance.
Abstract: The growth in the number of generally available, distributed, heterogeneous computing systems places increasing importance on the development of user-friendly tools that enable application developers to use these resources efficiently. Open MPI provides support for several aspects of heterogeneity within a single, open-source MPI implementation. Through careful abstractions, heterogeneous support maintains efficient use of uniform computational platforms. We describe Open MPI's architecture for heterogeneous network and processor support. A key design feature of this implementation is its transparency to the application developer while maintaining very high levels of performance. This is demonstrated with the results of several numerical experiments.
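As a loose illustration of what "transparent to the application developer" means in practice, the sketch below is my own example, not taken from the paper; it uses the mpi4py bindings rather than Open MPI's C API. It exchanges a NumPy buffer between two ranks with no architecture-specific code: any datatype or endianness conversion between heterogeneous hosts happens inside the MPI layer.

from mpi4py import MPI      # assumes an MPI implementation such as Open MPI underneath
import numpy as np

comm = MPI.COMM_WORLD
rank = comm.Get_rank()

buf = np.arange(8, dtype=np.float64)
if rank == 0:
    # Sender and receiver may run on different architectures;
    # the application code does not need to know or care.
    comm.Send([buf, MPI.DOUBLE], dest=1, tag=0)
elif rank == 1:
    comm.Recv([buf, MPI.DOUBLE], source=0, tag=0)
    print("rank 1 received:", buf)

Run with, for example, mpirun -np 2 python exchange.py; the script name is illustrative.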

152 citations

Patent
28 Jan 2011
TL;DR: An optimized redundant array of solid-state devices may include one or more optimized solid-state devices and a controller coupled to those devices for managing them.
Abstract: An optimized redundant array of solid state devices may include an array of one or more optimized solid-state devices and a controller coupled to the solid-state devices for managing the solid-state devices. The controller may be configured to globally coordinate the garbage collection activities of each of said optimized solid-state devices, for instance, to minimize the degraded performance time and increase the optimal performance time of the entire array of devices.
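A minimal sketch of the coordination idea follows, under my own assumptions: the class names, the threshold policy, and the write-steering heuristic are illustrative and not taken from the patent. A controller bounds how many devices reclaim blocks at once and steers new writes away from devices that are currently garbage collecting.

class SSD:
    """Toy model of one device in the array."""
    def __init__(self, name, gc_threshold=0.2):
        self.name = name
        self.free_ratio = 1.0          # fraction of free blocks
        self.gc_threshold = gc_threshold
        self.in_gc = False

    def needs_gc(self):
        return self.free_ratio < self.gc_threshold


class ArrayController:
    """Globally coordinates GC so only a bounded number of devices
    are in a degraded (garbage-collecting) state at any time."""
    def __init__(self, devices, max_concurrent_gc=1):
        self.devices = devices
        self.max_concurrent_gc = max_concurrent_gc

    def schedule_gc(self):
        active = sum(d.in_gc for d in self.devices)
        for d in self.devices:
            if active >= self.max_concurrent_gc:
                break
            if d.needs_gc() and not d.in_gc:
                d.in_gc = True         # permit this device to reclaim blocks
                active += 1

    def pick_write_target(self):
        # Steer incoming writes toward devices not currently in GC.
        candidates = [d for d in self.devices if not d.in_gc] or self.devices
        return max(candidates, key=lambda d: d.free_ratio)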

138 citations

Proceedings ArticleDOI
10 Apr 2011
TL;DR: This paper examines the GC process and proposes a semi-preemptive GC scheme that can preempt ongoing GC processing to service pending I/O requests in the queue, and that further enhances flash performance by pipelining internal GC operations and merging them with pending I/O requests whenever possible.
Abstract: NAND flash memory is a preferred storage medium for various platforms ranging from embedded systems to enterprise-scale systems. Flash devices do not have any mechanical moving parts and provide low-latency access. They also require less power compared to rotating media. Unlike hard disks, flash devices use out-of-place updates and require a garbage collection (GC) process to reclaim invalid pages and create free blocks. This GC process is a major cause of performance degradation when running concurrently with other I/O operations, as internal bandwidth is consumed to reclaim these invalid pages. The invocation of the GC process is generally governed by a low watermark on free blocks and other internal device metrics that different workloads meet at different intervals. This results in I/O performance that is highly dependent on workload characteristics. In this paper, we examine the GC process and propose a semi-preemptive GC scheme that can preempt on-going GC processing and service pending I/O requests in the queue. Moreover, we further enhance flash performance by pipelining internal GC operations and merging them with pending I/O requests whenever possible. Our experimental evaluation of this semi-preemptive GC scheme with realistic workloads demonstrates both improved performance and reduced performance variability. Write-dominant workloads show up to a 66.56% improvement in average response time with an 83.30% reduction in response-time variance compared to the non-preemptive GC scheme.
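To make the mechanism concrete, here is a rough sketch of the preemption and merge logic under my own assumptions; the flash, victim_block, and request objects are hypothetical stand-ins for driver state, and this is not the authors' implementation. GC relocates valid pages one at a time, checks the pending I/O queue at each page boundary, and skips relocating a page whose logical address is about to be overwritten anyway.

def semi_preemptive_gc(victim_block, io_queue, flash):
    """io_queue: list of pending host requests (each with .op, .lpn, .data).
    flash / victim_block: hypothetical driver objects, for illustration only."""
    for page in list(victim_block.valid_pages):
        merged = False
        # Preemption point: service I/O that arrived while GC was running.
        while io_queue and not merged:
            req = io_queue.pop(0)
            if req.op == "write" and req.lpn == page.lpn:
                # Merge: the host overwrites this page anyway, so service
                # the write now and skip relocating the stale copy.
                flash.write(req.lpn, req.data)
                victim_block.valid_pages.remove(page)
                merged = True
            else:
                flash.service(req)
        if not merged:
            flash.copy_page(page, victim_block)   # relocate valid data
    flash.erase(victim_block)                     # reclaim the free block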

118 citations

Journal ArticleDOI
TL;DR: Several recent applications of big and deep data analysis methods are reviewed that visualize, compress, and translate multidimensional structural and functional data into physically and chemically relevant information.
Abstract: The development of electron and scanning probe microscopies in the second half of the twentieth century has produced spectacular images of the internal structure and composition of matter with nanometer, molecular, and atomic resolution. Largely, this progress was enabled by computer-assisted methods of microscope operation, data acquisition, and analysis. Advances in imaging technology in the beginning of the twenty-first century have opened the proverbial floodgates on the availability of high-veracity information on structure and functionality. From the hardware perspective, high-resolution imaging methods now routinely resolve atomic positions with approximately picometer precision, allowing for quantitative measurements of individual bond lengths and angles. Similarly, functional imaging often leads to multidimensional data sets containing partial or full information on properties of interest, acquired as a function of multiple parameters (time, temperature, or other external stimuli). Here, we review several recent applications of the big and deep data analysis methods to visualize, compress, and translate this multidimensional structural and functional data into physically and chemically relevant information.
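As a concrete, if simplified, example of compressing such multidimensional data, the sketch below applies a truncated SVD to a hyperspectral image cube. The array shapes, the random placeholder data, and the choice of plain SVD are my own assumptions, not the specific methods reviewed in the paper.

import numpy as np

nx, ny, nspec = 64, 64, 500                # pixels x pixels x spectrum length
data = np.random.rand(nx, ny, nspec)       # placeholder for a measured data set

X = data.reshape(nx * ny, nspec)           # unfold: one spectrum per pixel
X = X - X.mean(axis=0)                     # center before decomposition

U, S, Vt = np.linalg.svd(X, full_matrices=False)
k = 4                                      # number of retained components
scores = U[:, :k] * S[:k]                  # per-pixel component weights
maps = scores.reshape(nx, ny, k)           # spatial loading maps
spectra = Vt[:k]                           # component spectra

X_compressed = scores @ spectra            # rank-k approximation of the data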

101 citations


Cited by
Journal ArticleDOI
TL;DR: Several of the fundamental algorithms used in LAMMPS are described along with the design strategies which have made it flexible for both users and developers, and some capabilities recently added to the code which were enabled by this flexibility are highlighted.

1,956 citations

Journal ArticleDOI
TL;DR: The Large-scale Atomic/Molecular Massively Parallel Simulator (LAMMPS) is a simulator for particle-based modeling of materials at length scales ranging from atomic to mesoscale to continuum.

1,517 citations

Proceedings ArticleDOI
11 Jun 2012
TL;DR: This paper collects detailed traces from Facebook's Memcached deployment, arguably the world's largest, and analyzes the workloads from multiple angles, including: request composition, size, and rate; cache efficacy; temporal patterns; and application use cases.
Abstract: Key-value stores are a vital component in many scale-out enterprises, including social networks, online retail, and risk analysis. Accordingly, they are receiving increased attention from the research community in an effort to improve their performance, scalability, reliability, cost, and power consumption. To be effective, such efforts require a detailed understanding of realistic key-value workloads. And yet little is known about these workloads outside of the companies that operate them. This paper aims to address this gap. To this end, we have collected detailed traces from Facebook's Memcached deployment, arguably the world's largest. The traces capture over 284 billion requests from five different Memcached use cases over several days. We analyze the workloads from multiple angles, including: request composition, size, and rate; cache efficacy; temporal patterns; and application use cases. We also propose a simple model of the most representative trace to enable the generation of more realistic synthetic workloads by the community. Our analysis details many characteristics of the caching workload. It also reveals a number of surprises: a GET/SET ratio of 30:1 that is higher than assumed in the literature; some applications of Memcached behave more like persistent storage than a cache; strong locality metrics, such as keys accessed many millions of times a day, do not always suffice for a high hit rate; and there is still room for efficiency and hit rate improvements in Memcached's implementation. Toward the last point, we make several suggestions that address the exposed deficiencies.
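For flavor, here is a toy sketch of the kind of request-composition analysis described above. The trace file name and its whitespace-separated "timestamp op key size" line format are my assumptions (the actual Facebook traces are not in this format), so this is illustrative only.

from collections import Counter

ops, key_hits = Counter(), Counter()
with open("memcached_trace.txt") as trace:       # hypothetical trace file
    for line in trace:
        _ts, op, key, _size = line.split()[:4]
        op = op.upper()
        ops[op] += 1                             # request composition
        if op == "GET":
            key_hits[key] += 1                   # key popularity / locality

ratio = ops["GET"] / max(ops["SET"], 1)
print(f"GET:SET ratio = {ratio:.1f}:1")
print("hottest keys:", key_hits.most_common(5))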

880 citations

Journal ArticleDOI
TL;DR: This study aims to provide a common basis for CPM climate simulations by giving a holistic review of the topic, and presents the consolidated outcome of studies that addressed the added value of CPM climate simulations compared to LSMs.
Abstract: Regional climate modeling using convection-permitting models (CPMs; horizontal grid spacing <4 km) has emerged as a promising framework for providing more reliable climate information on regional to local scales than traditionally used large-scale models (LSMs; grid spacing >10 km). CPMs no longer rely on convection parameterization schemes, which had been identified as a major source of errors and uncertainties in LSMs. Moreover, CPMs allow for a more accurate representation of surface and orography fields. The drawback of CPMs is their high demand on computational resources. For this reason, the first CPM climate simulations only appeared a decade ago. In this study, we aim to provide a common basis for CPM climate simulations by giving a holistic review of the topic. The most important components in CPMs, such as physical parameterizations and dynamical formulations, are discussed critically. An overview of weaknesses and an outlook on required future developments are provided. Most importantly, this review presents the consolidated outcome of studies that addressed the added value of CPM climate simulations compared to LSMs. Improvements are evident mostly for climate statistics related to deep convection, mountainous regions, or extreme events. The climate change signals of CPM simulations suggest an increase in flash floods, changes in hail storm characteristics, and reductions in snowpack over mountains. In conclusion, CPMs are a very promising tool for future climate research. However, coordinated modeling programs are crucially needed to advance parameterizations of unresolved physics and to assess the full potential of CPMs.

833 citations