
Showing papers by "Michael Wilde" published in 2012


Proceedings ArticleDOI
13 May 2012
TL;DR: The evaluation using synthetic benchmarks shows that a workflow-aware storage system can bring significant performance gains: up to a 7× gain compared to the MosaStore distributed storage system and up to 16× compared to a central, well-provisioned NFS server.
Abstract: This paper evaluates the potential gains a workflow-aware storage system can bring. Two observations make us believe such a storage system is crucial to efficiently supporting workflow-based applications: First, workflows generate irregular and application-dependent data access patterns. These patterns render existing storage systems unable to harness all optimization opportunities, as doing so often requires conflicting optimization options or even conflicting design decisions at the level of the storage system. Second, when scheduling, workflow runtime engines make suboptimal decisions as they lack detailed data location information. This paper discusses the feasibility of, and evaluates the potential performance benefits brought by, building a workflow-aware storage system that supports per-file access optimizations and exposes data location. To this end, this paper presents approaches to determine application-specific data access patterns, and evaluates experimentally the performance gains of a workflow-aware storage approach. Our evaluation using synthetic benchmarks shows that a workflow-aware storage system can bring significant performance gains: up to a 7× gain compared to the MosaStore distributed storage system and up to 16× compared to a central, well-provisioned NFS server.
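Editor's note: as a rough illustration of the per-file optimization idea described in this abstract (not the authors' implementation or MosaStore's API), the sketch below tags each workflow file with an access-pattern hint that a storage layer could use to choose a placement policy. The pattern names and policy mapping are assumptions.

```python
# Hypothetical sketch: per-file access-pattern hints driving storage policy.
# Pattern names and policies are illustrative assumptions, not MosaStore's API.
from dataclasses import dataclass

@dataclass
class FileHint:
    path: str
    pattern: str  # e.g. "pipeline", "broadcast", "reduce"

def placement_policy(hint: FileHint) -> dict:
    """Map an access pattern to a plausible per-file storage policy."""
    if hint.pattern == "pipeline":      # producer and consumer on the same node
        return {"placement": "node-local", "replication": 1}
    if hint.pattern == "broadcast":     # one producer, many consumers
        return {"placement": "replicate", "replication": 4}
    if hint.pattern == "reduce":        # many producers, one consumer
        return {"placement": "collocate-on-consumer", "replication": 1}
    return {"placement": "striped", "replication": 1}   # default

if __name__ == "__main__":
    hints = [FileHint("stage1/out.dat", "pipeline"),
             FileHint("common/index.db", "broadcast")]
    for h in hints:
        print(h.path, placement_policy(h))
```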

34 citations


Proceedings ArticleDOI
20 May 2012
TL;DR: This work presents the architecture of Turbine, a new highly scalable and distributed many-task dataflow engine that executes a generalized many-task intermediate representation with automated self-distribution and is scalable to multi-petaflop infrastructures.
Abstract: Efficiently utilizing the rapidly increasing concurrency of multi-petaflop computing systems is a significant programming challenge. One approach is to structure applications with an upper layer of many loosely-coupled coarse-grained tasks, each comprising a tightly-coupled parallel function or program. "Many-task" programming models such as functional parallel dataflow may be used at the upper layer to generate massive numbers of tasks, each of which generates significant tightly-coupled parallelism at the lower level via multithreading, message passing, and/or partitioned global address spaces. At large scales, however, the management of task distribution, data dependencies, and inter-task data movement is a significant performance challenge. In this work, we describe Turbine, a new highly scalable and distributed many-task dataflow engine. Turbine executes a generalized many-task intermediate representation with automated self-distribution, and is scalable to multi-petaflop infrastructures. We present here the architecture of Turbine and its performance on highly concurrent systems.
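Editor's note: the many-task dataflow model described here can be illustrated with a toy sketch (this is not Turbine's intermediate representation or API): tasks are declared with explicit data dependencies and each one runs as soon as its inputs are ready.

```python
# Toy dataflow execution sketch in the spirit of many-task engines such as Turbine.
# The task graph, names, and scheduling policy are illustrative assumptions.
from concurrent.futures import ThreadPoolExecutor

def run_dataflow(tasks, deps, max_workers=4):
    """tasks: {name: callable(*inputs)}, deps: {name: [input task names]}.
    Each task is submitted once all of its inputs have been submitted and
    starts when their results are available."""
    futures = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        def submit(name):
            if name in futures:
                return futures[name]
            inputs = [submit(d) for d in deps.get(name, [])]
            # Note: blocking on inputs inside workers can deadlock for deep
            # graphs with few workers; a real engine tracks readiness explicitly.
            futures[name] = pool.submit(
                lambda ins=inputs: tasks[name](*[f.result() for f in ins]))
            return futures[name]
        for name in tasks:
            submit(name)
        return {name: f.result() for name, f in futures.items()}

if __name__ == "__main__":
    tasks = {"a": lambda: 2, "b": lambda: 3, "sum": lambda x, y: x + y}
    deps = {"sum": ["a", "b"]}
    print(run_dataflow(tasks, deps)["sum"])   # prints 5
```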

28 citations


Journal ArticleDOI
TL;DR: This work presents a free modeling method for predicting the local structure of loops and large InsEnds in both crystal structures and template-based models; the method ranks as one of the best in the CASP9 refinement category, which involves improving template-based models so that they can function as molecular replacement models to solve the phase problem for crystallographic structure determination.
Abstract: Template-based methods for predicting protein structure provide models for a significant portion of the protein but often contain insertions or chain ends (InsEnds) of indeterminate conformation. The local structure prediction "problem" entails modeling the InsEnds onto the rest of the protein. A well-known limit involves predicting loops of ≤12 residues in crystal structures. However, InsEnds may contain as many as ~50 amino acids, and the template-based model of the protein itself may be imperfect. To address these challenges, we present a free modeling method for predicting the local structure of loops and large InsEnds in both crystal structures and template-based models. The approach uses single amino acid torsional angle "pivot" moves of the protein backbone with a Cβ-level representation. Nevertheless, our accuracy for loops is comparable to existing methods. We also apply a more stringent test, the blind structure prediction and refinement categories of the CASP9 tournament, where we improve the quality of several homology-based models by modeling InsEnds as long as 45 amino acids, sizes generally inaccessible to existing loop prediction methods. Our approach ranks as one of the best in the CASP9 refinement category, which involves improving template-based models so that they can function as molecular replacement models to solve the phase problem for crystallographic structure determination.
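Editor's note: for readers unfamiliar with pivot moves, the following is a minimal geometric sketch (not the authors' code): a single backbone torsion is perturbed by rotating every residue downstream of the chosen bond about that bond's axis. The coarse one-point-per-residue chain and the move size are assumptions.

```python
# Minimal sketch of a single-torsion "pivot" move on a coarse-grained chain.
# Representation and move size are illustrative assumptions, not the paper's model.
import numpy as np

def rotate_about_axis(points, origin, axis, angle):
    """Rotate points about a line through `origin` along `axis` (Rodrigues' formula)."""
    axis = axis / np.linalg.norm(axis)
    p = points - origin
    cos_a, sin_a = np.cos(angle), np.sin(angle)
    rotated = (p * cos_a
               + np.cross(axis, p) * sin_a
               + np.outer(p @ axis, axis) * (1.0 - cos_a))
    return rotated + origin

def pivot_move(chain, i, angle):
    """Rotate all residues downstream of bond (i, i+1) about that bond's axis."""
    new_chain = chain.copy()
    origin, axis = chain[i], chain[i + 1] - chain[i]
    new_chain[i + 2:] = rotate_about_axis(chain[i + 2:], origin, axis, angle)
    return new_chain

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    chain = np.cumsum(rng.normal(size=(20, 3)), axis=0)   # toy 20-residue chain
    moved = pivot_move(chain, i=8, angle=np.deg2rad(15.0))
    print(np.abs(moved - chain).max())                    # only downstream residues moved
```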

26 citations


Journal ArticleDOI
TL;DR: This work contributes MTCProv, a provenance query framework for many-task scientific computing that captures the runtime execution details of MTC workflow tasks on parallel and distributed systems, in addition to standard prospective and data derivation provenance.
Abstract: Scientific research is increasingly assisted by computer-based experiments. Such experiments are often composed of a vast number of loosely-coupled computational tasks that are specified and automated as scientific workflows. This large scale is also characteristic of the data that flows within such "many-task" computations (MTC). Provenance information can record the behavior of such computational experiments via the lineage of process and data artifacts. However, work to date has focused on lineage data models, leaving unsolved issues of recording and querying other aspects, such as domain-specific information about the experiments, MTC behavior given by resource consumption and failure information, or the impact of environment on performance and accuracy. In this work we contribute MTCProv, a provenance query framework for many-task scientific computing that captures the runtime execution details of MTC workflow tasks on parallel and distributed systems, in addition to standard prospective and data derivation provenance. To help users query provenance data we provide a high-level interface that hides relational query complexities. We evaluate MTCProv using an application in protein science, and describe how important query patterns such as correlations between provenance, runtime data, and scientific parameters are simplified and expressed.
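Editor's note: to make the idea of a high-level provenance query interface concrete, here is a hedged sketch. The schema, table names, and helper function are invented for illustration and are not MTCProv's actual model; the point is simply that a correlation query (parameter vs. runtime) can be exposed without the user writing the relational join.

```python
# Illustrative sketch of a high-level provenance query helper.
# Schema, table names, and API are assumptions, not MTCProv's actual design.
import sqlite3

SCHEMA = """
CREATE TABLE task_run   (task_id TEXT PRIMARY KEY, app TEXT, runtime_s REAL, site TEXT);
CREATE TABLE task_param (task_id TEXT, name TEXT, value TEXT);
"""

def correlate(conn, app, param):
    """Return (parameter value, mean runtime) pairs for one application,
    hiding the underlying relational join from the user."""
    return conn.execute(
        """SELECT p.value, AVG(r.runtime_s)
           FROM task_run r JOIN task_param p ON p.task_id = r.task_id
           WHERE r.app = ? AND p.name = ?
           GROUP BY p.value""", (app, param)).fetchall()

if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany("INSERT INTO task_run VALUES (?,?,?,?)",
                     [("t1", "fold", 120.0, "siteA"), ("t2", "fold", 95.0, "siteB")])
    conn.executemany("INSERT INTO task_param VALUES (?,?,?)",
                     [("t1", "temperature", "300"), ("t2", "temperature", "310")])
    print(correlate(conn, "fold", "temperature"))
```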

22 citations


Posted Content
TL;DR: This report discusses many-task computing generically and in the context of the proposed Blue Waters system, which is planned to be the largest NSF-funded supercomputer when it begins production use in 2012.
Abstract: This report discusses many-task computing (MTC) generically and in the context of the proposed Blue Waters system, which is planned to be the largest NSF-funded supercomputer when it begins production use in 2012. The aim of this report is to inform the BW project about MTC, including understanding aspects of MTC applications that can be used to characterize the domain and understanding the implications of these aspects to middleware and policies. Many MTC applications do not neatly fit the stereotypes of high-performance computing (HPC) or high-throughput computing (HTC) applications. Like HTC applications, by definition MTC applications are structured as graphs of discrete tasks, with explicit input and output dependencies forming the graph edges. However, MTC applications have significant features that distinguish them from typical HTC applications. In particular, different engineering constraints for hardware and software must be met in order to support these applications. HTC applications have traditionally run on platforms such as grids and clusters, through either workflow systems or parallel programming systems. MTC applications, in contrast, will often demand a short time to solution, may be communication intensive or data intensive, and may comprise very short tasks. Therefore, hardware and software for MTC must be engineered to support the additional communication and I/O and must minimize task dispatch overheads. The hardware of large-scale HPC systems, with its high degree of parallelism and support for intensive communication, is well suited for MTC applications. However, HPC systems often lack a dynamic resource-provisioning feature, are not ideal for task communication via the file system, and have an I/O system that is not optimized for MTC-style applications. Hence, additional software support is likely to be required to gain full benefit from the HPC hardware.
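Editor's note: the point about minimizing task dispatch overhead can be made concrete with a back-of-the-envelope calculation. The numbers below are illustrative assumptions, not figures from the report: the dispatch rate needed to keep a machine busy grows with core count and shrinks with mean task duration.

```python
# Back-of-the-envelope MTC dispatch-rate estimate; all numbers are illustrative.
def required_dispatch_rate(cores, mean_task_seconds):
    """Tasks/second the runtime must sustain to keep every core busy."""
    return cores / mean_task_seconds

def utilization(dispatch_capacity, cores, mean_task_seconds):
    """Fraction of the machine kept busy given a dispatcher throughput limit."""
    needed = required_dispatch_rate(cores, mean_task_seconds)
    return min(1.0, dispatch_capacity / needed)

if __name__ == "__main__":
    cores, task_s = 300_000, 30.0   # hypothetical petascale partition, short tasks
    print(required_dispatch_rate(cores, task_s))   # 10,000 tasks/s needed
    print(utilization(dispatch_capacity=2_000, cores=cores, mean_task_seconds=task_s))
    # A 2,000 tasks/s dispatcher keeps only ~20% of such a machine busy.
```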

21 citations


Proceedings ArticleDOI
19 Jun 2012
TL;DR: The challenges of reducing the time-to-solution of the data-intensive earthquake simulation workflow "CyberShake" are addressed by supplementing the high-performance parallel computing (HPC) resources on which it typically runs with distributed, heterogeneous resources that can be obtained opportunistically from grids and clouds.
Abstract: In this paper, we address the challenges of reducing the time-to-solution of the data-intensive earthquake simulation workflow "CyberShake" by supplementing the high-performance parallel computing (HPC) resources on which it typically runs with distributed, heterogeneous resources that can be obtained opportunistically from grids and clouds. We seek to minimize time to solution by maximizing the amount of work that can be efficiently done on the distributed resources. We identify data movement as the main bottleneck in effectively utilizing the combined local and distributed resources. We address this by analyzing the I/O characteristics of the application, the processor acquisition rate (from a pilot-job service), and the data movement throughput of the infrastructure. With these factors in mind, we explore a combination of strategies including partitioning of computation (over HPC and distributed resources) and job clustering. We validate our approach with a theoretical study and with preliminary measurements on the Ranger HPC system and distributed Open Science Grid resources. More complete performance results will be presented in the final submission of this paper.
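Editor's note: a simplified model of the partitioning decision described above might look like the following sketch (the rates, costs, and balance criterion are assumed parameters, not the authors' model): choose the fraction of work sent to distributed resources so that remote compute time plus data-movement time is balanced against the HPC side.

```python
# Toy time-to-solution model for splitting work between HPC and distributed resources.
# All rates and the balance criterion are illustrative assumptions.
def time_to_solution(frac_remote, total_work, hpc_rate, remote_rate,
                     bytes_per_unit, wan_throughput):
    """Both partitions run concurrently; remote work also pays for data movement."""
    t_hpc = (1.0 - frac_remote) * total_work / hpc_rate
    t_remote = (frac_remote * total_work / remote_rate
                + frac_remote * total_work * bytes_per_unit / wan_throughput)
    return max(t_hpc, t_remote)

if __name__ == "__main__":
    # Sweep the remote fraction and pick the best split under the assumed parameters.
    best = min(((f / 100.0,
                 time_to_solution(f / 100.0, total_work=1e6, hpc_rate=50.0,
                                  remote_rate=30.0, bytes_per_unit=2e6,
                                  wan_throughput=1e8))
                for f in range(0, 101, 5)), key=lambda x: x[1])
    print("best remote fraction %.2f -> %.0f s" % best)
```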

6 citations


Proceedings ArticleDOI
10 Nov 2012
TL;DR: A new data-parallel library is created, the Parallel Gridded Analysis Library (ParGAL), which can read in data using parallel I/O, store the data on a complete representation of the structured or unstructured mesh, and perform sophisticated analysis on the data in parallel.
Abstract: Climate models are both outputting larger and larger amounts of data and doing so on more sophisticated numerical grids. The tools climate scientists have used to analyze climate output, an essential component of climate modeling, are single-threaded and assume rectangular structured grids in their analysis algorithms. We are bringing both task- and data-parallelism to the analysis of climate model output. We have created a new data-parallel library, the Parallel Gridded Analysis Library (ParGAL), which can read in data using parallel I/O, store the data on a complete representation of the structured or unstructured mesh, and perform sophisticated analysis on the data in parallel. ParGAL has been used to create a parallel version of a script-based analysis and visualization package. Finally, we have also taken current workflows and employed task-based parallelism to decrease the total execution time.
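Editor's note: as a hedged illustration of data-parallel analysis over a decomposed mesh (not ParGAL's API), the sketch below computes a global area-weighted mean from per-rank mesh partitions using MPI reductions; the partitioning, field, and cell areas are assumptions.

```python
# Hedged sketch of data-parallel climate analysis over a partitioned mesh.
# Uses mpi4py reductions; the decomposition and field are illustrative, not ParGAL's API.
import numpy as np
from mpi4py import MPI

def global_area_weighted_mean(local_values, local_areas, comm):
    """Each rank holds the cells of its mesh partition; reduce to a global mean."""
    local_weighted = float(np.sum(local_values * local_areas))
    local_area = float(np.sum(local_areas))
    total_weighted = comm.allreduce(local_weighted, op=MPI.SUM)
    total_area = comm.allreduce(local_area, op=MPI.SUM)
    return total_weighted / total_area

if __name__ == "__main__":
    comm = MPI.COMM_WORLD
    rng = np.random.default_rng(comm.Get_rank())
    values = rng.normal(loc=288.0, scale=5.0, size=10_000)   # e.g. surface temperature (K)
    areas = rng.uniform(0.5, 1.5, size=10_000)               # cell areas for this partition
    mean = global_area_weighted_mean(values, areas, comm)
    if comm.Get_rank() == 0:
        print("global mean:", mean)
    # Run with: mpiexec -n 4 python this_script.py
```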

1 citation