scispace - formally typeset
Search or ask a question

Showing papers on "Scientific workflow system published in 2005"


Journal ArticleDOI
01 Sep 2005
TL;DR: The main aspect of the taxonomy categorizes provenance systems based on why they record provenance, what they describe, how they represent and storeprovenance, and ways to disseminate it.
Abstract: Data management is growing in complexity as large-scale applications take advantage of the loosely coupled resources brought together by grid middleware and by abundant storage capacity. Metadata describing the data products used in and generated by these applications is essential to disambiguate the data and enable reuse. Data provenance, one kind of metadata, pertains to the derivation history of a data product starting from its original sources.In this paper we create a taxonomy of data provenance characteristics and apply it to current research efforts in e-science, focusing primarily on scientific workflow approaches. The main aspect of our taxonomy categorizes provenance systems based on why they record provenance, what they describe, how they represent and store provenance, and ways to disseminate it. The survey culminates with an identification of open research problems in the field.

1,214 citations


Journal ArticleDOI
01 Sep 2005
TL;DR: A taxonomy that characterizes and classifies various approaches for building and executing workflows on Grids is proposed that highlights the design and engineering similarities and differences of state-of-the-art in Grid workflow systems, but also identifies the areas that need further research.
Abstract: With the advent of Grid and application technologies, scientists and engineers are building more and more complex applications to manage and process large data sets, and execute scientific experiments on distributed resources. Such application scenarios require means for composing and executing complex workflows. Therefore, many efforts have been made towards the development of workflow management systems for Grid computing. In this paper, we propose a taxonomy that characterizes and classifies various approaches for building and executing workflows on Grids. The taxonomy not only highlights the design and engineering similarities and differences of state-of-the-art in Grid workflow systems, but also identifies the areas that need further research.

583 citations


01 Jan 2005
TL;DR: GridNexus allows the user to assemble complex processes involving data retrieval, analysis and visualization by building a directed acyclic graph in a visual environment and increases system flexibility while retaining interactive capabilities.
Abstract: We introduce GridNexus, a graphical system for creating and executing scientific workflows in a grid environment 1. . GridNexus allows the user to assemble complex processes involving data retrieval, analysis and visualization by building a directed acyclic graph in a visual environment. Workflows in GridNexus are described by a script written in a language called JXPL. The script can be executed either locally or in a remote managed-job environment. GridNexus enables the composition of larger software modules from smaller ones to build very complex tasks and separates the graphical user interface (GUI) from execution of the workflow. This increases system flexibility while retaining interactive capabilities. We compare our approach with that of similar projects, and provide an assessment of the system by highlighting two real-world scientific applications that have taken advantage of the GridNexus environment.

38 citations