scispace - formally typeset
Open AccessProceedings Article

Best practices for workflow design: how to prevent workflow decay

TLDR
It is argued that good workflow design is a prerequisite for repairing a workflow, or redesigning an equivalent workflow pattern with new components, and the semantic tooling that is being developed in the Workflow4Ever project to support these best practices are presented.
Abstract
In this position paper we present a set of best practices for workflow design to prevent workflow decay and increase reuse and re-purposing of scientific workflows. MyExperiment provides access to a large number of scientific workflows. However, scientists find it difficult to reuse or re-purpose these workflows for mainly two reasons: workflows suffer from decay over time and lack sufficient metadata to understand their purpose. We argue that good workflow design is a prerequisite for repairing a workflow, or redesigning an equivalent workflow pattern with new components. We present a set of best practices for workflow design and the semantic tooling that is being developed in the Workflow4Ever (Wf4Ever) project to support these best practices.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities

TL;DR: This study characterize and define the criteria that need to be catered for by reproducibility-friendly scientific workflow systems, and use such criteria to place several representative and widely used workflow systems and companion tools within such a framework.
Proceedings ArticleDOI

Four level provenance support to achieve portable reproducibility of scientific workflows

TL;DR: The necessary and satisfactory parameters of workflow reproducecibility are investigated and a mathematical formula to determine the rate of reproducibility is given and these measurements allow the scientist to make a decision about the next steps toward the creation of reproducible workflows.
Journal ArticleDOI

Repeat: a framework to assess empirical reproducibility in biomedical research

TL;DR: The use of RepeAT may allow the biomedical community to have a better understanding of the current practices of research transparency and accessibility among principal investigators and common adoption ofrepeAT may improve reporting of research practices and the availability of research outputs.
References
More filters
Journal ArticleDOI

Why linked data is not enough for scientists

TL;DR: This paper makes the case for a scientific data publication model on top of linked data and introduces the notion of Research Objects as first class citizens for sharing and publishing.
Proceedings ArticleDOI

Why workflows break — Understanding and combating decay in Taverna workflows

TL;DR: A minimal set of auxiliary resources to be preserved together with the workflows as an aggregation object and provide a software tool for end-users to create such aggregations and to assess their completeness.
Book ChapterDOI

Seven bottlenecks to workflow reuse and repurposing

TL;DR: Based on a comparison of e-Science middleware projects, this paper identifies seven bottlenecks to scalable reuse and repurposing, and includes some thoughts on the applicability of using OWL for two bott lenecks: workflow fragment discovery and the ranking of fragments.
Related Papers (5)