Provenance trails in the Wings/Pegasus system

Jihie Kim, Ewa Deelman, Yolanda Gil, Gaurang Mehta, Varun Ratnakar

Research output: Contribution to journalArticlepeer-review

112 Scopus citations

Abstract

Our research focuses on creating and executing large-scale scientific workflows that often involve thousands of computations over distributed, shared resources. We describe an approach to workflow creation and refinement that uses semantic representations to (1) describe complex scientific applications in a data-independent manner, (2) automatically generate workflows of computations for given data sets, and (3) map the workflows to available computing resources for efficient execution. Our approach is implemented in the Wings/Pegasus workflow system and has been demonstrated in a variety of scientific application domains. This paper illustrates the application-level provenance information generated Wings during workflow creation and the refinement provenance by the Pegasus mapping system for execution over grid computing environments. We show how this information is used in answering the queries of the First Provenance Challenge.

Original languageEnglish
Pages (from-to)587-597
Number of pages11
JournalConcurrency Computation Practice and Experience
Volume20
Issue number5
DOIs
StatePublished - 10 Apr 2008

Keywords

  • Large scientific workflows
  • Refinement provenance
  • Semantic metadata
  • Workflow mapping
  • Workflow provenance
  • Workflow validation

Fingerprint

Dive into the research topics of 'Provenance trails in the Wings/Pegasus system'. Together they form a unique fingerprint.

Cite this