Semantic metadata generation for large scientific workflows

Jihie Kim, Yolanda Gil, Varun Ratnakar

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

15 Scopus citations

Abstract

In recent years, workflows have been increasingly used in scientific applications. This paper presents novel metadata reasoning capabilities that we have developed to support the creation of large workflows. They include 1) use of semantic web technologies in handling metadata constraints on file collections and nested file collections, 2) propagation and validation of metadata constraints from inputs to outputs in a workflow component, and through the links among components in a workflow, and 3) sub-workflows that generate metadata needed for workflow creation. We show how we used these capabilities to support the creation of large executable workflows in an earthquake science application with more than 7,000 jobs, generating metadata for more than 100,000 new files.

Original languageEnglish
Title of host publicationThe Semantic Web - ISWC 2006 - 5th International Semantic Web Conference, ISWC 2006, Proceedings
PublisherSpringer Verlag
Pages357-370
Number of pages14
ISBN (Print)3540490299, 9783540490296
DOIs
StatePublished - 2006
Event5th International Semantic Web Conference, ISWC 2006 - Athens, GA, United States
Duration: 5 Nov 20069 Nov 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4273 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference5th International Semantic Web Conference, ISWC 2006
Country/TerritoryUnited States
CityAthens, GA
Period5/11/069/11/06

Keywords

  • Grid workflows
  • Metadata reasoning
  • Workflow generation

Fingerprint

Dive into the research topics of 'Semantic metadata generation for large scientific workflows'. Together they form a unique fingerprint.

Cite this