Towards a Model of Provenance and User Views in Scientific Workflows

Penn collection
Departmental Papers (CIS)
Subject
scientific workflows
provenance
database
Author
Cohen, Shirley
Cohen-Boulakia, Sarah
Abstract

Scientific experiments are becoming increasingly large and complex, with a commensurate increase in the amount and complexity of data generated. Data, both intermediate and final results, are derived by chaining and nesting together multiple database searches and analytical tools. In many cases, the means by which the data are produced is not known, making the data difficult to interpret and the experiment impossible to reproduce. Provenance in scientific workflows is thus of paramount importance. In this paper, we provide a formal model of provenance for scientific workflows which is general (i.e., can be used with existing workflow systems, such as Kepler, myGrid and Chimera) and sufficiently expressive to answer the provenance queries we encountered in a number of case studies. Interestingly, our model not only takes into account the chained and nested structure of scientific workflows, but also allows users to ask for provenance at different levels of abstraction (user views).

Date of presentation
2006-07-01
Conference name
Departmental Papers (CIS)
Conference dates
2023-05-17T00:20:39.000
Comments
Postprint version. Published in Lecture Notes in Computer Science, Volume 4076, July 2006, pages 264-279. Publisher URL: http://dx.doi.org/10.1007/11799511