Provenance Analysis: Towards Quality Provenance

Date

2012

Abstract

Data provenance, a key piece of metadata that describes the lifecycle of a data product, is crucial in helping scientists understand scientific results and in facilitating their reproducibility and reuse. Provenance collection systems often capture provenance on the fly, and the protocol between the application and the provenance tool may not be reliable. As a result, data provenance can become ambiguous or simply inaccurate. In this paper, we identify likely quality issues in data provenance. We also establish quality dimensions that are especially critical for evaluating provenance quality. We analyze synthetic and real-world provenance based on these quality dimensions and summarize our contributions to provenance quality.

Keywords

Data Provenance, Provenance Quality, Scientific Workflows, Provenance Analysis
