Temporal Representation for Scientific Data Provenance

dc.altmetrics.displaytrue
dc.contributor.authorChen, Peng
dc.contributor.authorPlale, Beth
dc.contributor.authorAktas, Mehmet S.
dc.date.accessioned2012-09-14T20:11:54Z
dc.date.available2012-09-14T20:11:54Z
dc.date.issued2012-09
dc.description.abstractProvenance of digital scientific data is an important piece of the metadata of a data object. It can however grow voluminous quickly because the granularity level of capture can be high. It can also be quite feature rich. We propose a representation of the provenance data based on logical time that reduces the feature space. Creating time and frequency domain representations of the provenance, we apply clustering, classification and association rule mining to the abstract representations to determine the usefulness of the temporal representation. We evaluate the temporal representation using an existing 10 GB database of provenance captured from a range of scientific workflows.
dc.description.sponsorshipNASA grant NNX10AM03G
dc.identifier.citationChen, Peng, Beth Plale, and Mehmet Aktas. “Temporal Representation for Scientific Data Provenance.” Preprint of paper accepted for the 8th IEEE International Conference on eScience (eScience 2012), submitted September 14, 2012. http://hdl.handle.net/2022/14665.
dc.identifier.urihttps://hdl.handle.net/2022/14665
dc.language.isoen_US
dc.publisher
dc.subjectprovenance representation
dc.subjectlogical clock
dc.subjecttemporal data mining
dc.titleTemporal Representation for Scientific Data Provenance
dc.typePreprint

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
eSciencePrePrint.pdf
Size:
602.42 KB
Format:
Adobe Portable Document Format
Can’t use the file because of accessibility barriers? Contact us