You are here
D-PROV: Extending the PROV Provenance Model with Workﬂow Structure
Paolo Missier, Newcastle University; Saumen Dey, University of California, Davis; Khalid Belhajjame, University of Manchester; Victor Cuevas-Vicenttín and Bertram Ludäscher, University of California, Davis
This paper presents an extension to the W3C PROV provenance model, aimed at representing process structure. Although the modelling of process structure is out of the scope of the PROV speciﬁcation, it is beneﬁcial when capturing and analyzing the provenance of data that is produced by programs or other formally encoded processes. In the paper, we motivate the need for such and extended model in the context of an ongoing large data federation and preservation project, DataONE, where provenance traces of scientiﬁc workﬂow runs are captured and stored alongside the data products. We introduce new provenance relations for modelling process structure along with their usage patterns, and present sample queries that demonstrate their beneﬁt.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.