usenix conference policies
Automatic Versus Manual Provenance Abstractions: Mind the Gap
Pinar Alper, University of Manchester; Khalid Belhajjame, Université Paris Dauphine; Carole A. Goble, University of Manchester
In recent years the need to simplify or to hide sensitive information in provenance has given way to research on provenance abstraction. In the context of scientific workflows, existing research provides techniques to semi-automatically create abstractions of a given workflow description, which is in turn used as filters over the workflow’s provenance traces. An alternative approach that is commonly adopted by scientists is to build workflows with abstractions embedded into the workflow’s design, such as using subworkflows. This paper reports on the comparison of manual versus semi-automated approaches in a context where result abstractions are used to filter report-worthy results of computational scientific analyses. Specifically; we take a real-world workflow containing user-created design abstractions and compare these with abstractions created by ZOOM*UserViews andWorkflow Summaries systems. Our comparison shows that semi-automatic and manual approaches largely overlap from a process perspective, meanwhile, there is a dramatic mismatch in terms of data artefacts retained in an abstracted account of derivation.We discuss reasons and suggest future research directions.
Open Access Media
USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.
title = {Automatic Versus Manual Provenance Abstractions: Mind the Gap},
booktitle = {8th USENIX Workshop on the Theory and Practice of Provenance (TaPP 16)},
year = {2016},
address = {Washington, D.C.},
url = {https://www.usenix.org/conference/tapp16/workshop-program/presentation/alper},
publisher = {USENIX Association},
month = jun
}
connect with us