It’s About the Data: Provenance as a Tool for Assessing Data Fitness


Adriane Chapman, M. David Allen, and Barbara Blaustein, The MITRE Corporation


The end goal of provenance is to assist users in understanding their data: How was it created? When? By whom? How was it manipulated? In other words, provenance is a powerful tool to help users answer the question, “Is this data fit for use?” However, there is no one set of criteria that make data “fit for use”. The criteria depend on the user, the task at hand, and the current situation. In this work we describe Fitness Widgets, predefined queries over provenance graphs that users can customize to determine data fitness. We have implemented Fitness Widgets in our provenance system, PLUS.


Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

@inproceedings {179547,
title = {{It{\textquoteright}s} About the Data: Provenance as a Tool for Assessing Data Fitness},
booktitle = {4th USENIX Workshop on the Theory and Practice of Provenance (TaPP 12)},
year = {2012},
address = {Boston, MA},
url = {},
publisher = {USENIX Association},
month = jun,

Presentation Video

Presentation Audio