Capturing Interactive Data Transformation Operations Using Provenance Workflows
by Tope Omitola, André Freitas, Edward Curry, Séan O'Riain, Nicholas Gibbins, Nigel Shadbolt
Abstract:
The ready availability of data is leading to the increased opportunity of their re-use for new applications and for analyses. Most of these data are not necessarily in the format users want, are usually heterogeneous, and highly dynamic, and this necessitates data transformation efforts to re-purpose them. Interactive data transformation (IDT) tools are becoming easily available to lower these barriers to data transformation efforts. This paper describes a principled way to capture data lineage of interactive data transformation processes. We provide a formal model of IDT, its mapping to a provenance representation, and its implementation and validation on Google Refine. Provision of the data transformation process sequences allows assessment of data quality and ensures portability between IDT and other data transformation platforms. The proposed model showed a high level of coverage against a set of requirements used for evaluating systems that provide provenance management solutions.
Reference:
Tope Omitola, André Freitas, Edward Curry, Séan O'Riain, Nicholas Gibbins, Nigel Shadbolt, "Capturing Interactive Data Transformation Operations Using Provenance Workflows", Chapter in 3rd International Workshop on Role of Semantic Web in Provenance Management (SWPM 2012), pp. 29-42, 2012. [slides]
Bibtex Entry:
@incollection{Omitola2012,
abstract = {The ready availability of data is leading to the increased opportunity of their re-use for new applications and for analyses. Most of these data are not necessarily in the format users want, are usually heterogeneous, and highly dynamic, and this necessitates data transformation efforts to re-purpose them. Interactive data transformation (IDT) tools are becoming easily available to lower these barriers to data transformation efforts. This paper describes a principled way to capture data lineage of interactive data transformation processes. We provide a formal model of IDT, its mapping to a provenance representation, and its implementation and validation on Google Refine. Provision of the data transformation process sequences allows assessment of data quality and ensures portability between IDT and other data transformation platforms. The proposed model showed a high level of coverage against a set of requirements used for evaluating systems that provide provenance management solutions.},
annote = {<a href="http://www.slideshare.net/andrenfreitas/eswc-idts-final">[slides]</a>},
author = {Omitola, Tope and Freitas, Andr{\'{e}} and Curry, Edward and O'Riain, S{\'{e}}an and Gibbins, Nicholas and Shadbolt, Nigel},
booktitle = {3rd International Workshop on Role of Semantic Web in Provenance Management (SWPM 2012)},
doi = {10.1007/978-3-662-46641-4_3},
file = {:Users/ed/Library/Application Support/Mendeley Desktop/Downloaded/Omitola et al. - 2012 - Capturing Interactive Data Transformation Operations Using Provenance Workflows.pdf:pdf},
keywords = {Consumption,Data,Data Publication,Extract-Transform-Load,Interactive Data Transformation,LEIdataspace,Linked Data,Provenance,Public Open Data,Semantic Web,Workflow},
mendeley-tags = {LEIdataspace},
pages = {29--42},
title = {{Capturing Interactive Data Transformation Operations Using Provenance Workflows}},
url = {http://www.edwardcurry.org/publications/Omitola_Eswc2012_Provenance_Wkshop.pdf},
year = {2012}
}