Provenance in Open Data Entity-Centric Aggregation

Fausto Giunchiglia, Moaz Reyad
DOI: https://doi.org/10.1007/978-3-319-16462-5_22
2015-01-01
Abstract: Recently, an increasing number of open data catalogs have appeared on the Web [1]. These catalogs contain data that represent real-world entities and their attributes. Data can be imported from several catalogs to build web services; hence, there is a need to trace the source of each entity and attribute value in a way that also handles possible conflicts between attribute values coming from overlapping sources [2]. For open data, source tracing requires capturing both the provenance [3] of the attribute values and the identity links [4] between entities. Moreover, resolving conflicts manually becomes harder as the size of the data increases.