Discovering Abnormal Data in RDF Knowledge Base

HE Binbin, ZOU Lei, ZHAO Dongyan
DOI: https://doi.org/10.13209/j.0479-8023.2015.033
2015-01-01
Abstract: To effectively improve the data quality of RDF knowledge bases, a solution is proposed for abnormal data discovery and erroneous data repair in RDF graphs. First, the authors introduce the graph-based conditional functional dependency (GCFD), which can represent both the attribute-value dependencies and the semantic-structure dependencies of RDF data in a uniform manner. Then, an efficient framework and some novel pruning rules are proposed to discover GCFDs, and a workflow for automatically repairing erroneous data is given. Extensive experiments on several real-life RDF repositories confirm the superiority of the proposed solution.