Research on data cleaning based on domain-ontology

WANG Hao,XU Hong-bing
DOI: https://doi.org/10.3969/j.issn.1000-7024.2006.22.032
2006-01-01
Abstract:The semantic issues in data cleaning are investigated.Two concepts,domain concept tree and accuracy level node set,are proposed based on domain ontologies.A new method to clean data is presented on the basis of the two concepts.This method improve the cleaning quality due to the use of semantics included in domain ontologies.Compared to the traditional methods,it not only scale better,but also attain much higher cleaining quanlity,because it only interact with domain ontologies.
What problem does this paper attempt to address?