A Rule-Based Interactive Data Cleaning Technique

MENG Jian, DONG Yi-sheng, WANG Yong-li
DOI: https://doi.org/10.3969/j.issn.1673-629x.2005.04.048
2005-01-01
Abstract: Existing data cleaning tools have three shortcomings. First, they lack human interaction, so users cannot control the data cleaning process or resolve exceptions that arise during it. Second, they lack a logical declaration of data transformation rules and data cleaning rules, so the rules are not independent of their physical implementation. Third, they lack metadata management, so users cannot analyse or adjust the data cleaning process. This paper proposes a new rule-based interactive data cleaning framework to address these shortcomings, so that data cleaning becomes more efficient and data quality can be guaranteed. By describing the definition and execution of cleaning rules, the paper also presents the architecture of the data cleaning framework.
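The abstract does not give the paper's rule syntax or architecture in detail, so the following is only a minimal sketch of the three ideas it names: rules declared separately from the engine that executes them, a callback that hands unresolved exceptions to the user, and a metadata log of every change. All names here (CleaningRule, CleaningEngine, ask_user, repair_age) are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional

Record = Dict[str, str]


@dataclass
class CleaningRule:
    """Declarative rule: what to check and how to repair it (hypothetical)."""
    name: str
    column: str
    is_valid: Callable[[str], bool]
    repair: Callable[[str], Optional[str]]  # returns None if it cannot repair


@dataclass
class CleaningEngine:
    """Executes rules; knows nothing about any particular rule's logic."""
    rules: List[CleaningRule]
    # Stands in for the interactive user: called when a rule cannot repair.
    ask_user: Callable[[CleaningRule, Record], str]
    # Metadata: one entry per change, so the process can be reviewed later.
    log: List[dict] = field(default_factory=list)

    def clean(self, record: Record) -> Record:
        cleaned = dict(record)  # leave the caller's record untouched
        for rule in self.rules:
            value = cleaned[rule.column]
            if rule.is_valid(value):
                continue
            fixed = rule.repair(value)
            resolved_by = "rule"
            if fixed is None:
                # Exception case: defer to the user instead of failing.
                fixed = self.ask_user(rule, cleaned)
                resolved_by = "user"
            self.log.append({
                "rule": rule.name,
                "column": rule.column,
                "old": value,
                "new": fixed,
                "resolved_by": resolved_by,
            })
            cleaned[rule.column] = fixed
        return cleaned


def repair_age(value: str) -> Optional[str]:
    """Strip whitespace; give up if the result is still not a number."""
    stripped = value.strip()
    return stripped if stripped.isdigit() else None


age_rule = CleaningRule(
    name="age_is_integer",
    column="age",
    is_valid=str.isdigit,
    repair=repair_age,
)

if __name__ == "__main__":
    # The lambda simulates a user who answers "0" for every exception.
    engine = CleaningEngine(rules=[age_rule], ask_user=lambda rule, rec: "0")
    print(engine.clean({"name": "Li", "age": " 42 "}))
    print(engine.clean({"name": "Wu", "age": "unknown"}))
    for entry in engine.log:
        print(entry)
```

Because the rule is plain data plus two small functions, it could be swapped or re-ordered without touching the engine, which is one way to read the abstract's requirement that rules be independent of their physical implementation.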