Leveraging Currency for Repairing Inconsistent and Incomplete Data (extended Abstract).

Xiaoou Ding,Hongzhi Wang,Jiaxuan Su,Muxian Wang,Jianzhong Li,Hong Gao
DOI: https://doi.org/10.1109/icde51399.2021.00243
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:With the growth of data from various sources, data quality is faced with multiple problems. In this paper, we study the multiple data cleaning on incompleteness and inconsistency with currency reasoning and determination. We introduce a 4-step method, named Imp3C, for error detection and repair in incomplete and inconsistent data without timestamps. We propose an integrated currency determining approach to compute currency order among tuples, thus, the dirty data can be repaired effectively considering the temporal impact. Experiments on three real-life datasets verify that Imp3C improves data repairing performance with multiple quality problems, especially in datasets with complex currency orders.
What problem does this paper attempt to address?