Order-Sensitive Imputation for Clustered Missing Values (Extended Abstract)

Qian Ma,Yu Gu,Wang-Chien Lee,Ge Yu
DOI: https://doi.org/10.1109/ICDE.2019.00268
2019-01-01
Abstract:To study the issue of missing values (MVs), we propose the Order-Sensitive Imputation for Clustered Missing values (OSICM) framework, in which missing values are imputed sequentially such that the values filled earlier in the process are also used for later imputation of other MVs. Obviously, the order of imputations is critical to the effectiveness and efficiency of OSICM framework. We formulate the searching of the optimal imputation order as an optimization problem, and show its NP-hardness. Furthermore, we devise an algorithm to find the exact optimal solution and propose two approximate/heuristic algorithms to trade off effectiveness for efficiency. Finally, we conduct extensive experiments on real and synthetic datasets to demonstrate the superiority of our OSICM framework.
What problem does this paper attempt to address?