Missing Data Imputation by Utilizing Information Within Incomplete Instances

Shichao Zhang,Zhi Jin,Xiaofeng Zhu
DOI: https://doi.org/10.1016/j.jss.2010.11.887
IF: 3.5
2010-01-01
Journal of Systems and Software
Abstract:This paper proposes to utilize information within incomplete instances (instances with missing values) when estimating missing values. Accordingly, a simple and efficient nonparametric iterative imputation algorithm, called the NIIA method, is designed for iteratively imputing missing target values. The NIIA method imputes each missing value several times until the algorithm converges. In the first iteration, all the complete instances are used to estimate missing values. The information within incomplete instances is utilized since the second imputation iteration. We conduct some experiments for evaluating the efficiency, and demonstrate: (1) the utilization of information within incomplete instances is of benefit to easily capture the distribution of a dataset; and (2) the NIIA method outperforms the existing methods in accuracy, and this advantage is clearly highlighted when datasets have a high missing ratio.
What problem does this paper attempt to address?