A New Effective Information Decomposition Approach for Missing Data Recovery

Shigang Liu, Honghua Dai
2014-01-01
Abstract: It is well recognized that missing data can cause severe problems in data mining. Because of the importance of this issue, a great deal of work has been done in the past, and several algorithms [5-8] have been proposed for missing data recovery. This paper presents a new 1-dimensional linear information decomposition (1DLID) approach that is easier to use for missing data recovery. We study one particular problem, in which a 1-dimensional data set is given and a certain percentage of the data are missing, with no additional information available. The proposed 1DLID method is then used to create the complete data set for both a generated data set and a real-world data set. Our comparative experiments show that the proposed method is reliable and can be used to recover data sets with missing values. The advantages of the proposed method are: 1) it does not change the distribution of the data set; 2) it is easy to use for 1-dimensional data sets; 3) it has higher accuracy, especially when 10% to 30% of the data are missing; 4) it does not require a historical data set.