Incomplete Data Classification with View-Based Decision Tree

Zhixin Qi,Hongzhi Wang,Dong Zhang
DOI: https://doi.org/10.1007/978-981-99-7657-7_4
2023-01-01
Abstract:Missing values bring negative influence in data analyses and decrease the accuracy of machine learning models. Since traditional classification methods are only able to be adopted on complete data sets, this chapter presents a generalized classification model for incomplete data in which existing classification models are easily embedded. We first generate complete views for the incomplete data based on the selection of proper attribute subsets. With the selected views, we obtain multiple base classifiers and a final classifier combined by base classifiers. Experimental results on real data sets show the effectiveness of the proposed classifier. We give the research motivation in Sect. 4.1. The sketch of tree-like structure is presented in Sect. 4.2. In Sect. 4.3, we discuss how to generate a view for each node. We report the experimental results and analysis in Sect. 4.4. Finally, in Sect. 4.5, we summarize the work of this chapter.
What problem does this paper attempt to address?