An analysis of missing data treatment methods and their application to health care dataset

Peng Liu,Elia El-Darzi,Lei Lei,Christos Vasilakis,Panagiotis Chountas,Wei Huang
DOI: https://doi.org/10.1007/11527503_69
2005-01-01
Abstract:It is well accepted that many real-life datasets are full of missing data. In this paper we introduce, analyze and compare several well known treatment methods for missing data handling and propose new methods based on Naive Bayesian classifier to estimate and replace missing data. We conduct extensive experiments on datasets from UCI to compare these methods. Finally we apply these models to a geriatric hospital dataset in order to assess their effectiveness on a real-life dataset.
What problem does this paper attempt to address?