Handling Incomplete Data in Survival Analysis with Multiple Covariates

Yi Yu, Lin Ma, Yong Sun, Yuantong Gu
DOI: https://doi.org/10.1007/978-0-85729-493-7_88
2012-01-01
Abstract: This paper studies the missing covariate problem, which is often encountered in survival analysis. Three covariate imputation methods are employed in the study, and the effectiveness of each method is evaluated within the hazard prediction framework. Data from a typical engineering asset are used in the case study. Covariate values at some time steps are deliberately discarded to generate an incomplete covariate set. It is found that although the mean imputation method is simpler than the others, the values it produces can differ substantially from the real values of the missing covariates. The study also shows that, in general, results obtained from the regression method are more accurate than those of the mean imputation method, but at a higher computational expense. The Gaussian Mixture Model (GMM) method is found to be the most effective of the three in terms of both computational efficiency and prediction accuracy.
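The evaluation procedure described in the abstract (discard known covariate values, impute them, compare against the real values) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the synthetic trending covariate, the helper names `mean_impute`, `regression_impute`, and `mean_abs_error`, and the choice to regress the covariate on the time index. The abstract does not say which predictors the authors' regression method uses, and the GMM method is omitted for brevity.

```python
import random
import statistics


def mean_impute(values):
    """Replace None entries with the mean of the observed entries."""
    observed = [v for v in values if v is not None]
    mean = statistics.fmean(observed)
    return [mean if v is None else v for v in values]


def regression_impute(times, values):
    """Replace None entries with predictions from a least-squares line.

    Assumption: the covariate is regressed on the time index only.
    """
    pairs = [(t, v) for t, v in zip(times, values) if v is not None]
    t_mean = statistics.fmean([t for t, _ in pairs])
    v_mean = statistics.fmean([v for _, v in pairs])
    sxx = sum((t - t_mean) ** 2 for t, _ in pairs)
    sxy = sum((t - t_mean) * (v - v_mean) for t, v in pairs)
    slope = sxy / sxx
    intercept = v_mean - slope * t_mean
    return [intercept + slope * t if v is None else v
            for t, v in zip(times, values)]


def mean_abs_error(truth, imputed, missing_idx):
    """Average absolute gap between real and imputed values at the missing steps."""
    return statistics.fmean([abs(truth[i] - imputed[i]) for i in missing_idx])


# Hypothetical covariate: a slowly rising signal with small noise.
random.seed(0)
times = list(range(50))
truth = [2.0 + 0.1 * t + random.gauss(0, 0.05) for t in times]

# Deliberately discard values at 10 time steps to build an incomplete set.
missing_idx = sorted(random.sample(times, 10))
incomplete = [None if i in missing_idx else v for i, v in enumerate(truth)]

mean_err = mean_abs_error(truth, mean_impute(incomplete), missing_idx)
reg_err = mean_abs_error(truth, regression_impute(times, incomplete), missing_idx)
print(f"mean imputation error:       {mean_err:.3f}")
print(f"regression imputation error: {reg_err:.3f}")
```

On a trending covariate like this one, mean imputation ignores the trend, which is one way its results can drift far from the real values. A stationary covariate would narrow the gap between the two methods.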