An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type

Jiali Wang, Anton Westveld, Bronwyn Loong, Alan Welsh
DOI: https://doi.org/10.48550/arXiv.1910.05473
2019-10-12
Abstract: Copula-based methods provide a flexible approach to building missing-data imputation models for multivariate data of mixed types. However, the choice of copula function remains an open question. We consider a Bayesian nonparametric approach that uses an infinite mixture of elliptical copulas, induced by a Dirichlet process mixture, to build a flexible copula function. A slice sampling algorithm is used to sample from the infinite-dimensional parameter space. We extend the work on prior parallel tempering used in finite mixture models to the Dirichlet process mixture model to overcome the mixing issue in multimodal distributions. Using simulations, we demonstrate that the infinite mixture copula model provides a better overall fit than its single-component counterparts and performs better at capturing tail dependence features of the data. Simulations further show that our proposed model achieves more accurate imputation, especially for continuous variables, and better inferential results in some analytic models. The proposed model is applied to a medical data set of acute stroke patients in Australia.
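To make the idea of a Dirichlet process mixture of elliptical copulas more concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a truncated stick-breaking approximation in place of the slice sampler described in the abstract, uses Gaussian copulas as the elliptical family, and draws component correlation matrices with a hypothetical helper (`random_correlation`) rather than from any prior specified in the paper. All function names and parameter values are illustrative assumptions.

```python
import numpy as np
from scipy.stats import norm


def stick_breaking_weights(alpha, n_components, rng):
    """Truncated stick-breaking weights for a DP with concentration alpha."""
    betas = rng.beta(1.0, alpha, size=n_components)
    betas[-1] = 1.0  # close the stick so the truncated weights sum to 1
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - betas[:-1])))
    return betas * remaining


def random_correlation(dim, rng):
    """Hypothetical helper: random correlation matrix from a rescaled A A^T."""
    a = rng.standard_normal((dim, dim + 2))
    cov = a @ a.T
    d = np.sqrt(np.diag(cov))
    return cov / np.outer(d, d)


def sample_mixture_copula(n, weights, correlations, rng):
    """Draw n points in the unit hypercube from a mixture of Gaussian copulas."""
    dim = correlations[0].shape[0]
    # Pick a mixture component for every observation.
    labels = rng.choice(len(weights), size=n, p=weights)
    u = np.empty((n, dim))
    for k in np.unique(labels):
        idx = labels == k
        # Latent Gaussian draw with unit variances, then map to uniform margins.
        z = rng.multivariate_normal(np.zeros(dim), correlations[k], size=idx.sum())
        u[idx] = norm.cdf(z)
    return u, labels


rng = np.random.default_rng(0)
weights = stick_breaking_weights(alpha=1.0, n_components=10, rng=rng)
correlations = [random_correlation(3, rng) for _ in weights]
u, labels = sample_mixture_copula(500, weights, correlations, rng)
```

Each row of `u` has uniform margins, so mixed-type variables could be obtained by applying the inverse CDF of each variable's marginal distribution (continuous, ordinal, or binary) column by column. In an imputation setting the weights and correlation matrices would be inferred from the observed data rather than drawn at random as they are here.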
Methodology