CLUE: Personalized Hospital Readmission Prediction Against Data Insufficiency under Imbalanced-Data Environment

Qianwen Meng,Lizhen Cui,Guoxian Yu,Han Yu,Wei Guo,Hui Li
DOI: https://doi.org/10.1109/BIBM49941.2020.9313562
2020-01-01
Abstract:Hospital readmission prediction employs reliable predictive models to evaluate the readmission risk of patients upon discharge. Identifying patients with high readmission risk and paying additional attention to them can ease the burden on both patients and society. Recently, considerable attention has been paid to personalized readmission predictions, i.e., building an independent model for each target patient. However, existing personalized predictive models can be easily affected by data insufficiency and provide poor generalization capabilities. To address these challenges, in this paper, we propose the CLuster-based mUlti-task lEarning model (CLUE) to achieve personalized hospital readmission prediction. CLUE groups patients into different clusters based on a multi-angle similarity metric to preserve the interrelated information of patients with highly similar clinical behaviors. Due to different group characteristics of patients, the clusters of patients are imbalanced. Given that, CLUE treats the hospital readmission prediction for each cluster of patients as one task and learns multiple tasks jointly by parameter sharing mechanisms. In this way, not only can the data insufficiency problem be alleviated by supplementing individual models with the shared information from other clusters, but also the specific information of each cluster can be preserved for personalization. We conduct extensive experiments on a real-world dataset of electronic health records, and show that CLUE significantly outperforms competitive comparative methods.
What problem does this paper attempt to address?