Prediction of mortality in hemodialysis patients based on autoencoders
Shuzhi Su,Jisheng Gao,Jingjing Dong,Qi Guo,Hualin Ma,Shaodong Luan,Xuejia Zheng,Huihui Tao,Lingling Zhou,Yong Dai
DOI: https://doi.org/10.1016/j.ijmedinf.2024.105744
IF: 4.73
2024-12-04
International Journal of Medical Informatics
Abstract:Background Patients with end-stage renal disease (ESRD) undergoing hemodialysis (HD) exhibit a high mortality risk, particularly at the onset of treatment. Conventional risk assessment models, dependent on extensive temporal data accumulation, frequently encounter issues of data incompleteness and lengthy collection periods. Objective This study addresses the imbalance in short-term HD data and the issue of missing data features, achieving a robust assessment of mortality risk for HD patients over the subsequent 30 to 450 days. Methods An autoencoder-based mortality prediction model for HD patients is proposed. Leveraging the manifold structure of the non-missing features and the intrinsic relationship between the features in the high-dimensional data space, the model infers the values of the missing features. Noise and redundant information typically distort the manifold structure, impacting the accuracy of inferences about missing features. Consequently, we generate feature dropping masks to simulate the missing data distribution in the deep learning framework and design an autoencoder, forming an adaptive feature extraction module. The module utilizes readily available short-term data for unsupervised learning, enabling the encoder to reconstruct missing features and derive latent representations. Finally, a classifier based on the latent representations achieves the mortality prediction. Results Over a 30-day observation window, the model demonstrated superior mortality prediction performance compared to other models across all prediction windows. Feature importance analysis showed that creatinine and age are consistently the most critical features across all prediction windows. Glucose (fasting) and platelet count also remain significant, with their correlation with mortality risk increasing over time. Serum albumin, international standard ratio, and phosphate are particularly important for short-term predictions, while conjugated bilirubin and prothrombin time gain prominence in mid- and long-term predictions. Conclusion The proposed model proficiently leverages short-term HD data to provide precise mortality risk evaluations in HD patients, with particular efficacy in the short-term. Its application holds considerable value for clinical decision-making and risk management in this patient population.
health care sciences & services,computer science, information systems,medical informatics