A stacking-based model for predicting 30-day all-cause hospital readmissions of patients with acute myocardial infarction

Zhen Zhang,Hang Qiu,Weihao Li,Yucheng Chen
DOI: https://doi.org/10.1186/s12911-020-01358-w
IF: 3.298
2020-12-01
BMC Medical Informatics and Decision Making
Abstract:Abstract Background Acute myocardial infarction (AMI) is a serious cardiovascular disease, followed by a high readmission rate within 30-days of discharge. Accurate prediction of AMI readmission is a crucial way to identify the high-risk group and optimize the distribution of medical resources. Methods In this study, we propose a stacking-based model to predict the risk of 30-day unplanned all-cause hospital readmissions for AMI patients based on clinical data. Firstly, we conducted an under-sampling method of neighborhood cleaning rule (NCR) to alleviate the class imbalance and then utilized a feature selection method of SelectFromModel (SFM) to select effective features. Secondly, we adopted a self-adaptive approach to select base classifiers from eight candidate models according to their performances in datasets. Finally, we constructed a three-layer stacking model in which layer 1 and layer 2 were base-layer and level 3 was meta-layer. The predictions of the base-layer were used to train the meta-layer in order to make the final forecast. Results The results show that the proposed model exhibits the highest AUC (0.720), which is higher than that of decision tree (0.681), support vector machine (0.707), random forest (0.701), extra trees (0.709), adaBoost (0.702), bootstrap aggregating (0.704), gradient boosting decision tree (0.710) and extreme gradient enhancement (0.713). Conclusion It is evident that our model could effectively predict the risk of 30-day all cause hospital readmissions for AMI patients and provide decision support for the administration.
medical informatics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the risk prediction of unplanned readmission within 30 days after discharge for patients with acute myocardial infarction (AMI). Specifically, the author proposes a model based on the stacking technique, aiming to use clinical data to predict the risk of unplanned readmission within 30 days for AMI patients. This prediction is of great significance for identifying high - risk groups, optimizing the allocation of medical resources, and providing decision - support. ### Background Acute myocardial infarction (AMI) is a serious cardiovascular disease, and the readmission rate of patients within 30 days after discharge is relatively high. Accurately predicting the readmission risk of AMI patients is helpful for identifying high - risk groups and optimizing the allocation of medical resources. ### Methods 1. **Data pre - processing**: - Collected and pre - processed clinical data from West China Hospital of Sichuan University. - Used the Neighborhood Cleaning Rule (NCR) for under - sampling to alleviate the class imbalance problem. - Utilized the feature selection method (SelectFromModel, SFM) to select effective features. 2. **Model construction**: - Adaptively selected base classifiers from eight candidate models. - Constructed a three - layer stacking model, where the first and second layers are the base layers and the third layer is the meta - layer. - The base classifiers in the first and second layers generate prediction results through five - fold cross - validation, and these prediction results are used to train the meta - classifier (logistic regression model) in the third layer. ### Results - The proposed stacking model performs best in terms of the AUC metric, reaching 0.720, which is higher than other models such as decision tree (0.681), support vector machine (0.707), random forest (0.701), extra tree (0.709), AdaBoost (0.702), Bootstrap Aggregating (0.704), gradient - boosted decision tree (0.710), and extreme gradient boosting (0.713). ### Conclusions - The research shows that the proposed stacking model can effectively predict the risk of unplanned readmission within 30 days for AMI patients and provide decision - support for medical management. ### Keywords - Acute myocardial infarction - Hospital readmission - Clinical data - Machine learning - Adaptive - Stacking - based model learning ### Formulas - **Normalization formula**: \[ x^*=\frac{x - \text{mean}}{\sigma} \] where \(x\) is the input feature, \(\text{mean}\) and \(\sigma\) represent the average and standard deviation of the input feature respectively, and \(x^*\) represents the normalized output value. - **Accuracy**: \[ \text{Accuracy}=\frac{TP + TN}{TP + TN + FP + FN} \] - **Sensitivity**: \[ \text{Sensitivity}=\frac{TP}{TP + FN} \] - **Specificity**: \[ \text{Specificity}=\frac{TN}{TN + FP} \] Through these methods and metrics, the paper successfully solves the problem of risk prediction of unplanned readmission within 30 days for AMI patients and provides an effective solution.