Early Warning Models Using Machine Learning to Predict Sepsis-Associated Chronic Critical Illness: A Study Based on the Medical Information Mart for Intensive Care Database

Yulin Mei, Meng Li, Yuqi Li, Ximei Sheng, Chunyan Zhu, Xiaoqin Fan, Lei Zhang, Aijun Pan
DOI: https://doi.org/10.7759/cureus.67121
2024-08-18
Cureus
Abstract:
Background: Patients with chronic critical illness (CCI) have poor prognoses and incur high medical costs. However, clinical awareness of sepsis-associated CCI is currently limited, resulting in insufficient vigilance. It is therefore necessary to build a machine learning model that can predict whether sepsis patients will develop CCI.
Methods: Clinical data on 19,077 sepsis patients from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database were analyzed. Predictive factors were identified using the Student's t-test, Mann-Whitney U test, or χ² test. Six machine learning classification models were established: logistic regression, support vector machine, decision tree, random forest, extreme gradient boosting (XGBoost), and artificial neural network. The optimal model was selected on the basis of its performance. Calibration curves were used to evaluate the calibration of the predicted probabilities, and an external validation dataset was used to evaluate the performance of the model.
Results: Thirty-seven characteristics, such as elevated alanine aminotransferase, rapid heart rate, and high Logistic Organ Dysfunction System scores, were identified as risk factors for developing CCI. The area under the receiver operating characteristic curve (AUROC) values for all models were above 0.73 on the internal test set. Among them, the extreme gradient boosting model exhibited superior performance (F1 score = 0.91, AUROC = 0.91, Brier score = 0.052). It also exhibited stable prediction performance on the external validation set (AUROC = 0.72).
Conclusion: A machine learning model was established to predict whether sepsis patients will develop CCI. It can provide useful predictive information for clinical decision-making.
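The three metrics reported in the abstract (F1 score, AUROC, and Brier score) can be illustrated with a minimal pure-Python sketch. This is not the authors' code: the function names, the 0.5 decision threshold, and the eight toy labels and probabilities are all assumptions made up for illustration, and none of the values come from MIMIC-IV or the study.

```python
# Illustrative sketch only. The labels and probabilities are invented toy
# values, NOT study data; the 0.5 threshold is an assumption.

def brier_score(y_true, y_prob):
    """Mean squared gap between predicted probability and outcome (0 is perfect)."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def f1_score(y_true, y_prob, threshold=0.5):
    """Harmonic mean of precision and recall at a fixed decision threshold."""
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    tp = sum(1 for y, yp in zip(y_true, y_pred) if y == 1 and yp == 1)
    fp = sum(1 for y, yp in zip(y_true, y_pred) if y == 0 and yp == 1)
    fn = sum(1 for y, yp in zip(y_true, y_pred) if y == 1 and yp == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auroc(y_true, y_prob):
    """Chance that a random positive case outscores a random negative one.

    Ties count as half a win. This pairwise form is O(n_pos * n_neg),
    which is fine for a demonstration but slow for large cohorts.
    """
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("AUROC needs at least one positive and one negative case")
    wins = sum(
        1.0 if pp > pn else 0.5 if pp == pn else 0.0
        for pp in pos
        for pn in neg
    )
    return wins / (len(pos) * len(neg))


# Toy example: 1 = developed CCI, 0 = did not (hypothetical patients).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_prob = [0.9, 0.2, 0.7, 0.4, 0.3, 0.6, 0.8, 0.1]

print("F1:", f1_score(y_true, y_prob))
print("AUROC:", auroc(y_true, y_prob))
print("Brier:", brier_score(y_true, y_prob))
```

AUROC measures how well the model ranks patients, while the Brier score also penalizes poorly calibrated probabilities. Reporting both alongside calibration curves, as the abstract does, therefore covers discrimination and calibration separately.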