A Retrospective Cohort Study: Predicting 90-Day Mortality for ICU Trauma Patients with a Machine Learning Algorithm Using XGBoost Using MIMIC-III Database
Shan Yang,Lirui Cao,Yongfang Zhou,Chenggong Hu
DOI: https://doi.org/10.2147/JMDH.S416943
2023-09-07
Journal of Multidisciplinary Healthcare
Abstract:Shan Yang, 1 Lirui Cao, 2 Yongfang Zhou, 3 Chenggong Hu 1 1 Department of Critical Care Medicine, West China Hospital of Sichuan University, Chengdu, Sichuan, 610041, People's Republic of China; 2 West China Hospital of Sichuan University, Chengdu, Sichuan, 610041, People's Republic of China; 3 Department of Respiratory Care, West China Hospital of Sichuan University, Chengdu, Sichuan, 610041, People's Republic of China Correspondence: Chenggong Hu; Yongfang Zhou, Email ; Objective: The aim of this study was to develop and validate a machine learning-based predictive model that predicts 90-day mortality in ICU trauma patients. Methods: Data of patients with severe trauma were extracted from the Medical Information Mart for Intensive Care III (MIMIC-III) database. The performances of mortality prediction models generated using nine machine learning extreme gradient boosting (XGBoost), logistic regression, random forest, AdaBoost, multilayer perceptron (MLP) neural networks, support vector machine (SVM), light gradient boosting machine (GBM), k nearest neighbors (KNN) and gaussian naive bayes (GNB). The performance of the model was evaluated in terms of discrimination, calibration and clinical application. Results: We found that the accuracy, sensitivity, specificity, PPV, NPV and F1 score of our proposed XGBoost model were 82.8%, 79.7%, 77.6%, 51.2%, 91.5% and 0.624, respectively. Among the nine models, the XGBoost model performed best. Compared with traditional logistic regression, the calibration curves of the XGBoost model and decision curve analysis (DCA) performed well. Conclusion: Our study shows that the XGBoost model outperforms other machine learning models in predicting 90-day mortality in trauma patients. It can be used to assist clinicians in the early identification of mortality risk factors and early intervention to reduce mortality. Keywords: MIMIC-III, severe trauma patient, intensive care unit, XGBoost, mortality, prediction model Trauma is a major cause of death in the United States, a worldwide public health issue with serious economic burdens, and an important cause of life expectancy loss. 1,2 It has been reported that trauma is the main cause of death in the first forty years of life. Trauma causes 4.4 million deaths annually, accounting for almost 8% of global deaths. 3,4 Patients with severe trauma usually require admission to the ICU, and trauma is a common disease in the ICU, with variable morbidity and mortality rates. 5 For the assessment of trauma prognosis, several methods for assessing the severity of injury have been developed over the past decades. Common scoring systems include the Injury Severity Scale (ISS), Revised Trauma Score (RTS), and Trauma and Injury Severity Score (TRISS). 6 Cook et al compared the Trauma Audit and Research Network (TARN) with the Trauma Mortality Prediction Mode (TMPM). TMPM should be considered a measure of injury severity. 7 The above scores and models provide clinical importance, but these methods and scores require the assumption of independent and linear relationships between explanatory. The above scores and models and various modifications are evidence-based tools, and some research findings suggest that they may mislead doctors by misclassifying patients' conditions. However, when there are collinearity, heteroscedasticity, high-order interactions, and nonlinear relationships between variables, the performance of these two types of models is poor. 8,9 Therefore, more valuable and accurate prognostic tools that are not limited to these assumptions are needed to achieve better patient outcomes and maximize resource utilization. The new machine learning technology performs better prediction than traditional prediction methods. Modern ICUs are rich in data through continuous patient monitoring. Advances in computer technology and the establishment of specialized databases such as MIMIC have helped more doctors recognize and focus on machine learning, and machine learning methods are gaining acceptance, providing opportunities for data science and machine learning. 10,11 In addition, machine learning can determine the combination of reliable prediction results by observing patients and automatically calculating important variables and empirical patterns based on a large number of variables. 12 Machine learning (ML) "learns" models from past data to predict future data. 13 Learning is one of the key processes in artificial intelligence. ML for predicting and extracting information from data is increasingly being applied in many different fields, from medicine to finance. 14 Due to these algor -Abstract Truncated-
health care sciences & services