Construction and Validation of Prognostic Models in Critically Ill Patients with Sepsis-associated Acute Kidney Injury: machine learning approaches compare with traditional logistic regression.

Zhiyan Fan,Jiamei Jiang,Fanghui Chen,Xiao Chen,Youlei Chen,Xia Qin,Juan Wang,Mengjuan Fang,Zhijing Wu
DOI: https://doi.org/10.21203/rs.3.rs-2429979/v1
2023-01-01
Abstract:Abstract Background Acute kidney injury (AKI) is a common complication in critically ill patients with sepsis and often represents a poor prognosis. However, the application of machine learning methods in this filed is lacking. We aim to construct and validate clinical prognosis prediction models for patients with sepsis associated acute kidney injury (S-AKI) with machine learning methods. Method Data of training cohort patients were collected from the Medical Information Mart for Intensive Care III database version 1.4 ( MIMIC III v1.4) to build models, and data of patients were extracted from Hangzhou First People's Hospital Affiliated to Zhejiang University School of Medicine for model external validation. Predictors for mortality were initially identify by the least absolute shrinkage and selection operator (LASSO) regression, and then random forest (RF), Gradient Boosted Decision Trees (GBDT), Neural network models: Multi-layer Perceptron(MLP), Support vector machines(SVMs) and traditional Logistic regression(LR) were used to establish prediction models for 7 days, 14 days, and 28 days after ICU admission, respectively. The prediction performance was assessed using receiver operating characteristic (ROC) curves, decision curve analysis (DCA) and f1-score. Result A total of 1982 critically ill patients with S-AKI were included for analysis, of which 1882 patients for model development, 100 patients for external validation. The overall 7-day mortality was about 23.6%. A total 20 variables were selected for model establishment. The models of LR, RF, GBDT, MLP, SVM were established and obtained areas under the ROC curves (AUC) of 0.74, 0.86, 0.88, 0.83, 0.75 in 7 days group, 0.62, 0.70, 0.72, 0.67, 0.61 in 14 days group, 0.6, 0.61, 0.57, 0.56, 0.6 in group 28 days in training cohort. According to the results of AUC, f1-score, and DCA in the training cohort for the 7-day, 14-day, 28-day for the five models, the model of RF and GBDT exhibits excellent performance. The RF and GBDT models also have Excellent discrimination in validation cohort. Conclusion By utilizing the machine learning approaches we construct more significant prediction models. Clinically, the RF and GBDT models might be useful in helping clinicians craft precise treatment and management plans for patients with S-AKI.
What problem does this paper attempt to address?