Abstract:Sepsis-Associated Liver Injury (SALI) is an independent risk factor for death from sepsis. The aim of this study was to develop an interpretable machine learning model for early prediction of 28-day mortality in patients with SALI. Data from the Medical Information Mart for Intensive Care (MIMIC-IV, v2.2, MIMIC-III, v1.4) were used in this study. The study cohort from MIMIC-IV was randomized to the training set (0.7) and the internal validation set (0.3), with MIMIC-III (2001 to 2008) as external validation. The features with more than 20% missing values were deleted and the remaining features were multiple interpolated. Lasso-CV that lasso linear model with iterative fitting along a regularization path in which the best model is selected by cross-validation was used to select important features for model development. Eight machine learning models including Random Forest (RF), Logistic Regression, Decision Tree, Extreme Gradient Boost (XGBoost), K Nearest Neighbor, Support Vector Machine, Generalized Linear Models in which the best model is selected by cross-validation (CV_glmnet), and Linear Discriminant Analysis (LDA) were developed. Shapley additive interpretation (SHAP) was used to improve the interpretability of the optimal model. At last, a total of 1043 patients were included, of whom 710 were from MIMIC-IV and 333 from MIMIC-III. Twenty-four clinically relevant parameters were selected for model construction. For the prediction of 28-day mortality of SALI in the internal validation set, the area under the curve (AUC (95% CI)) of RF was 0.79 (95% CI: 0.73–0.86), and which performed the best. Compared with the traditional disease severity scores including Oxford Acute Severity of Illness Score (OASIS), Sequential Organ Failure Assessment (SOFA), Simplified Acute Physiology Score II (SAPS II), Logistic Organ Dysfunction Score (LODS), Systemic Inflammatory Response Syndrome (SIRS), and Acute Physiology Score III (APS III), RF also had the best performance. SHAP analysis found that Urine output, Charlson Comorbidity Index (CCI), minimal Glasgow Coma Scale (GCS_min), blood urea nitrogen (BUN) and admission_age were the five most important features affecting RF model. Therefore, RF has good predictive ability for 28-day mortality prediction in SALI. Urine output, CCI, GCS_min, BUN and age at admission(admission_age) within 24 h after intensive care unit(ICU) admission contribute significantly to model prediction.

Development and validation of machine learning-based prediction model for severe pneumonia: A multicenter cohort study

A new haematological model for the diagnosis and prognosis of severe community-acquired pneumonia: a single-center retrospective study

Development and validation of a machine learning-based interpretable model for predicting sepsis by complete blood cell parameters

Risk factors analysis and prediction model construction for severe pneumonia in older adult patients

Clinical Characteristics Analysis and Construction of a Predictive Diagnostic Model of Community-Acquired Pneumonia in Adults Requiring Hospitalization in Fujian Provincial Hospital

Machine learning-based model for predicting the occurrence and mortality of nonpulmonary sepsis-associated ARDS

Machine Learning Models for Prediction of Severe Pneumocystis carinii Pneumonia after Kidney Transplantation: A Single-Center Retrospective Study

Development and validation of a predictive model for 30-day mortality in patients with severe community-acquired pneumonia in intensive care units

An early sepsis prediction model utilizing machine learning and unbalanced data processing in a clinical context

Early prediction of sepsis-induced respiratory tract infection using a biomarker-based machine-learning algorithm

Development of a model for predicting the severity of chronic obstructive pulmonary disease

A prediction model for hospital mortality in patients with severe community-acquired pneumonia and chronic obstructive pulmonary disease

A prediction and interpretation machine learning framework of mortality risk among severe infection patients with pseudomonas aeruginosa

Prediction of viral pneumonia based on machine learning models analyzing pulmonary inflammation index scores

Predictive model for acute respiratory distress syndrome events in ICU patients in China using machine learning algorithms: a secondary analysis of a cohort study

Novel biomarker panel for the diagnosis and prognosis assessment of sepsis based on machine learning.

Predicting sepsis in-hospital mortality with machine learning: a multi-center study using clinical and inflammatory biomarkers

An interpretable machine learning model for predicting 28-day mortality in patients with sepsis-associated liver injury

A Machine Learning Model for Accurate Prediction of Sepsis in ICU Patients

The mNCP-SPI Score Predicting Risk of Severe COVID-19 among Mild-Pneumonia Patients on Admission

Early Warning Models Using Machine Learning to Predict Sepsis-Associated Chronic Critical Illness: A Study Based on the Medical Information Mart for Intensive Care Database