Development of machine learning models for predicting outcome in patients with distal medium vessel occlusions: a retrospective study
Burak Berksu Ozkara,Mert Karabacak,Apoorva Kotha,Brian Cooper Cristiano,Max Wintermark,Vivek Srikar Yedavalli
DOI: https://doi.org/10.21037/qims-23-154
2023-09-01
Abstract:Background: While numerous prognostic factors have been reported for large vessel occlusion (LVO)-acute ischemic stroke (AIS) patients, the same cannot be said for distal medium vessel occlusions (DMVOs). We used machine learning (ML) algorithms to develop a model predicting the short-term outcome of AIS patients with DMVOs using demographic, clinical, and laboratory variables and baseline computed tomography (CT) perfusion (CTP) postprocessing quantitative parameters. Methods: In this retrospective cohort study, consecutive patients with AIS admitted to two comprehensive stroke centers between January 1, 2017, and September 1, 2022, were screened. Demographic, clinical, and radiological data were extracted from electronic medical records. The clinical outcome was divided into two categories, with a cut-off defined by the median National Institutes of Health Stroke Scale (NIHSS) shift score. Data preprocessing involved addressing missing values through imputation, scaling with a robust scaler, normalization using min-max normalization, and encoding of categorical variables. The data were split into training and test sets (70:30), and recursive feature elimination (RFE) was employed for feature selection. For ML analyses, XGBoost, LightGBM, CatBoost, multi-layer perceptron, random forest, and logistic regression algorithms were utilized. Performance evaluation involved the receiver operating characteristic (ROC) curve, precision-recall curve (PRC), the area under these curves, accuracy, precision, recall, and Matthews correlation coefficient (MCC). The relative weights of predictor variables were examined using Shapley additive explanations (SHAP). Results: Sixty-nine patients were included and divided into two groups: 35 patients with favorable outcomes and 34 patients with unfavorable outcomes. Utilizing ten selected features, the XGBoost algorithm achieved the best performance in predicting unfavorable outcomes, with an area under the ROC curve (AUROC) of 0.894 and an area under the PRC curve (AUPRC) of 0.756. The SHAP analysis ranked the following features in order of importance for the XGBoost model: mismatch volume, time-to-maximum of the tissue residue function (Tmax) >6 s, diffusion-weighted imaging (DWI) volume, neutrophil-to-platelet ratio (NPR), mean corpuscular volume (MCV), Tmax >10 s, hemoglobin, potassium, hypoperfusion index (HI), and Tmax >8 s. Conclusions: Our ML models, trained on baseline quantitative laboratory and CT parameters, accurately predicted the short-term outcome in patients with DMVOs. These findings may aid clinicians in predicting prognosis and may be helpful for future research.
What problem does this paper attempt to address?