Machine learning using clinical data at baseline predicts the medium-term efficacy of ustekinumab in patients with ulcerative colitis

Hiromu Morikubo,Ryuta Tojima,Tsubasa Maeda,Katsuyoshi Matsuoka,Minoru Matsuura,Jun Miyoshi,Satoshi Tamura,Tadakazu Hisamatsu
DOI: https://doi.org/10.1038/s41598-024-55126-1
IF: 4.6
2024-02-23
Scientific Reports
Abstract:Predicting the therapeutic response to biologics before administration is a key clinical challenge in ulcerative colitis (UC). We previously reported a model for predicting the efficacy of vedolizumab (VDZ) for UC using a machine-learning approach. Ustekinumab (UST) is now available for treating UC, but no model for predicting its efficacy has been developed. When applied to patients with UC treated with UST, our VDZ prediction model showed positive predictive value (PPV) of 56.3% and negative predictive value (NPV) of 62.5%. Given this limited predictive ability, we aimed to develop a UST-specific prediction model with clinical features at baseline including background factors, clinical and endoscopic activity, and blood test results, as we did for the VDZ prediction model. The top 10 features (Alb, monocytes, height, MCV, TP, Lichtiger index, white blood cell count, MCHC, partial Mayo score, and CRP) associated with steroid-free clinical remission at 6 months after starting UST were selected using random forest. The predictive ability of a model using these predictors was evaluated by fivefold cross-validation. Validation of the prediction model with an external cohort showed PPV of 68.8% and NPV of 71.4%. Our study suggested the importance of establishing a drug-specific prediction model.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper attempts to address the issue of predicting the treatment efficacy of ustekinumab (UST) in patients with ulcerative colitis (UC). Specifically, the researchers found that previously developed models for predicting the efficacy of vedolizumab (VDZ) performed poorly in predicting UST efficacy. Therefore, they aimed to develop a prediction model specifically for UST. The researchers utilized baseline clinical characteristics, including background factors, clinical and endoscopic activity, and blood test results, to establish this model through machine learning methods. They used the random forest algorithm to screen the top 10 features related to steroid-free clinical remission (SFCR) and evaluated the model's predictive ability using these features. External cohort validation results showed that the model had high positive predictive value (PPV) and negative predictive value (NPV), indicating its effectiveness in actual clinical application. This further emphasizes the importance of developing specific prediction models for each drug.