Prediction of Post-Stroke Urinary Tract Infection Risk in Immobile Patients Using Machine Learning: an Observational Cohort Study.

C. Zhu, Z. Xu, Y. Gu, S. Zheng, X. Sun, J. Cao, B. Song, J. Jin, Y. Liu, X. Wen, S. Cheng, J. Li, X. Wu
DOI: https://doi.org/10.1016/j.jhin.2022.01.002
IF: 8.944
2022-01-01
Journal of Hospital Infection
Abstract:
BACKGROUND: Urinary tract infection (UTI) is one of the major nosocomial infections and significantly affects the outcomes of immobile stroke patients. Previous studies have identified several risk factors, but accurately estimating an individual patient's UTI risk remains challenging.
AIM: To develop predictive models that identify UTI risk in immobile stroke patients.
METHODS: Research data were collected from our previous multicentre study. The derivation cohort included 3982 immobile stroke patients collected from November 1st, 2015 to June 30th, 2016; the external validation cohort included 3837 patients collected from November 1st, 2016 to July 30th, 2017. Six machine learning models and an ensemble learning model were derived from 80% of the derivation cohort, and their effectiveness was evaluated on the remaining 20%. Shapley additive explanation values were used to determine feature importance and to examine the clinical significance of the prediction models.
FINDINGS: In all, 2.59% (103/3982) of patients were diagnosed with UTI in the derivation cohort and 1.38% (53/3837) in the external cohort. The ensemble learning model achieved the best area under the receiver operating characteristic (ROC) curve in internal validation (82.2%) and the second best in external validation (80.8%). In addition, the ensemble learning model achieved the best sensitivity in both the internal and external validation sets (80.9% and 81.1%, respectively). Seven UTI risk factors (pneumonia, glucocorticoid use, female sex, mixed cerebrovascular disease, increased age, prolonged length of stay, and duration of catheterization) were also identified.
CONCLUSION: This ensemble learning model demonstrated promising performance. Future work should continue to develop a more concise scoring tool based on machine learning models and prospectively examine the model in practical use, thus improving clinical outcomes.
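The abstract reports cohort incidence, area under the ROC curve, and sensitivity for an ensemble of models, but it does not describe how the ensemble combines its members or which decision threshold was used. The following is only a minimal sketch of how such metrics can be computed. It assumes simple score averaging (soft voting) and a threshold of 0.5, and it uses hypothetical toy labels and scores; `incidence_percent`, `soft_vote`, `roc_auc`, and `sensitivity` are illustrative names, not the authors' code. Only the two incidence figures come from the abstract.

```python
def incidence_percent(cases, total):
    """Percentage of patients diagnosed, rounded to two decimals."""
    return round(100 * cases / total, 2)


def soft_vote(score_lists):
    """Average each patient's predicted risk across models (assumed ensemble rule)."""
    return [sum(column) / len(column) for column in zip(*score_lists)]


def roc_auc(labels, scores):
    """AUC as the chance that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def sensitivity(labels, scores, threshold=0.5):
    """Share of true UTI cases whose score reaches the threshold."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    if not positives:
        raise ValueError("need at least one positive label")
    return sum(s >= threshold for s in positives) / len(positives)


# Incidence figures reported in the abstract.
derivation_incidence = incidence_percent(103, 3982)
external_incidence = incidence_percent(53, 3837)

# Hypothetical toy data: 1 = UTI, 0 = no UTI, with scores from two made-up models.
labels = [1, 1, 0, 0, 0, 1, 0, 0]
model_a = [0.9, 0.4, 0.3, 0.2, 0.6, 0.7, 0.1, 0.5]
model_b = [0.8, 0.6, 0.2, 0.7, 0.3, 0.5, 0.2, 0.1]
ensemble = soft_vote([model_a, model_b])

print("incidence:", derivation_incidence, external_incidence)
for name, scores in [("A", model_a), ("B", model_b), ("ensemble", ensemble)]:
    print(name, roc_auc(labels, scores), sensitivity(labels, scores))
```

On this toy data the averaged scores separate cases from non-cases better than either individual model, which illustrates why an ensemble can outperform its members. It says nothing about the magnitude of the effect in the study itself.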