Global and Regional Deep Learning Models for Multiple Sclerosis Stratification From MRI
Llucia Coll,Deborah Pareto,Pere Carbonell-Mirabent,Álvaro Cobo-Calvo,Georgina Arrambide,Ángela Vidal-Jordana,Manuel Comabella,Joaquín Castilló,Breogán Rodrı Guez-Acevedo,Ana Zabalza,Ingrid Galán,Luciana Midaglia,Carlos Nos,Cristina Auger,Manel Alberich,Jordi Río,Jaume Sastre-Garriga,Arnau Oliver,Xavier Montalban,Àlex Rovira,Mar Tintoré,Xavier Lladó,Carmen Tur
DOI: https://doi.org/10.1002/jmri.29046
Abstract:Background: The combination of anatomical MRI and deep learning-based methods such as convolutional neural networks (CNNs) is a promising strategy to build predictive models of multiple sclerosis (MS) prognosis. However, studies assessing the effect of different input strategies on model's performance are lacking. Purpose: To compare whole-brain input sampling strategies and regional/specific-tissue strategies, which focus on a priori known relevant areas for disability accrual, to stratify MS patients based on their disability level. Study type: Retrospective. Subjects: Three hundred nineteen MS patients (382 brain MRI scans) with clinical assessment of disability level performed within the following 6 months (~70% training/~15% validation/~15% inference in-house dataset) and 440 MS patients from multiple centers (independent external validation cohort). Field strength/sequence: Single vendor 1.5 T or 3.0 T. Magnetization-Prepared Rapid Gradient-Echo and Fluid-Attenuated Inversion Recovery sequences. Assessment: A 7-fold patient cross validation strategy was used to train a 3D-CNN to classify patients into two groups, Expanded Disability Status Scale score (EDSS) ≥ 3.0 or EDSS < 3.0. Two strategies were investigated: 1) a global approach, taking the whole brain volume as input and 2) regional approaches using five different regions-of-interest: white matter, gray matter, subcortical gray matter, ventricles, and brainstem structures. The performance of the models was assessed in the in-house and the independent external cohorts. Statistical tests: Balanced accuracy, sensitivity, specificity, area under receiver operating characteristic (ROC) curve (AUC). Results: With the in-house dataset, the gray matter regional model showed the highest stratification accuracy (81%), followed by the global approach (79%). In the external dataset, without any further retraining, an accuracy of 72% was achieved for the white matter model and 71% for the global approach. Data conclusion: The global approach offered the best trade-off between internal performance and external validation to stratify MS patients based on accumulated disability. Evidence level: 4 TECHNICAL EFFICACY: Stage 2.