Artificial intelligence for diagnostic and prognostic neuroimaging in dementia: A systematic review
Robin J Borchert,Tiago Azevedo,AmanPreet Badhwar,Jose Bernal,Matthew Betts,Rose Bruffaerts,Michael C Burkhart,Ilse Dewachter,Helena M Gellersen,Audrey Low,Ilianna Lourida,Luiza Machado,Christopher R Madan,Maura Malpetti,Jhony Mejia,Sofia Michopoulou,Carlos Muñoz-Neira,Jack Pepys,Marion Peres,Veronica Phillips,Siddharth Ramanan,Stefano Tamburin,Hanz M Tantiangco,Lokendra Thakur,Alessandro Tomassini,Ashwati Vipin,Eugene Tang,Danielle Newby,Deep Dementia Phenotyping (DEMON) Network,Janice M Ranson,David J Llewellyn,Michele Veldsman,Timothy Rittman
DOI: https://doi.org/10.1002/alz.13412
Abstract:Introduction: Artificial intelligence (AI) and neuroimaging offer new opportunities for diagnosis and prognosis of dementia. Methods: We systematically reviewed studies reporting AI for neuroimaging in diagnosis and/or prognosis of cognitive neurodegenerative diseases. Results: A total of 255 studies were identified. Most studies relied on the Alzheimer's Disease Neuroimaging Initiative dataset. Algorithmic classifiers were the most commonly used AI method (48%) and discriminative models performed best for differentiating Alzheimer's disease from controls. The accuracy of algorithms varied with the patient cohort, imaging modalities, and stratifiers used. Few studies performed validation in an independent cohort. Discussion: The literature has several methodological limitations including lack of sufficient algorithm development descriptions and standard definitions. We make recommendations to improve model validation including addressing key clinical questions, providing sufficient description of AI methods and validating findings in independent datasets. Collaborative approaches between experts in AI and medicine will help achieve the promising potential of AI tools in practice. Highlights: There has been a rapid expansion in the use of machine learning for diagnosis and prognosis in neurodegenerative disease Most studies (71%) relied on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset with no other individual dataset used more than five times There has been a recent rise in the use of more complex discriminative models (e.g., neural networks) that performed better than other classifiers for classification of AD vs healthy controls We make recommendations to address methodological considerations, addressing key clinical questions, and validation We also make recommendations for the field more broadly to standardize outcome measures, address gaps in the literature, and monitor sources of bias.