Abstract:Background Alzheimer's disease and related dementias (ADRD) and Parkinson's disease (PD) are the most common neurodegenerative conditions. These central nervous system disorders impact both the structure and function of the brain and may lead to imaging changes that precede symptoms. Patients with ADRD or PD have long asymptomatic phases that exhibit significant heterogeneity. Hence, quantitative measures that can provide early disease indicators are necessary to improve patient stratification, clinical care, and clinical trial design. This work uses machine learning techniques to derive such a quantitative marker from T1-weighted (T1w) brain Magnetic resonance imaging (MRI). Methods In this retrospective study, we developed machine learning (ML) based disease-specific scores based on T1w brain MRI utilizing Parkinson's Disease Progression Marker Initiative (PPMI) and Alzheimer's Disease Neuroimaging Initiative (ADNI) cohorts. We evaluated the potential of ML-based scores for early diagnosis, prognosis, and monitoring of ADRD and PD in an independent large-scale population-based longitudinal cohort, UK Biobank. Findings 1,826 dementia images from 731 participants, 3,161 healthy control images from 925 participants from the ADNI cohort, 684 PD images from 319 participants, and 232 healthy control images from 145 participants from the PPMI cohort were used to train machine learning models. The classification performance is 0.94 [95% CI: 0.93-0.96] area under the ROC Curve (AUC) for ADRD detection and 0.63 [95% CI: 0.57-0.71] for PD detection using 790 extracted structural brain features. The most predictive regions include the hippocampus and temporal brain regions in ADRD and the substantia nigra in PD. The normalized ML model's probabilistic output (ADRD and PD imaging scores) was evaluated on 42,835 participants with imaging data from the UK Biobank. There are 66 cases for ADRD and 40 PD cases whose T1 brain MRI is available during pre-diagnostic phases. For diagnosis occurrence events within 5 years, the integrated survival model achieves a time-dependent AUC of 0.86 [95% CI: 0.80-0.92] for dementia and 0.89 [95% CI: 0.85-0.94] for PD. ADRD imaging score is strongly associated with dementia-free survival (hazard ratio (HR) 1.76 [95% CI: 1.50-2.05] per S.D. of imaging score), and PD imaging score shows association with PD-free survival (hazard ratio 2.33 [95% CI: 1.55-3.50]) in our integrated model. HR and prevalence increased stepwise over imaging score quartiles for PD, demonstrating heterogeneity. As a proxy for diagnosis, we validated AD/PD polygenic risk scores of 42,835 subjects against the imaging scores, showing a highly significant association after adjusting for covariates. In both the PPMI and ADNI cohorts, the scores are associated with clinical assessments, including the Mini-Mental State Examination (MMSE), Alzheimer's Disease Assessment Scale-cognitive subscale (ADAS-Cog), and pathological markers, which include amyloid and tau. Finally, imaging scores are associated with polygenic risk scores for multiple diseases. Our results suggest that we can use imaging scores to assess the genetic architecture of such disorders in the future. Interpretation Our study demonstrates the use of quantitative markers generated using machine learning techniques for ADRD and PD. We show that disease probability scores obtained from brain structural features are useful for early detection, prognosis prediction, and monitoring disease progression. To facilitate community engagement and external tests of model utility, an interactive app to explore summary level data from this study and dive into external data can be found here https://ndds-brainimaging-ml.streamlit.app. As far as we know, this is the first publicly available cloud-based MRI prediction application. Funding US National Institute on Aging, and US National Institutes of Health.

TransferGWAS of T1-weighted brain MRI data from UK Biobank

TransferGWAS of T1-weighted Brain MRI Data from the UK Biobank

Efficient multi-phenotype genome-wide analysis identifies genetic associations for unsupervised deep-learning-derived high-dimensional brain imaging phenotypes

A novel classification framework for genome-wide association study of whole brain MRI images using deep learning

Identification of Disease-Sensitive Brain Imaging Phenotypes and Genetic Factors Using GWAS Summary Statistics

Genome-wide association studies of brain imaging phenotypes in UK Biobank

Increasing Power for Voxel-Wise Genome-Wide Association Studies: the Random Field Theory, Least Square Kernel Machines and Fast Permutation Procedures.

Multi-trait genome-wide analyses of the brain imaging phenotypes in UK Biobank

DeepWAS: Multivariate Genotype-Phenotype Associations by Directly Integrating Regulatory Information Using Deep Learning

Brain-Wide Genome-Wide Association Study for Alzheimer's Disease Via Joint Projection Learning and Sparse Regression Model

Identification of genetic basis of brain imaging by group sparse multi-task learning leveraging summary statistics

Deep Causal Feature Extraction and Inference with Neuroimaging Genetic Data.

Accelerating Heritability, Genetic Correlation, and Genome‐Wide Association Imaging Genetic Analyses in Complex Pedigrees

Accelerating Sparse Canonical Correlation Analysis for Large Brain Imaging Genetics Data

Mining Outcome-Relevant Brain Imaging Genetic Associations Via Three-Way Sparse Canonical Correlation Analysis in Alzheimer'S Disease

Prediction, prognosis and monitoring of neurodegeneration at biobank-scale via machine learning and imaging

Sparse Parallel Independent Component Analysis and Its Application to Identify Stable and Replicable Imaging-genomic Association Patterns in UK Biobank

An expanded set of genome-wide association studies of brain imaging phenotypes in UK Biobank

Large-scale GWAS Reveals Genetic Architecture of Brain White Matter Microstructure and Genetic Overlap with Cognitive and Mental Health Traits ( N = 17,706)

Transcriptome-wide association analysis of 211 neuroimaging traits identifies new genes for brain structures and yields insights into the gene-level pleiotropy with other complex traits