A Mendelian randomization study identifies proteins involved in neurodegenerative diseases

Lazaros Belbasis,Sam Morris,Cornelia van Duijn,Derrick Bennett,Robin Walters
DOI: https://doi.org/10.1101/2024.02.24.24303314
2024-02-27
Abstract:Proteins are involved in multiple biological functions. High-throughput technologies have allowed the measurement of thousands of proteins in population biobanks. In this study, we aimed to identify proteins related to Alzheimer’s disease (AD), Parkinson’s disease (PD), Multiple Sclerosis (MS) and Amyotrophic Lateral Sclerosis (ALS) using large-scale genetic and proteomic data. We performed a two-sample Mendelian randomization (MR) study by selecting instrumental variables for the abundance of over 2,700 proteins measured by either Olink or SomaScan platforms in plasma from UK Biobank and the deCODE Health Study. We also used the latest publicly-available GWAS for the diseases of interest. The potentially causal effect of proteins on neurodegenerative diseases was estimated based on the Wald ratio. We tested 10,244 protein–disease associations, identifying 122 associations which were statistically significant (5% false discovery rate). Out of 57 associations (58%) tested using an instrumental variable from both Olink and SomaScan platforms, 33 (58%) were statistically significant in both platforms. Evidence of co-localisation between plasma protein abundance and disease risk (posterior probability >0.80) was identified for 46 protein-disease pairs. Twenty-three out of 46 protein–disease associations correspond to genetic loci not previously reported by genome-wide association studies. The newly-associated proteins for AD are involved in complement ( , ), microglia ( , ) and lysosomal functions ( ). A protein newly-associated with PD ( ) is involved in the interleukin-6 pathway, two proteins for ALS ( , ) are involved in lysosomal and astrocyte function, respectively, and proteins associated with MS are involved in blood–brain barrier function ( , ), the oligodendrocyte function ( ), the structure of the node of Ranvier and function of dorsal root ganglion ( , , ), and the response to viral infections including Epstein-Barr virus ( , ). Our study demonstrates how harnessing large-scale genomic and proteomic data can yield novel insights into the role of plasma proteome in the pathogenesis of neurodegenerative diseases.
Neurology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to identify proteins associated with Alzheimer's disease (AD), Parkinson's disease (PD), multiple sclerosis (MS) and amyotrophic lateral sclerosis (ALS) through large - scale genetic and proteomic data. Specifically, the researchers used the Mendelian Randomization (MR) method. By selecting the abundances of more than 2,700 proteins in the UK Biobank and deCODE health studies as instrumental variables and combining the latest genome - wide association study (GWAS) data on neurodegenerative diseases, they estimated the potential causal effects of these proteins on neurodegenerative diseases. The main objectives of the study include: 1. **Identifying related proteins**: Identify proteins significantly associated with the above four neurodegenerative diseases through a two - sample cis - Mendelian randomization study. 2. **Verifying causal relationships**: Use the Wald ratio to estimate the impact of proteins on neurodegenerative diseases and control the false discovery rate (FDR) through multiple - testing correction (such as the Benjamini - Hochberg method) to determine statistically significant protein - disease associations. 3. **Evaluating reverse causal relationships**: Explore possible reverse causal relationships, that is, whether the disease affects protein levels, through two - way Mendelian randomization analysis. 4. **Colocalization analysis**: Conduct Bayesian colocalization analysis to reduce confounding factors caused by linkage disequilibrium (LD) and further verify the validity of instrumental variables and the robustness of protein - disease associations. Through these methods, the study aims to provide new insights, reveal the role of the plasma proteome in the pathogenesis of neurodegenerative diseases, and may provide a basis for the development of new drug targets and treatment strategies.