A Genome-Wide Association Study of Alzheimer’s Disease Using Random Forests and Enrichment Analysis

Liang Zou,Qiong Huang,Ao Li,MingHui Wang
DOI: https://doi.org/10.1007/s11427-012-4343-6
2012-01-01
Scientia Sinica Vitae
Abstract:Alzheimer’s disease (AD) is a serious neurodegenerative disorder and its cause remains largely elusive. In past years, genome-wide association (GWA) studies have provided an effective means for AD research. However, the univariate method that is commonly used in GWA studies cannot effectively detect the biological mechanisms associated with this disease. In this study, we propose a new strategy for the GWA analysis of AD that combines random forests with enrichment analysis. First, backward feature selection using random forests was performed on a GWA dataset of AD patients carrying the apolipoprotein gene (APOEɛ4) and 1058 susceptible single nucleotide polymorphisms (SNPs) were detected, including several known AD-associated SNPs. Next, the susceptible SNPs were investigated by enrichment analysis and significantly-associated gene functional annotations, such as ‘alternative splicing’, ‘glycoprotein’, and ‘neuron development’, were successfully discovered, indicating that these biological mechanisms play important roles in the development of AD in APOEɛ4 carriers. These findings may provide insights into the pathogenesis of AD and helpful guidance for further studies. Furthermore, this strategy can easily be modified and applied to GWA studies of other complex diseases.
What problem does this paper attempt to address?