A Fast Algorithm for Bayesian Multi-Locus Model in Genome-Wide Association Studies

Weiwei Duan, Yang Zhao, Yongyue Wei, Sheng Yang, Jianling Bai, Sipeng Shen, Mulong Du, Lihong Huang, Zhibin Hu, Feng Chen
DOI: https://doi.org/10.1007/s00438-017-1322-4
2017-01-01
Zeitschrift für Induktive Abstammungs- und Vererbungslehre
Abstract: Genome-wide association studies (GWAS) have identified a large number of single-nucleotide polymorphisms (SNPs) associated with complex traits. A recently developed linear mixed model for estimating heritability by simultaneously fitting all SNPs suggests that common variants can explain a substantial fraction of heritability, which hints at the low power of the single-variant analysis typically used in GWAS. Consequently, many multi-locus shrinkage models have been proposed under a Bayesian framework. However, most rely on Markov chain Monte Carlo (MCMC) algorithms, which are time-consuming and challenging to apply to GWAS data. Here, we propose a fast algorithm for the Bayesian adaptive lasso using variational inference (BAL-VI). Extensive simulations and real data analysis indicate that our model outperforms the well-known Bayesian lasso and Bayesian adaptive lasso models in accuracy and speed. BAL-VI can complete a simultaneous analysis of a lung cancer GWAS dataset with ~3400 subjects and ~570,000 SNPs in about half a day.