MRLocus: Identifying causal genes mediating a trait through Bayesian estimation of allelic heterogeneity

Anqi Zhu, Nana Matoba, Emma P. Wilson, Amanda L. Tapia, Yun Li, Joseph G. Ibrahim, Jason L. Stein, Michael I. Love
DOI: https://doi.org/10.1371/journal.pgen.1009455
IF: 4.5
2021-01-01
PLoS Genetics
Abstract: Expression quantitative trait loci (eQTL) studies are used to understand the regulatory function of non-coding genome-wide association study (GWAS) risk loci, but colocalization alone does not demonstrate a causal relationship of gene expression affecting a trait. Evidence for mediation, that perturbation of gene expression in a given tissue or developmental context will induce a change in the downstream GWAS trait, can be provided by two-sample Mendelian Randomization (MR). Here, we introduce a new statistical method, MRLocus, for Bayesian estimation of the gene-to-trait effect from eQTL and GWAS summary data for loci with evidence of allelic heterogeneity, that is, containing multiple causal variants. MRLocus makes use of a colocalization step applied to each nearly-LD-independent eQTL, followed by an MR analysis step across eQTLs. Additionally, our method involves estimation of the extent of allelic heterogeneity through a dispersion parameter, indicating variable mediation effects from each individual eQTL on the downstream trait. Our method is evaluated against other state-of-the-art methods for estimation of the gene-to-trait mediation effect, using an existing simulation framework. In simulation, MRLocus often has the highest accuracy among competing methods, and in each case provides more accurate estimation of uncertainty as assessed through interval coverage. MRLocus is then applied to five candidate causal genes for mediation of particular GWAS traits, where gene-to-trait effects are concordant with those previously reported. We find that MRLocus's estimation of the causal effect across eQTLs within a locus provides useful information for determining how perturbation of gene expression or individual regulatory elements will affect downstream traits. The MRLocus method is implemented as an R package available at .
Author summary: Genome-wide association studies (GWAS) have identified many loci associated with complex traits and diseases. Expression quantitative trait loci (eQTL) may help to explain mechanisms of GWAS associations, if the gene has a role as a mediator of the trait or disease. Loci that exhibit allelic heterogeneity, that is, loci containing multiple causal variants, offer the opportunity to investigate whether effects are concordant and proportional across eQTL and GWAS; if the gene is a partial mediator of the trait, the sign and size of the effects across distinct eQTL variants should be reflected in GWAS associations. Such a Mendelian Randomization (MR) analysis of individual loci is complicated by moderate sample sizes in eQTL studies and linkage disequilibrium (LD), resulting in complex patterns of estimated effect sizes for eQTL and GWAS. We develop a statistical model, MRLocus, with two steps: selection of eQTL SNPs to act as instruments in the MR analysis of a genetic locus, and estimation of the gene-to-trait mediation effect taking instrument uncertainty into account. In simulation, the method has higher accuracy and better uncertainty measures than competing methods, and we compare its estimates on candidate causal gene-trait pairs from the literature.
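The idea that GWAS effects should be proportional to eQTL effects across distinct instruments can be illustrated with a much simpler calculation than the one the paper describes. The sketch below is not MRLocus and not its Bayesian model: it is a plain inverse-variance weighted regression through the origin, assuming the instruments are already independent and treating eQTL effects as known. The function names `ivw_slope` and `residual_dispersion` and all input values are hypothetical, chosen only for illustration.

```python
import math


def ivw_slope(eqtl_beta, gwas_beta, gwas_se):
    """Inverse-variance weighted slope of GWAS effects on eQTL effects.

    Illustrative stand-in only: a fixed-effect regression through the
    origin, not the Bayesian hierarchical model used by MRLocus.
    """
    weights = [1.0 / se ** 2 for se in gwas_se]
    num = sum(w * x * y for w, x, y in zip(weights, eqtl_beta, gwas_beta))
    den = sum(w * x * x for w, x in zip(weights, eqtl_beta))
    slope = num / den                # estimated gene-to-trait effect
    slope_se = math.sqrt(1.0 / den)  # standard error under fixed effects
    return slope, slope_se


def residual_dispersion(eqtl_beta, gwas_beta, slope):
    """Spread of GWAS effects around the fitted line.

    A crude analogue of a dispersion parameter: larger values mean the
    individual instruments disagree more about the gene-to-trait effect.
    """
    resid = [y - slope * x for x, y in zip(eqtl_beta, gwas_beta)]
    return math.sqrt(sum(r * r for r in resid) / (len(resid) - 1))


if __name__ == "__main__":
    # Made-up summary statistics for three independent instruments.
    eqtl = [1.0, 2.0, 3.0]
    gwas = [0.5, 1.0, 1.5]
    se = [0.1, 0.1, 0.1]
    slope, slope_se = ivw_slope(eqtl, gwas, se)
    print(slope, slope_se, residual_dispersion(eqtl, gwas, slope))
```

In this toy input the GWAS effects are exactly half the eQTL effects, so the fitted slope is about 0.5 and the residual spread is essentially zero. Real loci show scatter around the line, which is the situation the dispersion parameter in the abstract is meant to quantify.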