Opportunities and Challenges for Transcriptome-Wide Association Studies
Michael Wainberg,Nasa Sinnott-Armstrong,Nicholas Mancuso,Alvaro N. Barbeira,David A. Knowles,David Golan,Raili Ermel,Arno Ruusalepp,Thomas Quertermous,Ke Hao,Johan L. M. Björkegren,Hae Kyung Im,Bogdan Pasaniuc,Manuel A. Rivas,Anshul Kundaje
DOI: https://doi.org/10.1038/s41588-019-0385-z
IF: 30.8
2019-01-01
Nature Genetics
Abstract:Transcriptome-wide association studies (TWAS) integrate GWAS and gene expression datasets to find gene-trait associations. In this Perspective, we explore properties of TWAS as a potential approach to prioritize causal genes, using simulations and case studies of literature-curated candidate causal genes for schizophrenia, LDL cholesterol and Crohn’s disease. We explore risk loci where TWAS accurately prioritizes the likely causal gene, as well as loci where TWAS prioritizes multiple genes, some of which are unlikely to be causal, because they share the same variants as eQTLs. We illustrate that TWAS is especially prone to spurious prioritization when using expression data from tissues or cell types that are less related to the trait, due to substantial variation in both expression levels and eQTL strengths across cell types. Nonetheless, TWAS prioritizes candidate causal genes at GWAS loci more accurately than simple baselines based on proximity to lead GWAS variant and expression in trait-related tissue. We discuss current strategies and future opportunities for improving the performance of TWAS for causal gene prioritization. Our results showcase the strengths and limitations of using expression variation across individuals to determine causal genes at GWAS loci and provide guidelines and best practices when using TWAS to prioritize candidate causal genes.