Integration of Rare Large-Effect Expression Variants Improves Polygenic Risk Prediction

Craig Smail, Nicole M. Ferraro, Matthew G. Durrant, Abhiram S. Rao, Matthew Aguirre, Xin Li, Michael J. Gloudemans, Themistocles L. Assimes, Charles Kooperberg, Alexander P. Reiner, Qin Hui, Jie Huang, Christopher J. O’Donnell, Yan V. Sun, Manuel A. Rivas, Stephen B. Montgomery
DOI: https://doi.org/10.1101/2020.12.02.20242990
2020-01-01
Abstract: Polygenic risk scores (PRS) aim to quantify the contribution of multiple genetic loci to an individual’s likelihood of a complex trait or disease. However, existing PRS estimate genetic liability using common genetic variants, excluding the impact of rare variants. We identified rare, large-effect variants in individuals with outlier gene expression from the GTEx project and then assessed their impact on PRS predictions in the UK Biobank (UKB). We observed large deviations from the PRS-predicted phenotypes for carriers of multiple outlier rare variants; for example, individuals classified as “low-risk” but in the top 1% of outlier rare variant burden had a 6-fold higher rate of severe obesity. We replicated these findings using data from the NHLBI Trans-Omics for Precision Medicine (TOPMed) biobank and the Million Veteran Program, and demonstrated that PRS across multiple traits will significantly benefit from the inclusion of rare genetic variants.
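As a rough illustration of the two quantities the abstract contrasts, the sketch below computes a PRS in its usual form, a weighted sum of common-variant allele dosages, and flags individuals who score as "low-risk" but carry a high rare-variant burden. This is not the authors' pipeline: the function names, the toy dosages, the effect sizes, the burden counts, and the 25% cutoffs (used in place of the abstract's top 1% because the toy cohort has only four individuals) are all assumptions made for illustration.

```python
import numpy as np

def polygenic_risk_score(dosages, weights):
    """Weighted sum of allele dosages (0, 1, or 2 copies) per individual."""
    return np.asarray(dosages, dtype=float) @ np.asarray(weights, dtype=float)

def top_burden_mask(burden, top_fraction=0.01):
    """Flag individuals whose rare-variant burden falls in the top fraction."""
    burden = np.asarray(burden, dtype=float)
    cutoff = np.quantile(burden, 1.0 - top_fraction)
    return burden >= cutoff

# Toy cohort: 4 individuals x 3 common variants (hypothetical values).
dosages = np.array([[0, 1, 2],
                    [1, 1, 0],
                    [2, 0, 0],
                    [0, 0, 1]])
weights = np.array([0.10, -0.05, 0.20])  # hypothetical per-allele effect sizes

prs = polygenic_risk_score(dosages, weights)

# Illustrative "low-risk" group: bottom quarter of the PRS distribution.
low_risk = prs <= np.quantile(prs, 0.25)

# Hypothetical counts of outlier-associated rare variants per individual.
burden = np.array([0, 3, 1, 0])
high_burden = top_burden_mask(burden, top_fraction=0.25)

# Individuals the common-variant PRS calls low-risk despite a high rare-variant burden.
discordant = low_risk & high_burden
```

In a real analysis the weights would come from a genome-wide association study, and the comparison of interest would be the phenotype rate among the discordant individuals against the rest of the low-risk group.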
What problem does this paper attempt to address?