Multi-ancestry polygenic risk scores for venous thromboembolism

Yon Ho Jee, Florian Thibord, Alicia Dominguez, Corriene Sept, Kristin Boulier, Vidhya Venkateswaran, Yi Ding, Tess Cherlin, Shefali Setia Verma, Valeria Lo Faro, Traci M. Bartz, Anne Boland, Jennifer A. Brody, Jean-Francois Deleuze, Joseph Emmerich, Marine Germain, Andrew D. Johnson, Charles Kooperberg, Pierre-Emmanuel Morange, Nathan Pankratz, Bruce M. Psaty, Alexander P. Reiner, David M. Smadja, Colleen M. Sitlani, Pierre Suchon, Weihong Tang, David-Alexandre Trégouët, Sebastian Zöllner, Bogdan Pasaniuc, Scott M. Damrauer, Serena Sanna, Harold Snieder, Lifelines Cohort Study, Christopher Kabrhel, Nicholas L. Smith, Peter Kraft, INVENT Consortium
DOI: https://doi.org/10.1101/2024.01.09.24300914
2024-01-10
Abstract: Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRS constructed using more sophisticated methods and more diverse training data can enhance the predictive ability and their utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium GWAS meta-analyses of European- (71,771 cases and 1,059,740 controls) and African-ancestry samples (7,482 cases and 129,975 controls). We used LDpred2 and PRSCSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6,261 cases and 88,238 controls) and African-ancestry sample (1,385 cases and 12,569 controls). Multi-ancestry PRSs with weights tuned in European- and African-ancestry samples, respectively, outperformed ancestry-specific PRSs in European- (PRSCSX: AUC=0.61 (0.60, 0.61), PRSCSX_combined: AUC=0.61 (0.60, 0.62)) and African-ancestry test samples (PRSCSX: AUC=0.58 (0.57, 0.60), PRSCSX_combined: AUC=0.59 (0.57, 0.60)). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. These findings suggest that the multi-ancestry PRS may be used to identify individuals at highest risk for VTE and provide guidance for the most effective treatment strategy across diverse populations.
Genetic and Genomic Medicine
What problem does this paper attempt to address?
This paper aims to develop and validate cross-ancestry polygenic risk scores (PRSs) for predicting the risk of venous thromboembolism (VTE). Specifically, the researchers focus on the following points:

1. **Improve the predictive ability of PRS in different populations**: Existing PRSs are mainly built from genome-wide association studies (GWAS) of European-ancestry samples, and their predictive ability in non-European-ancestry populations (such as those of African ancestry) is limited. This study aims to improve prediction across populations by using more sophisticated construction methods and more diverse training data.
2. **Reduce health inequalities**: In the United States, the incidence of VTE is approximately 65% higher in Black Americans than in White Americans. A VTE risk prediction model that works for Black Americans could therefore serve as a clinical tool to help reduce this disparity.
3. **Identify high-risk individuals**: By constructing multi-ancestry PRSs, the researchers hope to identify individuals at high risk of VTE more accurately, so that these individuals can be offered more effective prevention or treatment strategies.

To achieve these goals, the researchers used GWAS meta-analysis data from the International Network Against Venous Thrombosis (INVENT) consortium, including European-ancestry (71,771 VTE cases and 1,059,740 controls) and African-ancestry (7,482 VTE cases and 129,975 controls) samples. They applied two Bayesian methods (LDpred2 and PRSCSx) to construct ancestry-specific and multi-ancestry PRSs and validated them in independent European-ancestry and African-ancestry samples. The results show that multi-ancestry PRSs perform better than ancestry-specific PRSs in both European-ancestry and African-ancestry test samples, with a greater improvement in African-ancestry samples.

This indicates that multi-ancestry PRSs have potential value for identifying individuals at high risk of VTE and can help guide prevention and treatment strategies in different populations.
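
To make the evaluation quantities mentioned here concrete, the following is a minimal Python sketch of how a PRS, its AUC, and a top-stratum comparison could be computed. It is not the authors' pipeline: LDpred2 and PRSCSx estimate the per-variant weights, which are simply taken as given here. All function names are hypothetical, the data are simulated, the "middle stratum" is assumed to be the 40th to 60th percentile, and an odds ratio is used as one possible measure of relative risk.

```python
import math
import random


def polygenic_score(dosages, weights):
    """Weighted sum of risk-allele dosages (0, 1, or 2) for one individual."""
    return sum(d * w for d, w in zip(dosages, weights))


def auc(case_scores, control_scores):
    """Probability that a random case outscores a random control (ties count 0.5)."""
    wins = 0.0
    for c in case_scores:
        for k in control_scores:
            if c > k:
                wins += 1.0
            elif c == k:
                wins += 0.5
    return wins / (len(case_scores) * len(control_scores))


def top_vs_middle_odds_ratio(scores, is_case, top_pct=95, mid_lo_pct=40, mid_hi_pct=60):
    """Odds of being a case in the top stratum divided by the odds in the middle stratum.

    The percentile cut-offs are assumptions for illustration. Each stratum must
    contain at least one control, and the middle stratum at least one case,
    otherwise a ZeroDivisionError is raised.
    """
    ranked = sorted(zip(scores, is_case))  # ascending by score
    n = len(ranked)
    top_group = [c for _, c in ranked[n * top_pct // 100:]]
    mid_group = [c for _, c in ranked[n * mid_lo_pct // 100:n * mid_hi_pct // 100]]

    def odds(group):
        cases = sum(group)
        return cases / (len(group) - cases)

    return odds(top_group) / odds(mid_group)


# Simulated example: 2,000 individuals, 20 variants, made-up weights.
random.seed(0)
weights = [random.gauss(0.0, 0.3) for _ in range(20)]
scores, is_case = [], []
for _ in range(2000):
    # Dosage = number of risk alleles, with an assumed allele frequency of 0.3.
    dosages = [(random.random() < 0.3) + (random.random() < 0.3) for _ in weights]
    s = polygenic_score(dosages, weights)
    p = 1.0 / (1.0 + math.exp(-(s - 2.0)))  # assumed logistic risk model
    scores.append(s)
    is_case.append(1 if random.random() < p else 0)

case_scores = [s for s, c in zip(scores, is_case) if c]
control_scores = [s for s, c in zip(scores, is_case) if not c]
print("AUC:", round(auc(case_scores, control_scores), 3))
print("Top 5% vs middle OR:", round(top_vs_middle_odds_ratio(scores, is_case), 2))
```

In a real analysis the AUC and stratum comparison would be computed in a held-out test sample, typically with adjustment for covariates such as age, sex, and genetic principal components; the sketch omits these steps.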