Genevieve L. Wojcik,Mariaelisa Graff,Katherine K. Nishimura,Ran Tao,Jeffrey Haessler,Christopher R. Gignoux,Heather M. Highland,Yesha M. Patel,Elena P. Sorokin,Christy L. Avery,Gillian M. Belbin,Stephanie A. Bien,Iona Cheng,Sinead Cullina,Chani J. Hodonsky,Yao Hu,Laura M. Huckins,Janina Jeff,Anne E. Justice,Jonathan M. Kocarnik,Unhee Lim,Bridget M. Lin,Yingchang Lu,Sarah C. Nelson,Sung-Shim L. Park,Hannah Poisner,Michael H. Preuss,Melissa A. Richard,Claudia Schurmann,Veronica W. Setiawan,Alexandra Sockell,Karan Vahi,Marie Verbanck,Abhishek Vishnu,Ryan W. Walker,Kristin L. Young,Niha Zubair,Victor Acuña-Alonso,Jose Luis Ambite,Kathleen C. Barnes,Eric Boerwinkle,Erwin P. Bottinger,Carlos D. Bustamante,Christian Caberto,Samuel Canizales-Quinteros,Matthew P. Conomos,Ewa Deelman,Ron Do,Kimberly Doheny,Lindsay Fernández-Rhodes,Myriam Fornage,Benyam Hailu,Gerardo Heiss,Brenna M. Henn,Lucia A. Hindorff,Rebecca D. Jackson,Cecelia A. Laurie,Cathy C. Laurie,Yuqing Li,Dan-Yu Lin,Andres Moreno-Estrada,Girish Nadkarni,Paul J. Norman,Loreall C. Pooler,Alexander P. Reiner,Jane Romm,Chiara Sabatti,Karla Sandoval,Xin Sheng,Eli A. Stahl,Daniel O. Stram,Timothy A. Thornton,Christina L. Wassel,Lynne R. Wilkens,Cheryl A. Winkler,Sachi Yoneyama,Steven Buyske,Christopher A. Haiman,Charles Kooperberg,Loic Le Marchand,Ruth J. F. Loos,Tara C. Matise,Kari E. North,Ulrike Peters,Eimear E. Kenny,Christopher S. Carlson

Abstract:Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry<a href="#ref-CR1">1</a>,<a href="#ref-CR2">2</a>,<a href="/articles/s41586-019-1310-4#ref-CR3">3</a>. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific<a href="#ref-CR4">4</a>,<a href="#ref-CR5">5</a>,<a href="#ref-CR6">6</a>,<a href="#ref-CR7">7</a>,<a href="#ref-CR8">8</a>,<a href="#ref-CR9">9</a>,<a href="/articles/s41586-019-1310-4#ref-CR10">10</a>. Additionally, effect sizes and their derived risk prediction scores derived in one population may not accurately extrapolate to other populations<a href="/articles/s41586-019-1310-4#ref-CR11">11</a>,<a href="/articles/s41586-019-1310-4#ref-CR12">12</a>. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States—where minority populations have a disproportionately higher burden of chronic conditions<a href="/articles/s41586-019-1310-4#ref-CR13">13</a>—the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.

Improving GWAS performance in underrepresented groups by appropriate modeling of genetics, environment, and sociocultural factors

Analyses of biomarker traits in diverse UK biobank participants identify associations missed by European-centric analysis strategies

Risk factors affecting polygenic score performance across diverse cohorts

Using Pre-training and Interaction Modeling for ancestry-specific disease prediction in UK Biobank

Improving GWAS discovery and genomic prediction accuracy in biobank data

Genetic analyses of diverse populations improves discovery for complex traits

Variable prediction accuracy of polygenic scores within an ancestry group

Family-GWAS reveals effects of environment and mating on genetic associations

Improving polygenic risk prediction in admixed populations by explicitly modeling ancestral-differential effects via GAUDI

All of Us diversity and scale improve polygenic prediction contextually with greatest improvements for under-represented populations

Leveraging trans-ethnic genetic risk scores to improve association power for complex traits in underrepresented populations

Pan-UK Biobank GWAS improves discovery, analysis of genetic architecture, and resolution into ancestry-enriched effects

Improving polygenic prediction in ancestrally diverse populations

XPXP: improving polygenic prediction by cross-population and cross-phenotype analysis

Harmonizing Genetic Ancestry and Self-identified Race/Ethnicity in Genome-wide Association Studies.

Multi-PGS enhances polygenic prediction by combining 937 polygenic scores

Quantifying Portable Genetic Effects and Improving Cross-Ancestry Genetic Prediction with GWAS Summary Statistics

Validity of European-centric cardiometabolic polygenic scores in multi-ancestry populations

Polygenic scoring accuracy varies across the genetic ancestry continuum

Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies

Polygenic Scores for Plasticity: A New Tool for Studying Gene-Environment Interplay