Genomic data in the All of Us Research Program
All of Us Research Program Genomics Investigators, Alexander G Bick, Ginger A Metcalf, Kelsey R Mayo, Lee Lichtenstein, Shimon Rura, Robert J Carroll, Anjene Musick, Jodell E Linder, I King Jordan, Shashwat Deepali Nagar, Shivam Sharma, Robert Meller, Melissa Basford, Eric Boerwinkle, Mine S Cicek, Kimberly F Doheny, Evan E Eichler, Stacey Gabriel, Richard A Gibbs, David Glazer, Paul A Harris, Gail P Jarvik, Anthony Philippakis, Heidi L Rehm, Dan M Roden, Stephen N Thibodeau, Scott Topper, Ashley L Blegen, Samantha J Wirkus, Victoria A Wagner, Jeffrey G Meyer, Donna M Muzny, Eric Venner, Michelle Z Mawhinney, Sean M L Griffith, Elvin Hsu, Hua Ling, Marcia K Adams, Kimberly Walker, Jianhong Hu, Harsha Doddapaneni, Christie L Kovar, Mullai Murugan, Shannon Dugan, Ziad Khan, Niall J Lennon, Christina Austin-Tse, Eric Banks, Michael Gatzen, Namrata Gupta, Emma Henricks, Katie Larsson, Sheli McDonough, Steven M Harrison, Christopher Kachulis, Matthew S Lebo, Cynthia L Neben, Marcie Steeves, Alicia Y Zhou, Joshua D Smith, Christian D Frazar, Colleen P Davis, Karynne E Patterson, Marsha M Wheeler, Sean McGee, Christina M Lockwood, Brian H Shirts, Colin C Pritchard, Mitzi L Murray, Valeria Vasta, Dru Leistritz, Matthew A Richardson, Jillian G Buchan, Aparna Radhakrishnan, Niklas Krumm, Brenna W Ehmen, Sophie Schwartz, M Morgan T Aster, Kristian Cibulskis, Andrea Haessly, Rebecca Asch, Aurora Cremer, Kylee Degatano, Akum Shergill, Laura D Gauthier, Samuel K Lee, Aaron Hatcher, George B Grant, Genevieve R Brandt, Miguel Covarrubias, Ashley Able, Ashley E Green, Jennifer Zhang, Henry R Condon, Yuanyuan Wang, Moira K Dillon, C H Albach, Wail Baalawi, Seung Hoan Choi, Xin Wang, Elisabeth A Rosenthal, Andrea H Ramirez, Sokny Lim, Siddhartha Nambiar, Bradley Ozenberger, Anastasia L Wise, Chris Lunt, Geoffrey S Ginsburg, Joshua C Denny
DOI: https://doi.org/10.1038/s41586-023-06957-x
IF: 64.8
Nature
Abstract: Comprehensively mapping the genetic basis of human disease across diverse individuals is a long-standing goal for the field of human genetics [1-4]. The All of Us Research Program is a longitudinal cohort study aiming to enrol a diverse group of at least one million individuals across the USA to accelerate biomedical research and improve human health [5,6]. Here we describe the programme's genomics data release of 245,388 clinical-grade genome sequences. This resource is unique in its diversity, as 77% of participants are from communities that are historically under-represented in biomedical research and 46% are individuals from under-represented racial and ethnic minorities. All of Us identified more than 1 billion genetic variants, including more than 275 million previously unreported genetic variants, more than 3.9 million of which had coding consequences. Leveraging linkage between genomic data and the longitudinal electronic health record, we evaluated 3,724 genetic variants associated with 117 diseases and found high replication rates across both participants of European ancestry and participants of African ancestry. Summary-level data are publicly available, and individual-level data can be accessed by researchers through the All of Us Researcher Workbench using a unique data passport model with a median time from initial researcher registration to data access of 29 hours. We anticipate that this diverse dataset will advance the promise of genomic medicine for all.