Simon Gravel,Brenna M. Henn,Ryan N. Gutenkunst,Amit R. Indap,Gabor T. Marth,Andrew G. Clark,Fuli Yu,Richard A. Gibbs,Carlos D. Bustamante,David L. Altshuler,Richard M. Durbin,Gonçalo R. Abecasis,David R. Bentley,Aravinda Chakravarti,Francis S. Collins,Francisco M. De La Vega,Peter Donnelly,Michael Egholm,Paul Flicek,Stacey B. Gabriel,Bartha M. Knoppers,Eric S. Lander,Hans Lehrach,Elaine R. Mardis,Gil A. McVean,Debbie A. Nickerson,Leena Peltonen,Alan J. Schafer,Stephen T. Sherry,Jun Wang,Richard K. Wilson,David Deiros,Mike Metzker,Donna Muzny,Jeff Reid,David Wheeler,Jingxiang Li,Min Jian,Guoqing Li,Ruiqiang Li,Huiqing Liang,Geng Tian,Bo Wang,Jian Wang,Wei Wang,Huanming Yang,Xiuqing Zhang,Huisong Zheng,Lauren Ambrogio,Toby Bloom,Kristian Cibulskis,Tim J. Fennell,David B. Jaffe,Erica Shefler,Carrie L. Sougnez,Niall Gormley,Sean Humphray,Zoya Kingsbury,Paula Koko-Gonzales,Jennifer Stone,Kevin J. McKernan,Gina L. Costa,Jeffry K. Ichikawa,Clarence C. Lee,Ralf Sudbrak,Tatiana A. Borodina,Andreas Dahl,Alexey N. Davydov,Peter Marquardt,Florian Mertes,Wilfiried Nietfeld,Philip Rosenstiel,Stefan Schreiber,Aleksey V. Soldatov,Bernd Timmermann,Marius Tolzmann,Jason Affourtit,Dana Ashworth,Said Attiya,Melissa Bachorski,Eli Buglione,Adam Burke,Amanda Caprio,Christopher Celone,Shauna Clark,David Conners,Brian Desany,Lisa Gu,Lorri Guccione,Kalvin Kao,Andrew Kebbel,Jennifer Knowlton,Matthew Labrecque,Louise McDade,Craig Mealmaker,Melissa Minderman,Anne Nawrocki,Faheem Niazi,Kristen Pareja,Ravi Ramenani,David Riches,Wanmin Song,Cynthia Turcotte,Shally Wang,David Dooling,Lucinda Fulton,Robert Fulton,George Weinstock,John Burton,David M. Carter,Carol Churcher,Alison Coffey,Anthony Cox,Aarno Palotie,Michael Quail,Tom Skelly,James Stalker,Harold P. Swerdlow,Daniel Turner,Anniek De Witte,Shane Giles,Matthew Bainbridge,Danny Challis,Aniko Sabo,Jin Yu,Xiaodong Fang,Xiaosen Guo,Yingrui Li,Ruibang Luo,Shuaishuai Tai,Honglong Wu,Hancheng Zheng,Xiaole Zheng,Yan Zhou,Erik P. Garrison,Weichun Huang,Amit Indap,Deniz Kural,Wan-Ping Lee,Wen Fung Leong,Aaron R. Quinlan,Chip Stewart,Michael P. Stromberg,Alistair N. Ward,Jiantao Wu,Charles Lee,Ryan E. Mills,Xinghua Shi,Mark J. Daly,Mark A. DePristo,Aaron D. Ball,Eric Banks,Brian L. Browning,Kiran V. Garimella,Sharon R. Grossman,Robert E. Handsaker,Matt Hanna,Chris Hartl,Andrew M. Kernytsky,Joshua M. Korn,Heng Li,Jared R. Maguire,Steven A. McCarroll,Aaron McKenna,James C. Nemesh,Anthony A. Philippakis,Ryan E. Poplin,Alkes Price,Manuel A. Rivas,Pardis C. Sabeti,Stephen F. Schaffner,Ilya A. Shlyakhter,David N. Cooper,Edward V. Ball,Matthew Mort,Andrew D. Phillips,Peter D. Stenson,Jonathan Sebat,Vladimir Makarov,Kenny Ye,Seungtai C. Yoon,Adam Boyko,Jeremiah Degenhardt,Mark Kaganovich,Alon Keinan,Phil Lacroute,Xin Ma,Andy Reynolds,Laura Clarke,Fiona Cunningham,Javier Herrero,Stephen Keenen,Eugene Kulesha,Rasko Leinonen,William M. McLaren,Rajesh Radhakrishnan,Richard E. Smith,Vadim Zalunin,Xiangqun Zheng-Bradley,Jan O. Korbel,Adrian M. Stütz,Markus Bauer,R. Keira Cheetham,Tony Cox,Michael Eberle,Terena James,Scott Kahn,Lisa Murray,Kai Ye,Yutao Fu,Fiona C. L. Hyland,Jonathan M. Manning,Stephen F. McLaughlin,Heather E. Peckham,Onur Sakarya,Yongming A. Sun,Eric F. Tsung,Mark A. Batzer,Miriam K. Konkel,Jerilyn A. Walker,Marcus W. Albrecht,Vyacheslav S. Amstislavskiy,Ralf Herwig,Dimitri V. Parkhomchuk,Richa Agarwala,Hoda M. Khouri,Aleksandr O. Morgulis,Justin E. Paschall,Lon D. Phan,Kirill E. Rotmistrovsky,Robert D. Sanders,Martin F. Shumway,Chunlin Xiao,Adam Auton,Zamin Iqbal,Gerton Lunter,Jonathan L. Marchini,Loukas Moutsianas,Simon Myers,Afidalina Tumian,James Knight,Roger Winer,David W. Craig,Steve M. Beckstrom-Sternberg,Alexis Christoforides,Ahmet A. Kurdoglu,John V. Pearson,Shripad A. Sinari,Waibhav D. Tembe,David Haussler,Angie S. Hinrichs,Sol J. Katzman,Andrew Kern,Robert M. Kuhn,Molly Przeworski,Ryan D. Hernandez,Bryan Howie,Joanna L. Kelley,S. Cord Melton,Yun Li,Paul Anderson,Tom Blackwell,Wei Chen,William O. Cookson,Jun Ding,Hyun Min Kang,Mark Lathrop,Liming Liang,Miriam F. Moffatt,Paul Scheet,Carlo Sidore,Matthew Snyder,Xiaowei Zhan,Sebastian Zöllner,Philip Awadalla,Ferran Casals,Youssef Idaghdour,John Keebler,Eric A. Stone,Martine Zilversmit,Lynn Jorde,Jinchuan Xing,Evan E. Eichler,Gozde Aksay,Can Alkan,Iman Hajirasouliha,Fereydoun Hormozdiari,Jeffrey M. Kidd,S. Cenk Sahinalp,Peter H. Sudmant,Ken Chen,Asif Chinwalla,Li Ding,Daniel C. Koboldt,Mike D. McLellan,John W. Wallis,Michael C. Wendl,Qunyuan Zhang,Cornelis A. Albers,Qasim Ayub,Senduran Balasubramaniam,Jeffrey C. Barrett,Yuan Chen,Donald F. Conrad,Petr Danecek,Emmanouil T. Dermitzakis,Min Hu,Ni Huang,Matt E. Hurles,Hanjun Jin,Luke Jostins,Thomas M. Keane,Si Quang Le,Sarah Lindsay,Quan Long,Daniel G. MacArthur,Stephen B. Montgomery,Leopold Parts,Chris Tyler-Smith,Klaudia Walter,Yujun Zhang,Mark B. Gerstein,Michael Snyder,Alexej Abyzov,Suganthi Balasubramanian,Robert Bjornson,Jiang Du,Fabian Grubert,Lukas Habegger,Rajini Haraksingh,Justin Jee,Ekta Khurana,Hugo Y. K. Lam,Jing Leng,Xinmeng Jasmine Mu,Alexander E. Urban,Zhengdong Zhang,Cristian Coafra,Huyen Dinh,Christie Kovar,Sandy Lee,Lynne Nazareth,Jane Wilkinson,Allison Coffey,Carol Scott,Neda Gharani,Jane S. Kaye,Alastair Kent,Taosha Li,Amy L. McGuire,Pilar N. Ossorio,Charles N. Rotimi,Yeyang Su,Lorraine H. Toji,Chris TylerSmith,Lisa D. Brooks,Adam L. Felsenfeld,Jean E. McEwen,Assya Abdallah,Christopher R. Juenger,Nicholas C. Clemm,Audrey Duncanson,Eric D. Green,Mark S. Guyer,Jane L. Peterson

Abstract: High-throughput sequencing technology enables population-level surveys of human genomic variation. Here, we examine the joint allele frequency distributions across continental human populations and present an approach for combining complementary aspects of whole-genome, low-coverage data and targeted high-coverage data. We apply this approach to data generated by the pilot phase of the Thousand Genomes Project, including whole-genome 2–4× coverage data for 179 samples from HapMap European, Asian, and African panels as well as high-coverage target sequencing of the exons of 800 genes from 697 individuals in seven populations. We use the site frequency spectra obtained from these data to infer demographic parameters for an Out-of-Africa model for populations of African, European, and Asian descent and to predict, by a jackknife-based approach, the amount of genetic diversity that will be discovered as sample sizes are increased. We predict that the number of discovered nonsynonymous coding variants will reach 100,000 in each population after ∼1,000 sequenced chromosomes per population, whereas ∼2,500 chromosomes will be needed for the same number of synonymous variants. Beyond this point, the number of segregating sites in the European and Asian panel populations is expected to exceed that of the African panel because of faster recent population growth. Overall, we find that the majority of human genomic variable sites are rare and exhibit little sharing among diverged populations. Our results emphasize that replication of disease association for specific rare genetic variants across diverged populations must overcome both reduced statistical power because of rarity and higher population divergence.
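To make the link between a site frequency spectrum and the number of variants discovered at a given sample size concrete, here is a minimal Python sketch. It is not the jackknife-based extrapolation used in the paper: it only tabulates an unfolded spectrum from derived-allele counts and projects it down to a smaller sample by hypergeometric subsampling, a standard exact calculation. The function names and the toy counts are hypothetical and chosen purely for illustration.

```python
from math import comb


def site_frequency_spectrum(derived_counts, n):
    """Unfolded SFS: sfs[i] is the number of sites at which the derived
    allele is observed i times among n sampled chromosomes."""
    sfs = [0] * (n + 1)
    for c in derived_counts:
        if not 0 <= c <= n:
            raise ValueError(f"derived count {c} outside 0..{n}")
        sfs[c] += 1
    return sfs


def project_sfs(sfs, m):
    """Expected SFS in a random subsample of m chromosomes drawn without
    replacement from the n = len(sfs) - 1 originally sampled chromosomes."""
    n = len(sfs) - 1
    if not 0 < m <= n:
        raise ValueError("m must satisfy 0 < m <= n")
    projected = [0.0] * (m + 1)
    total = comb(n, m)
    for i, count in enumerate(sfs):
        if count == 0:
            continue
        # j = number of derived copies that land in the subsample
        for j in range(max(0, m - (n - i)), min(i, m) + 1):
            projected[j] += count * comb(i, j) * comb(n - i, m - j) / total
    return projected


def expected_segregating_sites(sfs, m):
    """Expected number of sites that are variable (neither absent nor fixed)
    in a subsample of m chromosomes."""
    projected = project_sfs(sfs, m)
    return sum(projected[1:m])


# Toy data (hypothetical): derived-allele counts at six sites, n = 4 chromosomes.
toy_counts = [1, 1, 2, 3, 1, 4]
toy_sfs = site_frequency_spectrum(toy_counts, 4)
for m in (2, 3, 4):
    print(m, round(expected_segregating_sites(toy_sfs, m), 3))
```

Down-projection of this kind shows how quickly rare variants drop out of smaller samples. Predicting counts for samples larger than the one observed, as the abstract describes, requires an extrapolation method instead.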

Demographic History and Rare Allele Sharing among Human Populations