Demographic History and Rare Allele Sharing among Human Populations

Simon Gravel,Brenna M. Henn,Ryan N. Gutenkunst,Amit R. Indap,Gabor T. Marth,Andrew G. Clark,Fuli Yu,Richard A. Gibbs,Carlos D. Bustamante,David L. Altshuler,Richard M. Durbin,Gonçalo R. Abecasis,David R. Bentley,Aravinda Chakravarti,Francis S. Collins,Francisco M. De La Vega,Peter Donnelly,Michael Egholm,Paul Flicek,Stacey B. Gabriel,Bartha M. Knoppers,Eric S. Lander,Hans Lehrach,Elaine R. Mardis,Gil A. McVean,Debbie A. Nickerson,Leena Peltonen,Alan J. Schafer,Stephen T. Sherry,Jun Wang,Richard K. Wilson,David Deiros,Mike Metzker,Donna Muzny,Jeff Reid,David Wheeler,Jingxiang Li,Min Jian,Guoqing Li,Ruiqiang Li,Huiqing Liang,Geng Tian,Bo Wang,Jian Wang,Wei Wang,Huanming Yang,Xiuqing Zhang,Huisong Zheng,Lauren Ambrogio,Toby Bloom,Kristian Cibulskis,Tim J. Fennell,David B. Jaffe,Erica Shefler,Carrie L. Sougnez,Niall Gormley,Sean Humphray,Zoya Kingsbury,Paula Koko-Gonzales,Jennifer Stone,Kevin J. McKernan,Gina L. Costa,Jeffry K. Ichikawa,Clarence C. Lee,Ralf Sudbrak,Tatiana A. Borodina,Andreas Dahl,Alexey N. Davydov,Peter Marquardt,Florian Mertes,Wilfiried Nietfeld,Philip Rosenstiel,Stefan Schreiber,Aleksey V. Soldatov,Bernd Timmermann,Marius Tolzmann,Jason Affourtit,Dana Ashworth,Said Attiya,Melissa Bachorski,Eli Buglione,Adam Burke,Amanda Caprio,Christopher Celone,Shauna Clark,David Conners,Brian Desany,Lisa Gu,Lorri Guccione,Kalvin Kao,Andrew Kebbel,Jennifer Knowlton,Matthew Labrecque,Louise McDade,Craig Mealmaker,Melissa Minderman,Anne Nawrocki,Faheem Niazi,Kristen Pareja,Ravi Ramenani,David Riches,Wanmin Song,Cynthia Turcotte,Shally Wang,David Dooling,Lucinda Fulton,Robert Fulton,George Weinstock,John Burton,David M. Carter,Carol Churcher,Alison Coffey,Anthony Cox,Aarno Palotie,Michael Quail,Tom Skelly,James Stalker,Harold P. Swerdlow,Daniel Turner,Anniek De Witte,Shane Giles,Matthew Bainbridge,Danny Challis,Aniko Sabo,Jin Yu,Xiaodong Fang,Xiaosen Guo,Yingrui Li,Ruibang Luo,Shuaishuai Tai,Honglong Wu,Hancheng Zheng,Xiaole Zheng,Yan Zhou,Erik P. Garrison,Weichun Huang,Amit Indap,Deniz Kural,Wan-Ping Lee,Wen Fung Leong,Aaron R. Quinlan,Chip Stewart,Michael P. Stromberg,Alistair N. Ward,Jiantao Wu,Charles Lee,Ryan E. Mills,Xinghua Shi,Mark J. Daly,Mark A. DePristo,Aaron D. Ball,Eric Banks,Brian L. Browning,Kiran V. Garimella,Sharon R. Grossman,Robert E. Handsaker,Matt Hanna,Chris Hartl,Andrew M. Kernytsky,Joshua M. Korn,Heng Li,Jared R. Maguire,Steven A. McCarroll,Aaron McKenna,James C. Nemesh,Anthony A. Philippakis,Ryan E. Poplin,Alkes Price,Manuel A. Rivas,Pardis C. Sabeti,Stephen F. Schaffner,Ilya A. Shlyakhter,David N. Cooper,Edward V. Ball,Matthew Mort,Andrew D. Phillips,Peter D. Stenson,Jonathan Sebat,Vladimir Makarov,Kenny Ye,Seungtai C. Yoon,Adam Boyko,Jeremiah Degenhardt,Mark Kaganovich,Alon Keinan,Phil Lacroute,Xin Ma,Andy Reynolds,Laura Clarke,Fiona Cunningham,Javier Herrero,Stephen Keenen,Eugene Kulesha,Rasko Leinonen,William M. McLaren,Rajesh Radhakrishnan,Richard E. Smith,Vadim Zalunin,Xiangqun Zheng-Bradley,Jan O. Korbel,Adrian M. Stütz,Markus Bauer,R. Keira Cheetham,Tony Cox,Michael Eberle,Terena James,Scott Kahn,Lisa Murray,Kai Ye,Yutao Fu,Fiona C. L. Hyland,Jonathan M. Manning,Stephen F. McLaughlin,Heather E. Peckham,Onur Sakarya,Yongming A. Sun,Eric F. Tsung,Mark A. Batzer,Miriam K. Konkel,Jerilyn A. Walker,Marcus W. Albrecht,Vyacheslav S. Amstislavskiy,Ralf Herwig,Dimitri V. Parkhomchuk,Richa Agarwala,Hoda M. Khouri,Aleksandr O. Morgulis,Justin E. Paschall,Lon D. Phan,Kirill E. Rotmistrovsky,Robert D. Sanders,Martin F. Shumway,Chunlin Xiao,Adam Auton,Zamin Iqbal,Gerton Lunter,Jonathan L. Marchini,Loukas Moutsianas,Simon Myers,Afidalina Tumian,James Knight,Roger Winer,David W. Craig,Steve M. Beckstrom-Sternberg,Alexis Christoforides,Ahmet A. Kurdoglu,John V. Pearson,Shripad A. Sinari,Waibhav D. Tembe,David Haussler,Angie S. Hinrichs,Sol J. Katzman,Andrew Kern,Robert M. Kuhn,Molly Przeworski,Ryan D. Hernandez,Bryan Howie,Joanna L. Kelley,S. Cord Melton,Yun Li,Paul Anderson,Tom Blackwell,Wei Chen,William O. Cookson,Jun Ding,Hyun Min Kang,Mark Lathrop,Liming Liang,Miriam F. Moffatt,Paul Scheet,Carlo Sidore,Matthew Snyder,Xiaowei Zhan,Sebastian Zöllner,Philip Awadalla,Ferran Casals,Youssef Idaghdour,John Keebler,Eric A. Stone,Martine Zilversmit,Lynn Jorde,Jinchuan Xing,Evan E. Eichler,Gozde Aksay,Can Alkan,Iman Hajirasouliha,Fereydoun Hormozdiari,Jeffrey M. Kidd,S. Cenk Sahinalp,Peter H. Sudmant,Ken Chen,Asif Chinwalla,Li Ding,Daniel C. Koboldt,Mike D. McLellan,John W. Wallis,Michael C. Wendl,Qunyuan Zhang,Cornelis A. Albers,Qasim Ayub,Senduran Balasubramaniam,Jeffrey C. Barrett,Yuan Chen,Donald F. Conrad,Petr Danecek,Emmanouil T. Dermitzakis,Min Hu,Ni Huang,Matt E. Hurles,Hanjun Jin,Luke Jostins,Thomas M. Keane,Si Quang Le,Sarah Lindsay,Quan Long,Daniel G. MacArthur,Stephen B. Montgomery,Leopold Parts,Chris Tyler-Smith,Klaudia Walter,Yujun Zhang,Mark B. Gerstein,Michael Snyder,Alexej Abyzov,Suganthi Balasubramanian,Robert Bjornson,Jiang Du,Fabian Grubert,Lukas Habegger,Rajini Haraksingh,Justin Jee,Ekta Khurana,Hugo Y. K. Lam,Jing Leng,Xinmeng Jasmine Mu,Alexander E. Urban,Zhengdong Zhang,Cristian Coafra,Huyen Dinh,Christie Kovar,Sandy Lee,Lynne Nazareth,Jane Wilkinson,Allison Coffey,Carol Scott,Neda Gharani,Jane S. Kaye,Alastair Kent,Taosha Li,Amy L. McGuire,Pilar N. Ossorio,Charles N. Rotimi,Yeyang Su,Lorraine H. Toji,Chris TylerSmith,Lisa D. Brooks,Adam L. Felsenfeld,Jean E. McEwen,Assya Abdallah,Christopher R. Juenger,Nicholas C. Clemm,Audrey Duncanson,Eric D. Green,Mark S. Guyer,Jane L. Peterson
DOI: https://doi.org/10.1073/pnas.1019276108
IF: 11.1
2011-01-01
Proceedings of the National Academy of Sciences
Abstract: High-throughput sequencing technology enables population-level surveys of human genomic variation. Here, we examine the joint allele frequency distributions across continental human populations and present an approach for combining complementary aspects of whole-genome, low-coverage data and targeted high-coverage data. We apply this approach to data generated by the pilot phase of the Thousand Genomes Project, including whole-genome 2–4× coverage data for 179 samples from HapMap European, Asian, and African panels as well as high-coverage target sequencing of the exons of 800 genes from 697 individuals in seven populations. We use the site frequency spectra obtained from these data to infer demographic parameters for an Out-of-Africa model for populations of African, European, and Asian descent and to predict, by a jackknife-based approach, the amount of genetic diversity that will be discovered as sample sizes are increased. We predict that the number of discovered nonsynonymous coding variants will reach 100,000 in each population after ∼1,000 sequenced chromosomes per population, whereas ∼2,500 chromosomes will be needed for the same number of synonymous variants. Beyond this point, the number of segregating sites in the European and Asian panel populations is expected to overcome that of the African panel because of faster recent population growth. Overall, we find that the majority of human genomic variable sites are rare and exhibit little sharing among diverged populations. Our results emphasize that replication of disease association for specific rare genetic variants across diverged populations must overcome both reduced statistical power because of rarity and higher population divergence.
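To make the link between a site frequency spectrum and the number of variants discovered at a given sample size concrete, here is a minimal Python sketch. It is not the jackknife-based extrapolation described in the abstract: it only projects an observed spectrum down to smaller samples using hypergeometric subsampling, which is the exact interpolation counterpart of that question. The function names `site_frequency_spectrum` and `expected_segregating_sites` and the toy allele counts are hypothetical and chosen for illustration only.

```python
from math import comb


def site_frequency_spectrum(derived_counts, n):
    """Return sfs, where sfs[i] is the number of sites whose derived
    allele is carried by exactly i of the n sampled chromosomes."""
    sfs = [0] * (n + 1)
    for count in derived_counts:
        if not 0 <= count <= n:
            raise ValueError(f"allele count {count} outside 0..{n}")
        sfs[count] += 1
    return sfs


def expected_segregating_sites(sfs, m):
    """Expected number of sites still polymorphic in a random subsample
    of m chromosomes drawn without replacement from the original n."""
    n = len(sfs) - 1
    if not 2 <= m <= n:
        raise ValueError("subsample size m must satisfy 2 <= m <= n")
    denom = comb(n, m)
    total = 0.0
    for i in range(1, n):  # skip sites fixed in the full sample
        # The site is lost if the subsample holds only ancestral
        # copies or only derived copies of the allele.
        p_all_ancestral = comb(n - i, m) / denom
        p_all_derived = comb(i, m) / denom
        total += sfs[i] * (1.0 - p_all_ancestral - p_all_derived)
    return total


# Toy data (made up): derived-allele counts at four sites, n = 4.
toy_sfs = site_frequency_spectrum([1, 1, 2, 3], n=4)
for m in (2, 3, 4):
    print(m, expected_segregating_sites(toy_sfs, m))
```

Because rare variants are the first to drop out of a subsample, the curve of expected segregating sites against m is steepest when the spectrum is dominated by singletons. That is why a spectrum rich in rare variants implies that many more variants remain to be found as sample sizes grow.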