Mapping and sequencing of structural variation from eight human genomes

Jeffrey M. Kidd,Gregory M. Cooper,William F. Donahue,Hillary S. Hayden,Nick Sampas,Tina Graves,Nancy Hansen,Brian Teague,Can Alkan,Francesca Antonacci,Eric Haugen,Troy Zerr,N. Alice Yamada,Peter Tsang,Tera L. Newman,Eray Tüzün,Ze Cheng,Heather M. Ebling,Nadeem Tusneem,Robert David,Will Gillett,Karen A. Phelps,Molly Weaver,David Saranga,Adrianne Brand,Wei Tao,Erik Gustafson,Kevin McKernan,Lin Chen,Maika Malig,Joshua D. Smith,Joshua M. Korn,Steven A. McCarroll,David A. Altshuler,Daniel A. Peiffer,Michael Dorschner,John Stamatoyannopoulos,David Schwartz,Deborah A. Nickerson,James C. Mullikin,Richard K. Wilson,Laurakay Bruhn,Maynard V. Olson,Rajinder Kaul,Douglas R. Smith,Evan E. Eichler
DOI: https://doi.org/10.1038/nature06862
IF: 64.8
2008-05-01
Nature
Abstract:Genetic variation among individual humans occurs on many different scales, ranging from gross alterations in the human karyotype to single nucleotide changes. Here we explore variation on an intermediate scale—particularly insertions, deletions and inversions affecting from a few thousand to a few million base pairs. We employed a clone-based method to interrogate this intermediate structural variation in eight individuals of diverse geographic ancestry. Our analysis provides a comprehensive overview of the normal pattern of structural variation present in these genomes, refining the location of 1,695 structural variants. We find that 50% were seen in more than one individual and that nearly half lay outside regions of the genome previously described as structurally variant. We discover 525 new insertion sequences that are not present in the human reference genome and show that many of these are variable in copy number between individuals. Complete sequencing of 261 structural variants reveals considerable locus complexity and provides insights into the different mutational processes that have shaped the human genome. These data provide the first high-resolution sequence map of human structural variation—a standard for genotyping platforms and a prelude to future individual genome sequencing projects.
multidisciplinary sciences
What problem does this paper attempt to address?