Genome sequence of the palaeopolyploid soybean
Jeremy Schmutz,Steven B. Cannon,Jessica Schlueter,Jianxin Ma,Therese Mitros,William Nelson,David L. Hyten,Qijian Song,Jay J. Thelen,Jianlin Cheng,Dong Xu,Uffe Hellsten,Gregory D. May,Yeisoo Yu,Tetsuya Sakurai,Taishi Umezawa,Madan K. Bhattacharyya,Devinder Sandhu,Babu Valliyodan,Erika Lindquist,Myron Peto,David Grant,Shengqiang Shu,David Goodstein,Kerrie Barry,Montona Futrell-Griggs,Brian Abernathy,Jianchang Du,Zhixi Tian,Liucun Zhu,Navdeep Gill,Trupti Joshi,Marc Libault,Anand Sethuraman,Xue-Cheng Zhang,Kazuo Shinozaki,Henry T. Nguyen,Rod A. Wing,Perry Cregan,James Specht,Jane Grimwood,Dan Rokhsar,Gary Stacey,Randy C. Shoemaker,Scott A. Jackson
DOI: https://doi.org/10.1038/nature08670
IF: 64.8
2010-01-01
Nature
Abstract:Soybean (Glycine max) is one of the most important crop plants for seed protein and oil content, and for its capacity to fix atmospheric nitrogen through symbioses with soil-borne microorganisms. We sequenced the 1.1-gigabase genome by a whole-genome shotgun approach and integrated it with physical and high-density genetic maps to create a chromosome-scale draft sequence assembly. We predict 46,430 protein-coding genes, 70% more than Arabidopsis and similar to the poplar genome which, like soybean, is an ancient polyploid (palaeopolyploid). About 78% of the predicted genes occur in chromosome ends, which comprise less than one-half of the genome but account for nearly all of the genetic recombination. Genome duplications occurred at approximately 59 and 13 million years ago, resulting in a highly duplicated genome with nearly 75% of the genes present in multiple copies. The two duplication events were followed by gene diversification and loss, and numerous chromosome rearrangements. An accurate soybean genome sequence will facilitate the identification of the genetic basis of many soybean traits, and accelerate the creation of improved soybean varieties.
multidisciplinary sciences