A Draft Sequence of the Rice (Oryza sativa ssp. indica) Genome
Jun Yu,Songnian Hu,Jun Wang,Songgang Li,Ka-Shu Gane Wong,Bin Liu,Yajun Deng,Li Dai,Yan Zhou,Xiuqing Zhang,Mengliang Cao,Jing Liu,Jiandong Sun,Jiabin Tang,Yanjiong Chen,Xiaobing Huang,Wei Lin,Chen Ye,Wei Tong,Lijuan Cong,Jianing Geng,Yujun Han,Lin Li,Wei Li,Guangqiang Hu,Xiangang Huang,Wenjie Li,Jian Li,Zhanwei Liu,Long Li,Jianping Liu,Qiuhui Qi,Jinsong Liu,Li,Xuegang Wang,Hong Lu,Tingting Wu,Miao Zhu,Peixiang Ni,Hua Han,Wei Dong,Xiaoyu Ren,Xiaoli Feng,Peng Cui,Xianran Li,Hao Wang,Xin Xu,Wenxue Zhai,Zhao Xu,Jinsong Zhang,Sijie He,Jianguo Zhang,Jichen Xu,Kunlin Zhang,Xianwu Zheng,Jianhai Dong,Wanyong Zeng,Lin Tao,Xuewei Chen,Jun He,Daofeng Liu,Wei Tian,Chaoguang Tian,Hongai Xia,Gang Li,Hui Gao,Ping Li,Wei Chen,Xudong Wang,Yong Zhang,Jianfei Hu,Jing Wang,Song Liu,Jian Yang,Guangyu Zhang,Yuqing Xiong,Zhijie Li,Long Mao,Chengshu Zhou,Zhen Zhu,Runsheng Chen,Bailin Hao,Weimou Zheng,Shouyi Chen,Wei Guo,Guojie Li,Siqi Liu,Guyang Huang,Ming Tao,Jian Wang,Lihuang Zhu,Longping Yuan,Huanming Yang
DOI: https://doi.org/10.1007/bf02901901
2001-01-01
Chinese Science Bulletin
Abstract: The sequence of the rice genome holds fundamental information for its biology, including physiology, genetics, development, and evolution, as well as information on many beneficial phenotypes of economic significance. Using a "whole genome shotgun" approach, we have produced a draft rice genome sequence of Oryza sativa ssp. indica, the major crop rice subspecies in China and many other regions of Asia. The draft genome sequence is constructed from over 4.3 million successful sequencing traces with a cumulative total length of 2214.9 Mb. The initial assembly of the non-redundant sequences reached 409.76 Mb in length, based on 3.30 million successful sequencing traces with a total length of 1797.4 Mb from the indica cultivar 93-11, giving an estimated coverage of 95.29% of the rice genome with an average base accuracy above 99%. The coverage of the draft sequence, the randomness of the sequence distribution, and the consistency of BIG-ASSEMBLER, a custom-designed software package used for the initial assembly, were verified rigorously by comparison against finished BAC clone sequences from both indica and japonica strains available in public databases. Overall, 96.3% of full-length cDNAs, 96.4% of STS, STR, and RFLP markers, 94.0% of ESTs, and 94.9% of unigene clusters were identified in the draft sequence. Our preliminary analysis of the data set shows that the rice draft sequence is consistent with the common standard accepted by the genome sequencing community. The unconditional release of the draft to the public provides a fundamental resource for the international scientific community, facilitating genomic and genetic studies of rice biology.
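The figures quoted in the abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and the derived quantities (implied genome size, mean trace length, trace redundancy) are not stated in the abstract. It assumes that the 95.29% coverage figure is the ratio of the assembly length to the estimated genome size, and that 1 Mb equals 10^6 bases.

```python
# Figures quoted in the abstract (Mb = megabases).
ASSEMBLY_MB = 409.76       # length of the non-redundant initial assembly
TRACE_TOTAL_MB = 1797.4    # total length of the traces used in that assembly
TRACE_COUNT = 3.30e6       # number of successful traces used in that assembly
GENOME_FRACTION = 0.9529   # estimated fraction of the genome covered


def implied_genome_size_mb(assembly_mb: float, fraction: float) -> float:
    """Genome size implied by the assembly length and the covered fraction.

    Assumption: coverage = assembly length / genome size.
    """
    return assembly_mb / fraction


def mean_trace_length_bp(total_mb: float, count: float) -> float:
    """Average length of one sequencing trace, in base pairs."""
    return total_mb * 1e6 / count


def trace_redundancy(total_mb: float, assembly_mb: float) -> float:
    """How many times, on average, each assembled base was sequenced."""
    return total_mb / assembly_mb


if __name__ == "__main__":
    size = implied_genome_size_mb(ASSEMBLY_MB, GENOME_FRACTION)
    length = mean_trace_length_bp(TRACE_TOTAL_MB, TRACE_COUNT)
    depth = trace_redundancy(TRACE_TOTAL_MB, ASSEMBLY_MB)
    print(f"implied genome size: {size:.1f} Mb")
    print(f"mean trace length:   {length:.0f} bp")
    print(f"trace redundancy:    {depth:.2f}x")
```

Under these assumptions the abstract's numbers imply a genome of roughly 430 Mb, a mean trace length of roughly 545 bp, and about 4.4-fold redundancy over the assembled sequence.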