Pan-genome analysis highlights the extent of genomic variation in cultivated and wild rice

Qiang Zhao, Qi Feng, Hengyun Lu, Yan Li, Ahong Wang, Qilin Tian, Qilin Zhan, Yiqi Lu, Lei Zhang, Tao Huang, Yongchun Wang, Danlin Fan, Yan Zhao, Ziqun Wang, Congcong Zhou, Jiaying Chen, Chuanrang Zhu, Wenjun Li, Qijun Weng, Qun Xu, Zi-Xuan Wang, Xinghua Wei, Bin Han, Xuehui Huang
DOI: https://doi.org/10.1038/s41588-018-0041-z
IF: 30.8
2018-01-15
Nature Genetics
Abstract: The rich genetic diversity in Oryza sativa and Oryza rufipogon serves as the main source of variation for rice breeding. Large-scale resequencing has been undertaken to discover allelic variants in rice, but much of the information on genetic variation is lost when short sequence reads are mapped directly onto the O. sativa japonica Nipponbare reference genome. Here we constructed a pan-genome dataset of the O. sativa–O. rufipogon species complex through deep sequencing and de novo assembly of 66 divergent accessions. Intergenomic comparisons identified 23 million sequence variants in the rice genome. This catalog of sequence variations includes many known quantitative trait nucleotides and will be helpful in pinpointing new causal variants that underlie complex traits. In particular, we systematically investigated the whole set of coding genes using this pan-genome data, which revealed extensive presence/absence variation among rice accessions. This pan-genome resource will further promote evolutionary and functional studies in rice.
genetics & heredity