A rice variation map derived from 10 548 rice accessions reveals the importance of rare variants

Tianyi Wang,Wenchuang He,Xiaoxia Li,Chao Zhang,Huiying He,Qiaoling Yuan,Bin Zhang,Hong Zhang,Yue Leng,Hua Wei,Qiang Xu,Chuanlin Shi,Xiangpei Liu,Mingliang Guo,Xianmeng Wang,Wu Chen,Zhipeng Zhang,Longbo Yang,Yang Lv,Hongge Qian,Bintao Zhang,Xiaoman Yu,Congcong Liu,Xinglan Cao,Yan Cui,Qianqian Zhang,Xiaofan Dai,Longbiao Guo,Yuexing Wang,Yongfeng Zhou,Jue Ruan,Qian Qian,Lianguang Shang
DOI: https://doi.org/10.1093/nar/gkad840
IF: 14.9
2023-10-16
Nucleic Acids Research
Abstract:Abstract Detailed knowledge of the genetic variations in diverse crop populations forms the basis for genetic crop improvement and gene functional studies. In the present study, we analyzed a large rice population with a total of 10 548 accessions to construct a rice super-population variation map (RSPVM), consisting of 54 378 986 single nucleotide polymorphisms, 11 119 947 insertion/deletion mutations and 184 736 presence/absence variations. Assessment of variation detection efficiency for different population sizes revealed a sharp increase of all types of variation as the population size increased and a gradual saturation of that after the population size reached 10 000. Variant frequency analysis indicated that ∼90% of the obtained variants were rare, and would therefore likely be difficult to detect in a relatively small population. Among the rare variants, only 2.7% were predicted to be deleterious. Population structure, genetic diversity and gene functional polymorphism of this large population were evaluated based on different subsets of RSPVM, demonstrating the great potential of RSPVM for use in downstream applications. Our study provides both a rich genetic basis for understanding natural rice variations and a powerful tool for exploiting great potential of rare variants in future rice research, including population genetics and functional genomics.
biochemistry & molecular biology
What problem does this paper attempt to address?