Pan-Genome of Wild and Cultivated Soybeans.

Yucheng Liu,Huilong Du,Pengcheng Li,Yanting Shen,Hua Peng,Shulin Liu,Guo-An Zhou,Haikuan Zhang,Zhi Liu,Miao Shi,Xuehui Huang,Yan Li,Min Zhang,Zheng Wang,Baoge Zhu,Bin Han,Chengzhi Liang,Zhixi Tian
DOI: https://doi.org/10.1016/j.cell.2020.05.023
IF: 64.5
2020-01-01
Cell
Abstract:Soybean is one of the most important vegetable oil and protein feed crops. To capture the entire genomic diversity, it is needed to construct a complete high-quality pan-genome from diverse soybean accessions. In this study, we performed individual de novo genome assemblies for 26 representative soybeans that were selected from 2,898 deeply sequenced accessions. Using these assembled genomes together with three previously reported genomes, we constructed a graph-based genome and performed pan-genome analysis, which identified numerous genetic variations that cannot be detected by direct mapping of short sequence reads onto a single reference genome. The structural variations from the 2,898 accessions that were genotyped based on the graph-based genome and the RNA sequencing (RNA-seq) data from the representative 26 accessions helped to link genetic variations to candidate genes that are responsible for important traits. This pan-genome resource will promote evolutionary and functional genomics studies in soybean.
What problem does this paper attempt to address?