Eight high-quality genomes reveal pan-genome architecture and ecotype differentiation of Brassica napus

Jia-Ming Song,Zhilin Guan,Jianlin Hu,Chaocheng Guo,Zhiquan Yang,Shuo Wang,Dongxu Liu,Bo Wang,Shaoping Lu,Run Zhou,Wen-Zhao Xie,Yuanfang Cheng,Yuting Zhang,Kede Liu,Qing-Yong Yang,Ling-Ling Chen,Liang Guo
DOI: https://doi.org/10.1038/s41477-019-0577-7
IF: 18
2020-01-01
Nature Plants
Abstract:Abstract Rapeseed ( Brassica napus ) is the second most important oilseed crop in the world but the genetic diversity underlying its massive phenotypic variations remains largely unexplored. Here, we report the sequencing, de novo assembly and annotation of eight B. napus accessions. Using pan-genome comparative analysis, millions of small variations and 77.2–149.6 megabase presence and absence variations (PAVs) were identified. More than 9.4% of the genes contained large-effect mutations or structural variations. PAV-based genome-wide association study (PAV-GWAS) directly identified causal structural variations for silique length, seed weight and flowering time in a nested association mapping population with ZS11 (reference line) as the donor, which were not detected by single-nucleotide polymorphisms-based GWAS (SNP-GWAS), demonstrating that PAV-GWAS was complementary to SNP-GWAS in identifying associations to traits. Further analysis showed that PAVs in three FLOWERING LOCUS C genes were closely related to flowering time and ecotype differentiation. This study provides resources to support a better understanding of the genome architecture and acceleration of the genetic improvement of B. napus .
plant sciences
What problem does this paper attempt to address?