Deciphering variation of 239 elite japonica rice genomes for whole genome sequences-enabled breeding

Chuanxue Liu,Pei Peng,Weiguo Li,Changrong Ye,Shuhua Zhang,Ruiying Wang,Dong Li,Shiwu Guan,Lanmin Zhang,Xiaoqun Huang,Zhenhua Guo,Junxiang Guo,Yu Long,Le Li,Guojun Pan,Bingchuan Tian,Jinhua Xiao
DOI: https://doi.org/10.1016/j.ygeno.2021.07.002
IF: 4.31
2021-09-01
Genomics
Abstract: Revealing genomic variation of representative and diverse germplasm is the cornerstone of deploying genomics information into genetic improvement programs of species of agricultural importance. Here we report the re-sequencing of 239 *japonica* rice elites representing the genetic diversity of *japonica* germplasm in China, Japan and Korea. A total of 4.8 million SNPs and PAV of 35,634 genes were identified. The elites from Japan and Korea are closely related and relatively less diverse than those from China. A *japonica* rice pan-genome was constructed, and 35 Mb non-redundant novel sequences were identified, from which 1131 novel genes were predicted. Strong selection signals of genomic regions were detected on most of the chromosomes. The heading date genes *Hd1* and *Hd3a* have been artificially selected during the breeding process. The results from this study lay the foundation for future whole genome sequences-enabled breeding in rice and provide a paradigm for other species.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?
The paper aims to characterize the genomic variation of 239 representative elite japonica rice varieties from China, Japan, and Korea in order to support breeding programs based on whole-genome sequencing. Specifically, the authors resequence these varieties to identify single nucleotide polymorphisms (SNPs) and presence-absence variations (PAVs), construct a japonica rice pan-genome, and discover novel gene sequences. The study also aims to detect signals of artificial selection, particularly in gene regions associated with important agronomic traits, thereby providing a foundation for future breeding driven by whole-genome sequencing.

### Main Research Objectives:

1. **Reveal Genomic Variations**: Identify SNPs and PAVs among 239 elite japonica rice varieties through resequencing.
2. **Construct Pan-Genome**: Use the genomic data of these varieties to construct a japonica rice pan-genome that includes novel gene sequences.
3. **Detect Selection Signals**: Detect signals of artificial selection in the genome, especially in gene regions related to important agronomic traits (such as flowering time and disease resistance).
4. **Analyze Known Important Genes**: Study allelic variation at the loci of known important genes in these varieties to assess their potential value in breeding.

### Research Background:

- **Limitations of Traditional Breeding**: Traditional breeding relies mainly on phenotypic traits to select parents and design hybrid combinations, which is time-consuming and inefficient.
- **Application of Molecular Markers**: Molecular marker technologies such as RFLP, RAPD, AFLP, SSR, and SNP make it possible to evaluate the genetic diversity of core germplasm resources or parents on a genome-wide scale, thereby improving the accuracy of parent selection and hybrid design.
- **Advantages of Whole-Genome Sequencing**: The development of whole-genome sequencing technology allows for a more comprehensive understanding of genomic variations in each parent, including SNPs, structural variations, and presence-absence variations, thereby improving breeding efficiency.

### Research Methods:

- **Sample Collection and Genome Resequencing**: Collect leaf tissues from 239 japonica rice varieties, extract genomic DNA, and perform whole-genome sequencing.
- **SNP and PAV Analysis**: Use tools like BWA and GATK for SNP identification and filtering, and use the EUPAN package for PAV analysis.
- **Population Structure Analysis**: Use ADMIXTURE and neighbor-joining tree methods for population structure analysis to reveal the genetic relationships of these varieties.
- **Variation Analysis of Known Important Genes**: Analyze allele variations at loci of known important genes in these varieties.
- **Pan-Genome Construction**: Perform de novo assembly of reads not aligned to the reference genome, remove redundant and contaminant sequences, and construct the japonica rice pan-genome.

### Research Results:

- **Genomic Variations**: A total of 4.8 million SNPs were identified, with 62.6% located in intergenic regions and 37.4% in genic regions.
- **Population Structure**: Based on SNP and PAV analysis, these varieties were divided into 4 subgroups, with significant genetic differences between Chinese varieties and those from Japan and Korea.
- **Variation of Known Important Genes**: Favorable alleles were found at loci of several important disease resistance genes, plant height genes, and cold tolerance genes.
- **Pan-Genome**: A japonica rice pan-genome containing 35 Mb of non-redundant new sequences was constructed, predicting 1131 new genes.
- **Selection Signals**: Strong selection signals were detected on multiple chromosomes, particularly in gene regions related to flowering time.
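
To make the idea of presence-absence variation concrete, the following is a minimal sketch of how a PAV matrix might be summarized once per-gene presence calls are available. It does not reproduce the EUPAN package or the authors' pipeline; the function names, the "core"/"distributed" labels, and the toy gene names are assumptions made for illustration only.

```python
from typing import Dict, List


def classify_genes(pav: Dict[str, List[int]]) -> Dict[str, str]:
    """Label each gene by its presence pattern across accessions.

    `pav` maps a gene name to a list of 0/1 calls, one per accession
    (1 = present, 0 = absent). A gene present in every accession is
    labelled "core"; a gene missing from at least one is "distributed".
    These labels are an illustrative convention, not the paper's output.
    """
    return {
        gene: "core" if all(calls) else "distributed"
        for gene, calls in pav.items()
    }


def presence_frequency(pav: Dict[str, List[int]]) -> Dict[str, float]:
    """Return the fraction of accessions in which each gene is present."""
    return {gene: sum(calls) / len(calls) for gene, calls in pav.items()}


# Toy PAV matrix: three hypothetical genes across four hypothetical accessions.
pav = {
    "geneA": [1, 1, 1, 1],
    "geneB": [1, 0, 1, 1],
    "geneC": [0, 0, 1, 0],
}

labels = classify_genes(pav)
freq = presence_frequency(pav)

for gene in pav:
    print(gene, labels[gene], freq[gene])
```

In a real analysis the matrix would have one column per accession (239 here) and the presence calls would come from read coverage over each gene, which this sketch takes as given.
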
### Conclusion:

This study revealed the genomic variations of 239 high-quality japonica rice varieties, constructed a japonica rice pan-genome, and discovered new gene sequences and selection signals. These results provide an important foundation for future breeding driven by whole-genome sequencing, helping to improve breeding efficiency and develop new varieties with excellent agronomic traits.
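
The summary does not say which statistic was used to detect the selection signals. As one hedged illustration, regions under selection are often screened by looking for windows of reduced nucleotide diversity. The sketch below computes per-window diversity from biallelic SNP allele counts; the function name, the window size, and the treatment of each inbred accession as a single haploid sample are all assumptions, not the authors' method.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def windowed_pi(
    snps: Iterable[Tuple[int, int]],
    n_samples: int,
    window: int = 100_000,
) -> Dict[int, float]:
    """Compute nucleotide diversity per base pair in fixed-size windows.

    `snps` yields (position, alt_allele_count) for biallelic sites on one
    chromosome, with 1-based positions. Each site contributes
    2 * p * (1 - p) * n / (n - 1), where p is the alternate allele
    frequency and n is the number of sampled chromosomes.
    Windows containing no SNPs are omitted from the result, so callers
    should treat a missing window as zero diversity.
    """
    totals = defaultdict(float)
    for pos, alt_count in snps:
        p = alt_count / n_samples
        site_pi = 2 * p * (1 - p) * n_samples / (n_samples - 1)
        start = (pos - 1) // window * window  # 0-based start of the window
        totals[start] += site_pi
    return {start: total / window for start, total in sorted(totals.items())}


# Toy input: three hypothetical SNPs scored in four samples, 100 bp windows.
pi = windowed_pi([(10, 2), (20, 1), (150, 2)], n_samples=4, window=100)

for start, value in pi.items():
    print(f"window starting at {start}: pi = {value:.5f}")
```

Windows whose diversity is markedly lower in one group of varieties than in another would be candidates for closer inspection, for example around heading date genes such as Hd1 and Hd3a.
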