Telomere-to-telomere gapless genome assembly of the Chinese sea bass (Lateolabrax maculatus)

Zhilong Sun, Shuo Li, Yuyan Liu, Weijing Li, Kaiqiang Liu, Xuebin Cao, Jiliang Lin, Hongyan Wang, Qian Wang, Changwei Shao
DOI: https://doi.org/10.1038/s41597-024-02988-9
2024-02-08
Scientific Data
Abstract: Chinese sea bass (Lateolabrax maculatus) is a highly sought-after commercial seafood species in Asian regions due to its excellent nutritional value. With the rapid advancement of bioinformatics, genome analysis now demands a higher standard of reference than the previously published genomes provide. This study presents a gapless assembly of the Chinese sea bass genome, which has a length of 632.75 Mb. Over 99% of the sequence (626.61 Mb) was anchored onto 24 chromosomes, and telomeres were detected on 34 chromosome ends. Analysis using Merqury indicated a high level of accuracy, with an average consensus quality value of 54.25. The ONT ultralong and PacBio HiFi data were aligned to the assembly using minimap2, resulting in a mapping rate of 99.9%. The study also identified repetitive elements in 20.90% (132.25 Mb) of the genome and inferred 22,014 protein-coding genes. These results establish meaningful groundwork for exploring the evolution of the Chinese sea bass genome and advancing molecular breeding techniques.
multidisciplinary sciences
What problem does this paper attempt to address?
The main problem this paper addresses is the construction of a gapless, telomere-to-telomere (T2T) genome assembly for the Chinese sea bass (*Lateolabrax maculatus*). The research team assembled the genome by integrating PacBio HiFi sequencing, Oxford Nanopore Technologies (ONT) ultra-long sequencing, and Hi-C technology. The assembly is 632.75 Mb long, with more than 99% of the sequence (626.61 Mb) anchored onto 24 chromosomes, and telomeres were detected at 34 chromosome ends. The main objectives include:

1. **Improving the quality of genome assembly**: Compared to previously published reference genomes, the new assembly is gapless and more complete, with better continuity and accuracy.
2. **Facilitating genetic research**: The high-quality genome assembly provides an important resource for population genetics studies and evolutionary analysis.
3. **Optimizing molecular breeding techniques**: The gapless genome assembly helps identify more functional genes and repetitive sequences, laying the foundation for the development of molecular breeding techniques.

Through these efforts, the research team hopes to better understand the genome structure and function of the Chinese sea bass, thereby promoting its application and development in aquaculture.
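The headline figures reported in the abstract can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: the constant and function names are made up for this example, and it assumes that the Merqury consensus quality value is Phred-scaled, so that QV = -10 · log10(error probability). It recomputes the anchored and repeat percentages from the megabase values, counts the chromosome ends available for telomere detection (two per chromosome), and converts the QV into a per-base error rate.

```python
# Figures quoted in the abstract; names are hypothetical, chosen for this sketch.
ASSEMBLY_MB = 632.75      # total assembly length
ANCHORED_MB = 626.61      # sequence anchored onto chromosomes
REPEAT_MB = 132.25        # sequence annotated as repetitive elements
CHROMOSOMES = 24
TELOMERIC_ENDS = 34       # chromosome ends with a detected telomere
MERQURY_QV = 54.25        # average consensus quality value


def percent(part: float, whole: float) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


def qv_to_error_rate(qv: float) -> float:
    """Convert a Phred-scaled quality value to a per-base error probability."""
    return 10 ** (-qv / 10)


anchored_pct = percent(ANCHORED_MB, ASSEMBLY_MB)
repeat_pct = percent(REPEAT_MB, ASSEMBLY_MB)
total_ends = 2 * CHROMOSOMES                      # each chromosome has two ends
telomere_pct = percent(TELOMERIC_ENDS, total_ends)
error_rate = qv_to_error_rate(MERQURY_QV)

print(f"Anchored onto chromosomes: {anchored_pct:.2f}%")
print(f"Repetitive elements:       {repeat_pct:.2f}%")
print(f"Ends with telomeres:       {TELOMERIC_ENDS}/{total_ends} ({telomere_pct:.1f}%)")
print(f"Per-base error rate:       {error_rate:.2e} "
      f"(about one error per {1 / error_rate:,.0f} bases)")
```

Under these assumptions, the recomputed percentages agree with the reported "over 99%" and 20.90%, the 34 telomeric ends are out of 48 possible, and a QV of 54.25 corresponds to fewer than four expected errors per million bases.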