A chromosome-level haplotype-resolved genome assembly of oriental tobacco budworm (Helicoverpa assulta)

Yalong Xu, Chen Wang, Zefeng Li, Xueao Zheng, Zhengzhong Kang, Peng Lu, Jianfeng Zhang, Peijian Cao, Qiansi Chen, Xiaoguang Liu
DOI: https://doi.org/10.1038/s41597-024-03264-6
2024-05-07
Scientific Data
Abstract: Oriental tobacco budworm (Helicoverpa assulta) and cotton bollworm (Helicoverpa armigera) are two closely related species within the genus Helicoverpa. They have similar appearances and consistent damage patterns, often leading to confusion. However, the cotton bollworm is a typical polyphagous insect, while the oriental tobacco budworm belongs to the oligophagous insects. In this study, we used Nanopore, PacBio, and Illumina platforms to sequence the genome of H. assulta and used Hifiasm to create a haplotype-resolved draft genome. The Hi-C technique helped anchor 33 primary contigs to 32 chromosomes, including two sex chromosomes, Z and W. The final primary haploid genome assembly was approximately 415.19 Mb in length. BUSCO analysis revealed a high degree of completeness, with 99.0% gene coverage in this genome assembly. The repeat sequences constituted 38.39% of the genome assembly, and we annotated 17,093 protein-coding genes. The high-quality genome assembly of the oriental tobacco budworm serves as a valuable genetic resource that enhances our comprehension of how they select hosts in a complex odour environment. It will also aid in developing an effective control policy.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper sets out to construct a high-quality, haplotype-resolved genome assembly of the oriental tobacco budworm (*Helicoverpa assulta*). The researchers sequenced the *H. assulta* genome on the Nanopore, PacBio, and Illumina platforms and used Hifiasm to create a haplotype-resolved draft genome. Using Hi-C data, they anchored 33 primary contigs to 32 chromosomes, including the two sex chromosomes, Z and W. The final primary haploid genome assembly is approximately 415.19 Mb in length. BUSCO analysis indicates 99.0% gene coverage, repeat sequences account for 38.39% of the assembly, and 17,093 protein-coding genes were annotated. This assembly provides a genetic resource for understanding how the oriental tobacco budworm selects hosts in a complex odour environment, which should help in developing effective control strategies. The study also explored the evolutionary relationship between the oriental tobacco budworm and the cotton bollworm (*Helicoverpa armigera*), along with changes in gene families, to shed light on how the two species differ in host range, pesticide resistance, pheromone component ratio, and reproductive capacity.
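Headline figures such as the total assembly length and the number of anchored sequences can be checked directly against an assembly FASTA file. The following is a minimal sketch of such a check, not the authors' pipeline: the function names `fasta_lengths` and `summarize` are hypothetical, and the toy sequences are invented purely for illustration and have nothing to do with the *H. assulta* data.

```python
from io import StringIO
from typing import Dict, TextIO


def fasta_lengths(handle: TextIO) -> Dict[str, int]:
    """Return {sequence name: length in bp} for a FASTA stream."""
    lengths: Dict[str, int] = {}
    name = None
    for raw in handle:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            parts = line[1:].split()
            name = parts[0] if parts else f"seq{len(lengths) + 1}"
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


def summarize(lengths: Dict[str, int]) -> Dict[str, float]:
    """Summarize sequence count and total length (bp and Mb)."""
    total = sum(lengths.values())
    return {
        "sequences": len(lengths),
        "total_bp": total,
        "total_mb": total / 1e6,
    }


# Toy input (hypothetical data); a real run would open the assembly FASTA instead.
toy = StringIO(">chr1 toy record\nACGTACGT\nACGT\n>chrZ\nACGTAC\n")
toy_lengths = fasta_lengths(toy)
toy_summary = summarize(toy_lengths)
print(toy_summary)
```

On a real chromosome-level assembly, one would expect `sequences` to match the number of anchored chromosomes plus any unplaced contigs, and `total_mb` to be close to the reported assembly length.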