Integrated hybrid de novo assembly technologies to obtain high-quality pig genome using short and long reads

Heng Du,Chenguang Diao,Pengju Zhao,Lei Zhou,Jian-Feng Liu
DOI: https://doi.org/10.1093/bib/bbaa399
IF: 9.5
2021-01-12
Briefings in Bioinformatics
Abstract:Abstract With the rapid progress of sequencing technologies, various types of sequencing reads and assembly algorithms have been designed to construct genome assemblies. Although recent studies have attempted to evaluate the appropriate type of sequencing reads and algorithms for assembling high-quality genomes, it is still a challenge to set the correct combination for constructing animal genomes. Here, we present a comparative performance assessment of 14 assembly combinations—9 software programs with different short and long reads of Duroc pig. Based on the results of the optimization process for genome construction, we designed an integrated hybrid de novo assembly pipeline, HSCG, and constructed a draft genome for Duroc pig. Comparison between the new genome and Sus scrofa 11.1 revealed important breakpoints in two S. scrofa 11.1 genes. Our findings may provide new insights into the pan-genome analysis studies of agricultural animals, and the integrated assembly pipeline may serve as a guide for the assembly of other animal genomes.
biochemical research methods,mathematical & computational biology
What problem does this paper attempt to address?