NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

Jiang Hu,Zhuo Wang,Zongyi Sun,Benxia Hu,Adeola Oluwakemi Ayoola,Fan Liang,Jingjing Li,José R. Sandoval,David N. Cooper,Kai Ye,Jue Ruan,Chuan-Le Xiao,Depeng Wang,Dong-Dong Wu,Sheng Wang
DOI: https://doi.org/10.1186/s13059-024-03252-4
IF: 17.906
2024-04-28
Genome Biology
Abstract:Long-read sequencing data, particularly those derived from the Oxford Nanopore sequencing platform, tend to exhibit high error rates. Here, we present NextDenovo, an efficient error correction and assembly tool for noisy long reads, which achieves a high level of accuracy in genome assembly. We apply NextDenovo to assemble 35 diverse human genomes from around the world using Nanopore long-read data. These genomes allow us to identify the landscape of segmental duplication and gene copy number variation in modern human populations. The use of NextDenovo should pave the way for population-scale long-read assembly using Nanopore long-read data.
genetics & heredity,biotechnology & applied microbiology
What problem does this paper attempt to address?