Time- and memory-efficient genome assembly with Raven

Robert Vaser,Mile Šikić
DOI: https://doi.org/10.1038/s43588-021-00073-4
2021-05-01
Nature Computational Science
Abstract:Whole genome sequencing technologies are unable to invariably read DNA molecules intact, a shortcoming that assemblers try to resolve by stitching the obtained fragments back together. Here, we present methods for the improvement of de novo genome assembly from erroneous long reads incorporated into a tool called Raven. Raven maintains similar performance for various genomes and has accuracy on par with other assemblers that support third-generation sequencing data. It is one of the fastest options while having the lowest memory consumption on the majority of benchmarked datasets.
What problem does this paper attempt to address?