Genome assembly in the telomere-to-telomere era

Heng Li, Richard Durbin
DOI: https://doi.org/10.1038/s41576-024-00718-w
IF: 59.581
2024-04-22
Nature Reviews Genetics
Abstract: Genome sequences largely determine the biology and encode the history of an organism, and de novo assembly, the process of reconstructing the genome sequence of an organism from sequencing reads, has been a central problem in bioinformatics for four decades. Until recently, genomes were typically assembled into fragments of a few megabases at best, but now technological advances in long-read sequencing enable the near-complete assembly of each chromosome (also known as telomere-to-telomere assembly) for many organisms. Here, we review recent progress on assembly algorithms and protocols, with a focus on how to derive near-telomere-to-telomere assemblies. We also discuss the additional developments that will be required to resolve remaining assembly gaps and to assemble non-diploid genomes.
genetics & heredity