Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation

Mikhail Kolmogorov, Kimberley J. Billingsley, Mira Mastoras, Melissa Meredith, Jean Monlong, Ryan Lorig-Roach, Mobin Asri, Pilar Alvarez Jerez, Laksh Malik, Ramita Dewan, Xylena Reed, Rylee M. Genner, Kensuke Daida, Sairam Behera, Kishwar Shafin, Trevor Pesout, Jeshuwin Prabakaran, Paolo Carnevali, Jianzhi Yang, Arang Rhie, Sonja W. Scholz, Bryan J. Traynor, Karen H. Miga, Miten Jain, Winston Timp, Adam M. Phillippy, Mark Chaisson, Fritz J. Sedlazeck, Cornelis Blauwendraat, Benedict Paten
DOI: https://doi.org/10.1038/s41592-023-01993-x
IF: 48
2023-09-15
Nature Methods
Abstract: Long-read sequencing technologies substantially overcome the limitations of short reads but have not been considered a feasible replacement for population-scale projects, being a combination of too expensive, not scalable enough or too error-prone. Here we develop an efficient and scalable wet lab and computational protocol, Napu, for Oxford Nanopore Technologies long-read sequencing that seeks to address those limitations. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the National Institutes of Health Center for Alzheimer's and Related Dementias. Using a single PromethION flow cell, we can detect single nucleotide polymorphisms with F1-score comparable to Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but achieves good concordance with Illumina indel calls elsewhere. Further, we can discover structural variants with F1-score on par with state-of-the-art de novo assembly methods. Our protocol phases small and structural variants at megabase scales and produces highly accurate, haplotype-specific methylation calls.
biochemical research methods