Uncalled4 improves nanopore DNA and RNA modification detection via fast and accurate signal alignment

Sam Kovaka,Paul W. Hook,Katharine M. Jenike,Vikram Shivakumar,Luke B. Morina,Roham Razaghi,Winston Timp,Michael C. Schatz
DOI: https://doi.org/10.1101/2024.03.05.583511
2024-03-11
Abstract:Nanopore signal analysis enables detection of nucleotide modifications from native DNA and RNA sequencing, providing both accurate genetic/transcriptomic and epigenetic information without additional library preparation. Presently, only a limited set of modifications can be directly basecalled (e.g. 5-methylcytosine), while most others require exploratory methods that often begin with alignment of nanopore signal to a nucleotide reference. We present Uncalled4, a toolkit for nanopore signal alignment, analysis, and visualization. Uncalled4 features an efficient banded signal alignment algorithm, BAM signal alignment file format, statistics for comparing signal alignment methods, and a reproducible training method for k-mer-based pore models, revealing potential errors in ONT’s state-of-the-art DNA model. We apply Uncalled4 to RNA 6-methyladenine (m6A) detection in seven human cell lines, identifying 26% more modifications than Nanopolish using m6Anet, including in several genes where m6A has known implications in cancer. Uncalled4 is available open-source at .
Bioinformatics
What problem does this paper attempt to address?