MSRCall: A Multi-scale Deep Neural Network to Basecall Oxford Nanopore Sequences
Yang-Ming Yeh,Yi-Chang Lu
DOI: https://doi.org/10.1093/bioinformatics/btac435
IF: 5.8
2022-07-08
Bioinformatics
Abstract:MinION, a third-generation sequencer from Oxford Nanopore Technologies, is a portable device that can provide long nucleotide read data in real-time. It primarily aims to deduce the makeup of nucleotide sequences from the ionic current signals generated when passing DNA/RNA fragments through nanopores charged with a voltage difference. To determine nucleotides from measured signals, a translation process known as basecalling is required. However, compared to NGS basecallers, the calling accuracy of MinION still needs to be improved. In this work, a simple but powerful neural network architecture called MSRCall is proposed. MSRCall comprises a multi-scale structure, recurrent layers, a fusion block, and a CTC decoder. To better identify both short-range and long-range dependencies, the recurrent layer is redesigned to capture various time-scale features with a multi-scale structure. The results show that MSRCall outperforms other basecallers in terms of both read and consensus accuracies. MSRCall is available at: https://github.com/d05943006/MSRCall Supplementary data are available.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?