Fast nanopore sequencing data analysis with SLOW5

Hasindu Gamaarachchi,Hiruna Samarakoon,Sasha P. Jenner,James M. Ferguson,Timothy G. Amos,Jillian M. Hammond,Hassaan Saadat,Martin A. Smith,Sri Parameswaran,Ira W. Deveson
DOI: https://doi.org/10.1038/s41587-021-01147-4
IF: 46.9
2022-01-03
Nature Biotechnology
Abstract:Abstract Nanopore sequencing depends on the FAST5 file format, which does not allow efficient parallel analysis. Here we introduce SLOW5, an alternative format engineered for efficient parallelization and acceleration of nanopore data analysis. Using the example of DNA methylation profiling of a human genome, analysis runtime is reduced from more than two weeks to approximately 10.5 h on a typical high-performance computer. SLOW5 is approximately 25% smaller than FAST5 and delivers consistent improvements on different computer architectures.
biotechnology & applied microbiology
What problem does this paper attempt to address?