KMC 3: counting and manipulating k-mer statistics

Marek Kokot, Maciej Długosz, Sebastian Deorowicz
DOI: https://doi.org/10.48550/arXiv.1701.08022
2017-01-27
Abstract: Summary: Counting all k-mers in a given dataset is a standard procedure in many bioinformatics applications. We introduce KMC3, a significant improvement of the former KMC2 algorithm, together with KMC tools for manipulating k-mer databases. The usefulness of the tools is shown on a few real problems. Availability: The program is freely available at http://sun.aei.polsl.pl/REFRESH/kmc. Contact: sebastian.deorowicz@polsl.pl
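To make the task concrete, here is a minimal Python sketch of what "counting all k-mers" means. It is a naive in-memory illustration and not the KMC3 algorithm, which the abstract does not describe. The function name `count_kmers` and the choice to skip k-mers containing symbols other than A, C, G, T are assumptions made for this example only.

```python
from collections import Counter


def count_kmers(sequences, k):
    """Count every length-k substring (k-mer) across the given sequences.

    Naive illustration only: the whole table is held in memory.
    Assumption: k-mers containing symbols outside A, C, G, T are skipped.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    alphabet = set("ACGT")
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width k over the sequence, one position at a time.
        for i in range(len(seq) - k + 1):
            kmer = seq[i:i + k]
            if set(kmer) <= alphabet:
                counts[kmer] += 1
    return counts
```

For example, `count_kmers(["ACGTACGT"], 3)` sees the windows ACG, CGT, GTA, TAC, ACG, CGT, so ACG and CGT are each counted twice. A table like this grows quickly with input size, which is why dedicated k-mer counters exist.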
Genomics; Distributed, Parallel, and Cluster Computing; Data Structures and Algorithms