Despi: Efficient Classification of Metagenomics Reads with Lightweight De Bruijn Graph-Based Reference Indexing

Dengfeng Guan,Bo Liu,Yadong Wang
DOI: https://doi.org/10.1109/bibm.2018.8621235
2018-01-01
Abstract:One of the core problems in metagenomics is the classification of shotgun sequencing reads to identify species present in samples. Many supervised classification tools have been developed recently, but they either consume large memory or large computation time. Herein we propose a new classification method, de Bruijn Graph-based Species Identifier (deSPI), which takes advantage of de Bruijn graph and FM-index data structures and a hierarchical top-down strategy to do classification. The experimental results suggest that deSPI uses much less memory than Clark and Kraken and classifies reads much faster than Centrifuge and Kaiju, while maintaining a comparable sensitivity and accuracy.
What problem does this paper attempt to address?