LV Barcoding: locality sensitive hashing-based tool for rapid species identification in DNA barcoding

Long Fan,Ka Hou Chu
DOI: https://doi.org/10.48550/arXiv.1407.3348
2014-07-12
Abstract:DNA barcoding has emerged as a cost-effective approach for species identification. However, the scarcity of tools used for searching the booming reference database becomes an obstacle, currently with BLAST as the only practical choice. Here, we propose a program - LV Barcoding - based on both the random hyperplane projection-based locality sensitive hashing method and the composition vector-based VIP Barcoding for fast species identification. The performance of LV Barcoding is assessed on the data release of BOLD. LV Barcoding has higher accuracy than BLAST, and is able to match a single query against ~114,000 reference barcodes within 10 seconds on a desktop computer. This program is available at <a class="link-external link-http" href="http://msl.sls.cuhk.edu.hk/vipbarcoding/" rel="external noopener nofollow">this http URL</a>.
Populations and Evolution
What problem does this paper attempt to address?