A Multi-Block N-ary Trie Structure for Exact R-Neighbour Search in Hamming Space

Yicheng Huang, Ling-Yu Duan, Zhe Wang, Jie Lin, Vijay Chandrasekhar, Tiejun Huang
DOI: https://doi.org/10.1109/icip.2017.8296455
2017-01-01
Abstract: This paper proposes a novel algorithm for the exact r-neighbour search problem in Hamming space. Existing r-neighbour search methods typically use hash tables to index binary codes. Given a query, these approaches find the nearest neighbours by checking every bucket in a Hamming ball centered at the query. The problem is that they spend most of the search time visiting empty buckets (lookup misses). In this paper, we instead use a trie structure to index binary codes. We treat several consecutive bits of a binary string as a block and use the block as the atomic indexing element in the trie, which is efficient in both access speed and memory usage. Our method searches for the nearest neighbours of a query by using the records stored in the trie nodes to avoid lookup misses. We name the proposed indexing structure the Multi-Block N-ary Trie (MBNT). A theoretical analysis is given to prove that MBNT has a lower time cost than hash-based methods. Extensive experiments show that MBNT outperforms state-of-the-art algorithms on several large-scale benchmarks.
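The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' MBNT implementation: the class name `BlockTrie`, the helper `hamming_ball_size`, the dictionary-based child storage, and the pruning rule are all assumptions made for illustration. The helper counts the buckets a hash-table method would probe for a given radius, and the trie search descends only into child blocks that actually exist, pruning any branch whose accumulated Hamming distance exceeds r.

```python
from math import comb
from typing import Dict, List


def hamming_ball_size(code_bits: int, r: int) -> int:
    """Number of distinct codes (buckets) within Hamming distance r of a query."""
    return sum(comb(code_bits, k) for k in range(r + 1))


class BlockTrie:
    """Toy trie keyed on fixed-width bit blocks (illustrative only, not the paper's MBNT)."""

    def __init__(self, code_bits: int, block_bits: int) -> None:
        if code_bits % block_bits != 0:
            raise ValueError("code_bits must be a multiple of block_bits")
        self.block_bits = block_bits
        self.depth = code_bits // block_bits  # number of blocks per code
        self.mask = (1 << block_bits) - 1
        self.root: Dict = {}

    def _blocks(self, code: int) -> List[int]:
        # Split the code into blocks, most significant block first.
        return [
            (code >> (self.block_bits * (self.depth - 1 - i))) & self.mask
            for i in range(self.depth)
        ]

    def insert(self, code: int, item_id: int) -> None:
        blocks = self._blocks(code)
        node = self.root
        for block in blocks[:-1]:
            node = node.setdefault(block, {})
        # The last level maps a block value to the list of item ids.
        node.setdefault(blocks[-1], []).append(item_id)

    def search(self, query: int, r: int) -> List[int]:
        """Return ids of all stored codes within Hamming distance r of query."""
        q_blocks = self._blocks(query)
        results: List[int] = []

        def visit(node: Dict, level: int, dist: int) -> None:
            # Only children that exist are iterated, so no empty bucket is probed.
            for block, child in node.items():
                d = dist + bin(block ^ q_blocks[level]).count("1")
                if d > r:
                    continue  # prune: this subtree cannot contain an r-neighbour
                if level == self.depth - 1:
                    results.extend(child)
                else:
                    visit(child, level + 1, d)

        visit(self.root, 0, 0)
        return results
```

For example, with 64-bit codes and r = 2, `hamming_ball_size(64, 2)` gives 2081 buckets that a hash-based method would have to probe, most of which may be empty, whereas `BlockTrie.search` touches only populated branches. Storing children in a dictionary is a simplification chosen for brevity; the abstract does not specify how the paper lays out its nodes.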