A Generic Inverted Index Framework for Similarity Search on the GPU - Technical Report

Jingbo Zhou,Qi Guo,H. V. Jagadish,Luboš Krčál,Siyuan Liu,Wenhao Luan,Anthony K. H. Tung,Yueji Yang,Yuxin Zheng
DOI: https://doi.org/10.48550/arXiv.1603.08390
2018-08-14
Abstract:We propose a novel generic inverted index framework on the GPU (called GENIE), aiming to reduce the programming complexity of the GPU for parallel similarity search of different data types. Not every data type and similarity measure are supported by GENIE, but many popular ones are. We present the system design of GENIE, and demonstrate similarity search with GENIE on several data types along with a theoretical analysis of search results. A new concept of locality sensitive hashing (LSH) named $\tau$-ANN search, and a novel data structure c-PQ on the GPU are also proposed for achieving this purpose. Extensive experiments on different real-life datasets demonstrate the efficiency and effectiveness of our framework. The implemented system has been released as open source.
Databases,Computer Vision and Pattern Recognition,Distributed, Parallel, and Cluster Computing,Data Structures and Algorithms
What problem does this paper attempt to address?