Abstract:This article defines a new hashing task motivated by real-world applications in content-based image retrieval, that is, effective data indexing and retrieval given mixed query (query image together with user-provided keywords). Our work is distinguished from state-of-the-art hashing research by two unique features: (1) Unlike conventional image retrieval systems, the input query is a combination of an exemplar image and several descriptive keywords, and (2) the input image data are often associated with multiple labels. It is an assumption that is more consistent with the realistic scenarios. The mixed image-keyword query significantly extends traditional image-based query and better explicates the user intention. Meanwhile it complicates semantics-based indexing on the multilabel data. Though several existing hashing methods can be adapted to solve the indexing task, unfortunately they all prove to suffer from low effectiveness. To enhance the hashing efficiency, we propose a novel scheme “boosted shared hashing”. Unlike prior works that learn the hashing functions on either all image labels or a single label, we observe that the hashing function can be more effective if it is designed to index over an optimal label subset. In other words, the association between labels and hash bits are moderately sparse. The sparsity of the bit-label association indicates greatly reduced computation and storage complexities for indexing a new sample, since only limited number of hashing functions will become active for the specific sample. We develop a Boosting style algorithm for simultaneously optimizing both the optimal label subsets and hashing functions in a unified formulation, and further propose a query-adaptive retrieval mechanism based on hash bit selection for mixed queries, no matter whether or not the query words exist in the training data. Moreover, we show that the proposed method can be easily extended to the case where the data similarity is gauged by nonlinear kernel functions. Extensive experiments are conducted on standard image benchmarks like CIFAR-10, NUS-WIDE and a-TRECVID. The results validate both the sparsity of the bit-label association and the convergence of the proposed algorithm, and demonstrate that the proposed hashing scheme achieves substantially superior performances over state-of-the-art methods under the same hash bit budget.

Learning vocabulary-based hashing with adaboost

Large-scale Image Retrieval Based on Boosting Iterative Quantization Hashing with Query-Adaptive Reranking.

Nonlinear Discrete Cross-Modal Hashing for Visual-Textual Data

Vocabulary-based hashing for image search.

Scalable Multimedia Retrieval By Deep Learning Hashing With Relative Similarity Learning

Efficient Fine-Grained Visual-Text Search Using Adversarially-Learned Hash Codes

Boosting Complementary Hash Tables for Fast Nearest Neighbor Search

Compact hashing for mixed image-keyword query over multi-label images

One Network for Multi-Domains: Domain Adaptive Hashing with Intersectant Generative Adversarial Networks.

Improved Deep Unsupervised Hashing Via Prototypical Learning

Complementary Hashing for Approximate Nearest Neighbor Search

Mixed Image-Keyword Query Adaptive Hashing over Multilabel Images

Piecewise Hashing: A Deep Hashing Method for Large-Scale Fine-Grained Search

Supervised Hashing Using Graph Cuts and Boosted Decision Trees

Fast and Accurate Hashing Via Iterative Nearest Neighbors Expansion.

Query-Adaptive Reciprocal Hash Tables for Nearest Neighbor Search

Model Optimization Boosting Framework for Linear Model Hash Learning.

Distributed Adaptive Binary Quantization for Fast Nearest Neighbor Search.

Deep Self-Adaptive Hashing for Image Retrieval

Boosted Curriculum Multi-View Hashing for Multimedia Retrieval

Asymmetric Deep Supervised Hashing