Abstract: This article defines a new hashing task motivated by real-world applications in content-based image retrieval: effective data indexing and retrieval given a mixed query (a query image together with user-provided keywords). Our work is distinguished from state-of-the-art hashing research by two features: (1) unlike conventional image retrieval systems, the input query is a combination of an exemplar image and several descriptive keywords, and (2) the input image data are often associated with multiple labels, an assumption that is more consistent with realistic scenarios. The mixed image-keyword query significantly extends the traditional image-based query and better expresses the user's intention. Meanwhile, it complicates semantics-based indexing on multilabel data. Though several existing hashing methods can be adapted to solve the indexing task, they all suffer from low effectiveness. To enhance the hashing efficiency, we propose a novel scheme called “boosted shared hashing”. Unlike prior works that learn the hashing functions on either all image labels or a single label, we observe that a hashing function can be more effective if it is designed to index over an optimal label subset. In other words, the association between labels and hash bits is moderately sparse. The sparsity of the bit-label association implies greatly reduced computation and storage complexity for indexing a new sample, since only a limited number of hashing functions become active for that sample. We develop a boosting-style algorithm that simultaneously optimizes both the label subsets and the hashing functions in a unified formulation, and further propose a query-adaptive retrieval mechanism based on hash bit selection for mixed queries, whether or not the query words exist in the training data.
Moreover, we show that the proposed method can be easily extended to the case where data similarity is gauged by nonlinear kernel functions. Extensive experiments are conducted on standard image benchmarks such as CIFAR-10, NUS-WIDE, and a-TRECVID. The results validate both the sparsity of the bit-label association and the convergence of the proposed algorithm, and demonstrate that the proposed hashing scheme achieves substantially better performance than state-of-the-art methods under the same hash bit budget.
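
To make the idea of query-adaptive hash bit selection concrete, the following is a minimal sketch, not the article's algorithm. It assumes that a sparse boolean bit-label association matrix has already been learned, and it ranks database items by Hamming distance computed only over the bits tied to the query keywords. The names `select_bits` and `masked_hamming`, the matrix values, and the codes are all hypothetical and chosen purely for illustration.

```python
import numpy as np

def select_bits(bit_label, query_labels):
    """Return indices of hash bits associated with any of the query labels.

    bit_label is a (num_bits, num_labels) boolean matrix; this layout is an
    assumption made for illustration, not something taken from the article.
    """
    mask = bit_label[:, query_labels].any(axis=1)
    return np.flatnonzero(mask)

def masked_hamming(query_code, db_codes, active_bits):
    """Hamming distance from the query to each database code,
    counted only over the selected (active) bits."""
    return (db_codes[:, active_bits] != query_code[active_bits]).sum(axis=1)

# Hypothetical sparse association between 6 hash bits and 3 labels.
bit_label = np.array([
    [1, 0, 0],
    [1, 1, 0],
    [0, 1, 0],
    [0, 0, 1],
    [0, 0, 1],
    [1, 0, 1],
], dtype=bool)

# Hypothetical 6-bit codes for three database images and one query image.
db_codes = np.array([
    [0, 1, 1, 0, 0, 1],
    [1, 1, 0, 0, 1, 0],
    [0, 0, 1, 1, 1, 1],
], dtype=np.uint8)
query_code = np.array([0, 1, 1, 1, 0, 0], dtype=np.uint8)

# The query keyword maps to label index 1, so only bits 1 and 2 are compared.
active = select_bits(bit_label, [1])
dists = masked_hamming(query_code, db_codes, active)
ranking = np.argsort(dists, kind="stable")
```

Because the association matrix is sparse, only a few bits are compared per query, which is the source of the reduced computation that the abstract describes. How the association and the hash functions are actually learned is not shown here.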

Multimedia Semantics-Aware Query-Adaptive Hashing with Bits Reconfigurability

Online Latent Semantic Hashing for Cross-Media Retrieval

Learning Reconfigurable Hashing for Diverse Semantics

Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Semantic Consistency Hashing for Cross-Modal Retrieval

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Supervised Semantic-Embedded Hashing for Multimedia Retrieval

Mixed Image-Keyword Query Adaptive Hashing over Multilabel Images

Efficient Multi-modal Hashing with Online Query Adaption for Multimedia Retrieval

Scalable Multimedia Retrieval By Deep Learning Hashing With Relative Similarity Learning

Multi-modal Hashing for Efficient Multimedia Retrieval: A Survey

Supervised Hashing With Pseudo Labels For Scalable Multimedia Retrieval

Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval

Compact hashing for mixed image-keyword query over multi-label images

Discrete Semantic Alignment Hashing for Cross-Media Retrieval

Efficient Semi-Supervised Multimodal Hashing With Importance Differentiation Regression

Supervised Multi-scale Locality Sensitive Hashing

Guided Hash Algorithm for Information Semantic Retrieval in Multimedia Environment

Query-Adaptive Image Search with Hash Codes

One for more: Structured Multi-Modal Hashing for multiple multimedia retrieval tasks