Abstract:This article defines a new hashing task motivated by real-world applications in content-based image retrieval, that is, effective data indexing and retrieval given mixed query (query image together with user-provided keywords). Our work is distinguished from state-of-the-art hashing research by two unique features: (1) Unlike conventional image retrieval systems, the input query is a combination of an exemplar image and several descriptive keywords, and (2) the input image data are often associated with multiple labels. It is an assumption that is more consistent with the realistic scenarios. The mixed image-keyword query significantly extends traditional image-based query and better explicates the user intention. Meanwhile it complicates semantics-based indexing on the multilabel data. Though several existing hashing methods can be adapted to solve the indexing task, unfortunately they all prove to suffer from low effectiveness. To enhance the hashing efficiency, we propose a novel scheme “boosted shared hashing”. Unlike prior works that learn the hashing functions on either all image labels or a single label, we observe that the hashing function can be more effective if it is designed to index over an optimal label subset. In other words, the association between labels and hash bits are moderately sparse. The sparsity of the bit-label association indicates greatly reduced computation and storage complexities for indexing a new sample, since only limited number of hashing functions will become active for the specific sample. We develop a Boosting style algorithm for simultaneously optimizing both the optimal label subsets and hashing functions in a unified formulation, and further propose a query-adaptive retrieval mechanism based on hash bit selection for mixed queries, no matter whether or not the query words exist in the training data. Moreover, we show that the proposed method can be easily extended to the case where the data similarity is gauged by nonlinear kernel functions. Extensive experiments are conducted on standard image benchmarks like CIFAR-10, NUS-WIDE and a-TRECVID. The results validate both the sparsity of the bit-label association and the convergence of the proposed algorithm, and demonstrate that the proposed hashing scheme achieves substantially superior performances over state-of-the-art methods under the same hash bit budget.

A Hash Centroid Construction Method with Swin Transformer for Multi-Label Image Retrieval.

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

AN image retrieval method based on multiple hyperspheres OC-SVM hashing

Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval.

Label-affinity Self-adaptive Central Similarity Hashing for Image Retrieval

Supervised Hashing With Pseudo Labels For Scalable Multimedia Retrieval

Compact hashing for mixed image-keyword query over multi-label images

Deep Semantic-Preserving and Ranking-Based Hashing for Image Retrieval.

Deep Discriminative Quantization Hashing for Image Retrieval

Transductive Zero-Shot Hashing for Multilabel Image Retrieval

Specific class center guided deep hashing for cross-modal retrieval

Rank-Consistency Deep Hashing for Scalable Multi-Label Image Search

Deep Supervised Hashing for Fast Image Retrieval

Mixed Image-Keyword Query Adaptive Hashing over Multilabel Images

Deep Multiscale Fine-Grained Hashing for Remote Sensing Cross-Modal Retrieval

CNN Based Hashing for Image Retrieval.

LCEMH: Label Correlation Enhanced Multi-modal Hashing for Efficient Multi-modal Retrieval

A Multi-dimensional Equilibrium-depth Hash Image Retrieval Method.

Label consistent locally linear embedding based cross-modal hashing

HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval