Abstract:Supervised cross-modal hashing has attracted many researchers. In these studies, they seek a common semantic space or directly regress the zero-one label information into the Hamming space. Although they achieve many achievements, they neglect some issues: 1) some methods of the classification task are not suitable for retrieval tasks, since they are lack of learning personalized features of sample; 2) the outcomes of hash retrieval are related to both the length and encoding method of hash codes. Because a sample possess more personalized features than label semantics, in this paper, we propose a novel supervised cross-modal hashing collaboration learning method called discrete Cross-modal Hashing with Relaxation and Label Semantic Guidance (CHRLSG). First, we introduce two relaxation variables as latent spaces. One is used to extract text features and label semantic information collaboratively, and the other is used to extract image features and label semantics collaboratively. Second, the more accurate hash codes are generated from latent spaces, since CHRLSG learns collaboratively feature semantics and label semantics by using labels as the domination and features as the auxiliary. Third, we utilize labels to strengthen the similar relationship of inter-modal samples via keeping the pairwise closeness. Label semantics are made full use of to avoid classification error. Fourth, we introduce class weight to further increase the discrimination of samples that belong to different classes in intra-modal and keep the similarity of samples unchanged. Therefore, CHRLSG model preserves not only the relationship between samples, but also maintains the consistency of label semantic during collaboration optimization. Experimental results of three common benchmark datasets demonstrate that the proposed model is superior to the existing advanced methods.

Semantic-consistent cross-modal hashing for large-scale image retrieval

Semantic Consistency Hashing for Cross-Modal Retrieval

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Supervised Coarse-to-Fine Semantic Hashing for Cross-Media Retrieval.

Discrete Similarity Preserving Hashing for Cross-modal Retrieval.

Nonlinear Discrete Cross-Modal Hashing for Visual-Textual Data

Semantic-rebased cross-modal hashing for scalable unsupervised text-visual retrieval

Semi-supervised Semi-paired Cross-modal Hashing

Asymmetric Supervised Consistent and Specific Hashing for Cross-Modal Retrieval

Deep Cross-modal Hashing Based on Semantic Consistent Ranking

Sequential Discrete Hashing for Scalable Cross-Modality Similarity Retrieval

Discrete cross-modal hashing with relaxation and label semantic guidance

Scalable Unsupervised Hashing via Exploiting Robust Cross-modal Consistency

Semantic embedding based online cross-modal hashing method

Fast discrete cross-modal hashing with semantic consistency

Integration of Semantic and Visual Hashing for Image Retrieval

A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval

Discrete Joint Semantic Alignment Hashing for Cross-Modal Image-Text Search

Latent semantic-enhanced discrete hashing for cross-modal retrieval

Label consistent locally linear embedding based cross-modal hashing