Abstract: Hashing has shown great potential for cross-modal retrieval owing to its retrieval efficiency and low storage cost. Most current supervised cross-modal retrieval methods rely heavily on accurate semantic supervision, which becomes intractable to annotate as sample sizes keep growing. By comparison, existing unsupervised methods rely on accurate but computationally intensive sample similarity preservation strategies to compensate for the lack of semantic guidance, which weakens their ability to bridge the semantic gap. Furthermore, both kinds of approaches must search for the nearest samples among all samples in a large search space, which is laborious. To address these issues, this paper proposes an unsupervised dual deep hashing (UDDH) method with semantic-index and content-code for cross-modal retrieval. Deep hashing networks extract deep features and collaboratively encode dual hashing codes, consisting of a common semantic index and modality content codes, to bridge the semantic and heterogeneity gaps simultaneously. The dual deep hashing architecture, comprising a head code for the semantic index and tail codes for modality content, improves the efficiency of cross-modal retrieval. A query only needs to search among samples that share its semantic index, which greatly shrinks the search space and yields superior retrieval efficiency. UDDH integrates the learning of deep feature extraction, binary optimization, the common semantic index, and the modality content codes within a unified model, so that they can be optimized collaboratively to enhance overall performance. Extensive experiments demonstrate the retrieval superiority of the proposed approach over state-of-the-art baselines.
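The retrieval step described in the abstract, where a query is compared only against samples that share its semantic index, can be illustrated with a minimal sketch. This is not the authors' implementation: the class `DualCodeIndex`, its method names, the exact-match rule on the head code, and the use of Python integers as binary codes are all assumptions made for illustration, and the deep networks that would produce the codes are not shown.

```python
from collections import defaultdict


def hamming(a: int, b: int) -> int:
    """Hamming distance between two binary codes stored as integers."""
    return bin(a ^ b).count("1")


class DualCodeIndex:
    """Hypothetical two-level index: head code selects a bucket, tail code ranks within it."""

    def __init__(self):
        # Maps a semantic index (head code) to the items stored under it.
        self.buckets = defaultdict(list)

    def add(self, item_id, semantic_index: int, content_code: int) -> None:
        # Store the item's content code (tail code) in its semantic bucket.
        self.buckets[semantic_index].append((item_id, content_code))

    def search(self, semantic_index: int, content_code: int, top_k: int = 5):
        # Assumption: only items with exactly the same semantic index are candidates,
        # so the rest of the database is never scanned.
        candidates = self.buckets.get(semantic_index, [])
        # Rank the candidates by Hamming distance between content codes.
        ranked = sorted(candidates, key=lambda entry: hamming(entry[1], content_code))
        return [item_id for item_id, _ in ranked[:top_k]]


# Toy database: two items under semantic index 0b01, one under 0b10.
index = DualCodeIndex()
index.add("img_a", 0b01, 0b1100)
index.add("img_b", 0b01, 0b1111)
index.add("img_c", 0b10, 0b1100)

# A query with semantic index 0b01 never touches img_c,
# even though img_c's content code is close to the query's.
results = index.search(0b01, 0b1000, top_k=2)
```

In this toy setup the search cost depends on the size of one bucket rather than the size of the whole database, which is the source of the efficiency gain the abstract claims. How the method handles a query whose semantic index is wrong or matches no bucket is not stated in the abstract, so the sketch simply returns an empty list in that case.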

Dark knowledge association guided hashing for unsupervised cross-modal retrieval

Efficient Discrete Supervised Hashing for Large-scale Cross-modal Retrieval

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Discrete Similarity Preserving Hashing for Cross-modal Retrieval

Semantic Consistency Hashing for Cross-Modal Retrieval

CKDH: CLIP-based Knowledge Distillation Hashing for Cross-modal Retrieval

Deep Adaptively-Enhanced Hashing With Discriminative Similarity Guidance for Unsupervised Cross-Modal Retrieval

Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval

High-order nonlocal Hashing for unsupervised cross-modal retrieval

Deep Class-guided Hashing for Multi-label Cross-modal Retrieval

Discrete Two-Step Cross-Modal Hashing Through the Exploitation of Pairwise Relations

Unsupervised Dual Deep Hashing with Semantic-Index and Content-Code for Cross-Modal Retrieval

Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval

Deep Semantic-Alignment Hashing for Unsupervised Cross-Modal Retrieval

Specific class center guided deep hashing for cross-modal retrieval

Pseudo-label driven deep hashing for unsupervised cross-modal retrieval

A High-Dimensional Sparse Hashing Framework for Cross-Modal Retrieval

Clustering-driven Deep Adversarial Hashing for scalable unsupervised cross-modal retrieval

Semi-supervised Semi-paired Cross-modal Hashing

Supervised Hierarchical Online Hashing for Cross-modal Retrieval