Abstract:Hashing has recently sparked a great revolution in cross-modal retrieval because of its low storage cost and high query speed. Recent cross-modal hashing methods often learn unified or equal-length hash codes to represent the multi-modal data and make them intuitively comparable. However, such unified or equal-length hash representations could inherently sacrifice their representation scalability because the data from different modalities may not have one-to-one correspondence and could be encoded more efficiently by different hash codes of unequal lengths. To mitigate these problems, this paper exploits a related and relatively unexplored problem: encode the heterogeneous data with varying hash lengths and generalize the cross-modal retrieval in various challenging scenarios. To this end, a generalized and flexible cross-modal hashing framework, termed Matrix Tri-Factorization Hashing (MTFH), is proposed to work seamlessly in various settings including paired or unpaired multi-modal data, and equal or varying hash length encoding scenarios. More specifically, MTFH exploits an efficient objective function to flexibly learn the modality-specific hash codes with different length settings, while synchronously learning two semantic correlation matrices to semantically correlate the different hash representations for heterogeneous data comparable. As a result, the derived hash codes are more semantically meaningful for various challenging cross-modal retrieval tasks. Extensive experiments evaluated on public benchmark datasets highlight the superiority of MTFH under various retrieval scenarios and show its competitive performance with the state-of-the-arts.

Modality-specific Matrix Factorization Hashing for Cross-Modal Retrieval

Discrete Cross-Modal Hashing for Efficient Multimedia Retrieval

Semantic Consistency Hashing for Cross-Modal Retrieval

Frustratingly Easy Cross-Modal Hashing

Semantic Constraints Matrix Factorization Hashing for Cross-Modal Retrieval

MTFH: A Matrix Tri-Factorization Hashing Framework for Efficient Cross-Modal Retrieval

Robust Supervised Matrix Factorization Hashing with Application to Cross-Modal Retrieval.

Cross-modal retrieval via label category supervised matrix factorization hashing

Discrete Matrix Factorization Cross-Modal Hashing with Multi-Similarity Consistency

Supervised Matrix Factorization for Cross-Modality Hashing.

UCMH: Unpaired cross-modal hashing with matrix factorization

Joint Coupled-Hashing Representation for Cross-Modal Retrieval

HDMFH: Hypergraph Based Discrete Matrix Factorization Hashing for Multimodal Retrieval

Supervised Intra- and Inter-Modality Similarity Preserving Hashing for Cross-Modal Retrieval.

Joint and Individual Matrix Factorization Hashing for Large-Scale Cross-Modal Retrieval

Online supervised collective matrix factorization hashing for cross-modal retrieval

Cross-Modal Hash Method Based on Multi-Scale Fusion and Projection Matching Constraint

Average Approximate Hashing-Based Double Projections Learning for Cross-Modal Retrieval

Modality-specific structure preserving hashing for cross-modal retrieval

Large-Scale Cross-Modality Search via Collective Matrix Factorization Hashing

Cross-modal retrieval based on multi-dimensional feature fusion hashing