Self-supervised incomplete cross-modal hashing retrieval
Shouyong Peng, Tao Yao, Ying Li, Gang Wang, Lili Wang, Zhiming Yan
DOI: https://doi.org/10.1016/j.eswa.2024.125592
IF: 8.5
2024-11-11
Expert Systems with Applications
Abstract: Benefiting from fast retrieval speed and low storage cost, cross-modal hashing retrieval has become a widely used approximate nearest-neighbor technique in large-scale data retrieval. Most existing cross-modal hashing methods assume that the cross-modal data points are complete. However, completeness is difficult to guarantee in the real world because of uncertain factors in data collection. Moreover, because annotating all data points in large-scale applications is expensive, there is growing interest in unsupervised hashing retrieval, which can learn the correlations of cross-modal data without ground-truth labels. How to perform unsupervised hashing retrieval on incomplete cross-modal data is therefore a problem worth studying. In this paper, we propose a Self-supervised Incomplete Cross-modal Hashing retrieval (SICH) method, which integrates data recovery and hashing encoding into a unified framework. Specifically, we first design a self-supervised semantic module to effectively mine the semantic information among pseudo-labels, and then construct a hash code dictionary to guide hashing function learning with an asymmetric guidance mechanism. In addition, to take full advantage of the incomplete data points in cross-modal learning, we introduce a data recovery network that recovers missing data by minimizing conditional entropy and maximizing mutual information between different modalities. Extensive experiments on two benchmark datasets verify that our method consistently outperforms state-of-the-art cross-modal hashing methods.
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science
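
The abstract attributes the appeal of cross-modal hashing to fast retrieval and low storage. A minimal sketch of why, assuming items from both modalities have already been mapped to short binary codes: retrieval reduces to ranking by Hamming distance, which needs only an XOR and a bit count per item. This is not the SICH method itself. The function names, the 8-bit code length, and the example codes are all hypothetical and chosen only for illustration.

```python
def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions in which two binary hash codes differ."""
    return bin(a ^ b).count("1")


def rank_by_hamming(query: int, database: list[int]) -> list[int]:
    """Return database indices ordered from nearest to farthest code.

    sorted() is stable, so ties keep their original database order.
    """
    return sorted(
        range(len(database)),
        key=lambda i: hamming_distance(query, database[i]),
    )


# Hypothetical 8-bit codes: an image database queried with a text code.
image_codes = [0b10110010, 0b01001101, 0b10100010]
text_query = 0b10110011

ranking = rank_by_hamming(text_query, image_codes)
print(ranking)
```

Each code here fits in a single byte, which is the source of the storage savings; learning hash functions that place semantically related images and texts close together in Hamming space is the part the paper addresses.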