Supervised adaptive similarity consistent latent representation hashing

Hongbin Wang,Rui Chen,Zhenqiu Shu,Yafei Zhang,Huafeng Li
DOI: https://doi.org/10.1016/j.neucom.2023.127113
IF: 6
2023-12-16
Neurocomputing
Abstract:Cross-modal hashing has attracted significant attention in multimedia data similarity given its appealing computational cost and retrieval performance . Supervised hashing benefits from the auxiliary learning of a similarity matrix, which is usually predefined by inner product features or category labels. However, a predefined similarity matrix fails to reflect the real similarity relationship between image-text pairs. In addition, existing methods fix the weights to a value or update them by introducing sensitive dataset-related hyper-parameters. To overcome these problems, we propose a method to perform supervised adaptive similarity consistent latent representation hashing (SCLRH) that adaptively learns the similarity matrix during hashing learning. In SCLRH, we assume that multimodal data are observed and reconstructed from different perspectives of a common consistent latent representation. Instead of using a predefined similarity matrix, SCLRH adaptively learns this matrix to reflect the underlying manifold structure and describes the fine-grained similarity between consistent latent representations. In addition, SCLRH introduces a self-weighted learning strategy to update the weights based on the contributions of different modalities without involving additional hyper-parameters. Experimental results on three benchmark datasets demonstrate the superiority of the proposed SCLRH for cross-modal retrieval.
computer science, artificial intelligence
What problem does this paper attempt to address?