LCEMH: Label Correlation Enhanced Multi-modal Hashing for Efficient Multi-modal Retrieval

Chaoqun Zheng,Lei Zhu,Zheng Zhang,Wenjun Duan,Wenpeng Lu
DOI: https://doi.org/10.1016/j.ins.2023.120064
IF: 8.1
2024-01-04
Information Sciences
Abstract:Supervised multi-modal hashing can effectively improve the discriminative power of hash codes by leveraging semantic label information. However, most existing supervised multi-modal hashing models generally rely on discrete labels for supervision. They only provide limited guidance and may not fully capture the potential semantic similarity between multi-modal data. Moreover, the binary constraints of hash codes further restrict their ability to fully capture the rich semantic information embedded in the labels. To address these limitations, we propose a Label Correlation Enhanced Multi-modal Hashing (LCEMH) approach for efficient multi-modal retrieval. Specifically, our proposed model can effectively expand the inter-class boundaries of discrete labels to further enrich the supervision of similarity. Additionally, we directly transform the acquired richer label information into hash codes simply and effectively, thus retaining more comprehensive information in the labels with minimal quantization loss, and enhancing the discriminant ability of the generated hash codes. Experimental results on three public multimedia retrieval datasets demonstrate the superiority of LCEMH in performance. The source code of LCEMH is available online at: https://github.com/ChaoqunZheng/LCEMH.
computer science, information systems
What problem does this paper attempt to address?