MAFH: Multilabel Aware Framework for Bit-Scalable Cross-Modal Hashing

Xue Li,Jiong Yu,Hongchun Lu,Shaochen Jiang,Ziyang Li,Peiyun Yao
DOI: https://doi.org/10.1016/j.knosys.2023.110922
IF: 8.139
2023-01-01
Knowledge-Based Systems
Abstract:Deep hashing techniques have received considerable attention and are widely used in the field of cross-modal retrieval. The training mode of deep cross-modal hashing models has also gradually converged. However, the commonly used training mode ignores noisy labels from different modality original datasets and can only generate fixed-length hash codes, resulting in poor robustness and flexibility of the training generated hash models. Therefore, to effectively optimize the training mode, we propose a multilabel aware framework for bit-scalable deep cross-modal hashing (MAFH) with the following contributions. First, we introduce a label network module to indirectly used the hash representation of multilabels to more accurately supervise the model training process across all modalities. Second, we design a scalable coding strategy to freely manipulate the length of the hash code, which makes the trained model flexible to adapt to different scenarios. Third, we propose a layered multilabel similarity algorithm to preserve more complete feature information, which avoids the loss of discriminative feature information during model training. The training mode of MAFH can effectively enhance model robustness, flexibility and accuracy. We selected thirteen representative hash methods to compare with the proposed method, and experimental results on four public datasets show that the proposed method achieves good performance. https://github.com/x-28/MAFH.git.
What problem does this paper attempt to address?