HashNeck is a Boosting Tool for Deep Learning to Hashing

Hua Gao,ChenChen Hu,Guang Han,Jiafa Mao,Wei Huang,Kaiyuan Wan
DOI: https://doi.org/10.1145/3652583.3658053
2024-01-01
Abstract:The goal of hashing for image and video retrieval is to encode multimedia data into compact binary codes, allowing for efficient approximate nearest neighbor search by ensuring that similar images or videos have closely related codes in Hamming space. To improve the effectiveness of hashing, we propose introducing a classification task to assist in training the hash network and enhance the discriminability of predicted hash codes. Unlike conventional multi-task learning approaches, we propose a HashNeck structure for the classification branch that utilizes the similarities between the expected and predicted hash codes to determine whether a neuron should participate in the classification task. By only guiding neurons with correctly predicted hash codes through the classification task, we effectively resolve the conflict between the hash and classification tasks. We evaluated the effectiveness of our proposed method on benchmark image and video datasets, including ImageNet100, MS COCO, NUS-WIDE, UCF-101, and HMDB51. The experimental results on image and video retrieval tasks demonstrate that our method outperforms state-of-the-art hashing methods in terms of retrieval performance. These compelling results demonstrate the superiority of our algorithm and its potential for improving the field of deep learning to hashing.
What problem does this paper attempt to address?