Graph Convolutional Network Based Unsupervised Cross-Modal Hashing Retrieval

Ding Shuyan,Yu Heng,Li Lunbo,Guo Jianhui
DOI: https://doi.org/10.19734/j.issn.1001-3695.2022.07.0398
2023-01-01
Abstract:To solve the insufficient mining problem of semantic correlation information within a single modality in the unsupervised cross-modal retrieval task, this paper proposed an unsupervised cross-modal hash retrieval(UCMHR) method based on GCN. It obtained the features of the two modalities through the image and text encoders, respectively, input the features into the GCN to exploit the single intra-modal semantic information. Then it calculated the loss by comparing with the deep semantic correlation similarity matrix, so the generated binary codes were continuously reconstructed and optimized until the robust hashing expression corresponding to the samples was generated. The experimental results show that the cross-modal retrieval accuracy of this method on multiple datasets improves significantly, compared with the classical shallow methods and deep-learning methods. It is proved that the semantic information within the modality can be further mined through the graph convolutional network, the proposed model has higher accuracy and robustness.
What problem does this paper attempt to address?