Improving Supervised Cross-modal Retrieval with Semantic Graph Embedding

Changting Feng,Dagang Li,Jingwei Zheng
DOI: https://doi.org/10.1007/978-3-030-67832-6_16
2021-01-01
Abstract:This paper focuses on the use of embedding with global semantic relations to improve the cross modal retrieval. Our method smoothly bridges the heterogeneity gap by graph embedding and then obtains discriminative representation by supervised learning. First, we construct a semantic correlation graph based on the intra-modal similarity and the semantic propagation of pairwise information. Then, embeddings are learnt from the graph semantic structure which enables all the cross-modal data to be mapped into the same space. Second, based on the previous embeddings, we adopt a simple one-branch neural network to enhance the discrimination of the representation by minimizing the discrimination loss and reconstruction loss. Experimental results on three widely-used benchmark datasets clearly demonstrate the improvement of the proposed approach over the state-of-the-art cross-modal retrieval methods.
What problem does this paper attempt to address?