Dual discriminant adversarial cross-modal retrieval

Pei He, Meng Wang, Ding Tu, Zhuo Wang
DOI: https://doi.org/10.1007/s10489-022-03653-7
IF: 5.3
2022-06-07
Applied Intelligence
Abstract: To improve the accuracy of cross-modal retrieval and enable flexible retrieval between different modalities, we propose a Dual Discriminant Adversarial cross-modal Retrieval (DDAC) method. First, DDAC combines adversarial learning with the minimization of feature projection distances and incorporates label information into both. This reduces the heterogeneity between modalities for features with the same semantics while keeping features with different semantics distinguishable across modalities. Then, cosine distance is used to minimize the inter-modal distance between features with the same label and to maximize it between features with different labels, which addresses the inter-modal discrimination problem. Unlike conventional methods, DDAC performs dual discrimination in the label space and addresses the intra-modal discrimination problem from two perspectives: probability distribution and distance. Extensive experiments on three public datasets show that the proposed DDAC outperforms state-of-the-art methods.
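The inter-modal cosine-distance idea in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the hinge-style `margin` used to push apart pairs with different labels, and the equal weighting of the two terms are all assumptions made for illustration, since the abstract does not give the exact loss.

```python
import numpy as np


def cosine_distance_matrix(a, b, eps=1e-12):
    """Pairwise cosine distance (1 - cosine similarity) between rows of a and b."""
    a_n = a / (np.linalg.norm(a, axis=1, keepdims=True) + eps)
    b_n = b / (np.linalg.norm(b, axis=1, keepdims=True) + eps)
    return 1.0 - a_n @ b_n.T


def inter_modal_loss(img, txt, img_labels, txt_labels, margin=0.5):
    """Hypothetical inter-modal loss (assumed form, not taken from the paper).

    Pulls image/text pairs with the same label together and pushes pairs
    with different labels at least `margin` apart in cosine distance.
    """
    img_labels = np.asarray(img_labels)
    txt_labels = np.asarray(txt_labels)
    dist = cosine_distance_matrix(np.asarray(img, float), np.asarray(txt, float))
    same = img_labels[:, None] == txt_labels[None, :]  # mask of same-label pairs
    # Same-label pairs: minimize their mean distance.
    pos = dist[same].mean() if same.any() else 0.0
    # Different-label pairs: penalize only those closer than the margin.
    neg = np.maximum(0.0, margin - dist[~same]).mean() if (~same).any() else 0.0
    return float(pos + neg)


if __name__ == "__main__":
    img = np.array([[1.0, 0.0], [0.0, 1.0]])  # toy image features
    txt = np.array([[1.0, 0.0], [0.0, 1.0]])  # toy text features
    print(inter_modal_loss(img, txt, [0, 1], [0, 1]))  # labels agree with geometry
    print(inter_modal_loss(img, txt, [0, 1], [1, 0]))  # labels contradict geometry
```

In this toy setup the loss is near zero when same-label features already coincide across modalities, and it grows when same-label features are orthogonal while different-label features overlap.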
computer science, artificial intelligence