Prototype-Based Discriminative Feature Representation for Class-incremental Cross-modal Retrieval.

Shaoquan Zhu,Yong Feng,Mingliang Zhou,Baohua Qiang,Bin Fang,Ran Wei
DOI: https://doi.org/10.1142/s021800142150018x
IF: 1.261
2020-01-01
International Journal of Pattern Recognition and Artificial Intelligence
Abstract:Cross-modal retrieval aims to retrieve the related items from various modalities with respect to a query from any type. The key challenge of cross-modal retrieval is to learn more discriminative representations between different category, as well as expand to an unseen class retrieval in the open world retrieval task. To tackle the above problem, in this paper, we propose a prototype learning-based discriminative feature learning (PLDFL) to learn more discriminative representations in a common space. First, we utilize a prototype learning algorithm to cluster these samples labeled with the same semantic class, by jointly taking into consideration the intra-class compactness and inter-class sparsity without discriminative treatments. Second, we use the weight-sharing strategy to model the correlations of cross-modal samples to narrow down the modality gap. Finally, we apply the prototype to achieve class-incremental learning to prove the robustness of our proposed approach. According to our experimental results, significant retrieval performance in terms of mAP can be achieved on average compared to several state-of-the-art approaches.
What problem does this paper attempt to address?