Dual Collaborative Visual-Semantic Mapping for Multi-Label Zero-Shot Image Recognition

Yunqing Hu,Xuan Jin,Xi Chen,Yin Zhang
DOI: https://doi.org/10.1109/icassp49357.2023.10095788
2023-01-01
Abstract:Multi-label zero-shot learning (ML-ZSL), with the difficulty of both multi-label learning and zero-shot learning, aims to recognize various unseen objects that are not observed during training. Previous methods mainly use a single directional visual-semantic mapping to associate the visual and semantic embedding space, which is not sufficient to adequately realize knowledge transfer from seen to unseen classes. In this paper, we propose a novel dual collaborative visual-semantic mapping framework, constructing abundant connection relationships by exploring two aspects of mapping streams, i.e., the visual-to-semantic (V2S) mapping and the semantic-to-visual (S2V) mapping. Through the collaborative learning of these two effective mappings, our method achieves state-of-the-art performance on the MS-COCO and PASCAL-VOC, two benchmarks for ML-ZSL.
What problem does this paper attempt to address?