Dual-Uncertainty Guided Cycle-Consistent Network for Zero-Shot Learning

Yilei Zhang,Yi Tian,Sihui Zhang,Yaping Huang
DOI: https://doi.org/10.1109/tcsvt.2023.3272111
IF: 5.859
2023-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Zero-shot learning (ZSL) aims to identify novel categories via transferring shared semantic knowledge from seen classes to unseen ones. Since labeled samples of novel categories are unavailable in training phase, visual and semantic spaces are difficult to align precisely. Besides, the uncertainties inherent in fixed visual features and predefined semantic prototypes are always neglected, which also play important roles in modeling unbiased visual-semantic embeddings. In this paper, we propose a Dual-uncertainty Guided Cycle-consistent Network (DGCNet) for ZSL, which aims to learn a robust semantic-to-visual mapping to generate visual centers based on semantic prototypes. Firstly, we propose a cycle-consistent embedding framework, which consists of visual generation sub-network and semantic preservation sub-network. The former generates a primary visual center for each category, while the latter remaps obtained centers back to semantic space to further ensure the consistency between reconstructed semantic embeddings and original prototypes. These two sub-networks explore the intrinsic bidirectional relationships between visual and semantic features complementarily, thus effectively mitigating the alignment shift problem. Furthermore, we develop dual uncertainty perception modules, namely visual uncertainty module and semantic uncertainty module, on the basis of the above two sub-networks. These modules are designed to measure visual and semantic uncertainties of sample features and class prototypes, respectively, which avoid our model overfitting to noisy data and unreliable prototypes. Substantially, the dual uncertainty perception modules contribute to improving the discriminability and adaptability of our DGCNet. Extensive experiments on various datasets demonstrate the effectiveness of our proposed method.
What problem does this paper attempt to address?