Abstract:Zero-shot learning (ZSL) in visual classification aims to recognize novel categories for which few or even no training samples are available. Through recent advances using generative adversarial networks (GANs) for cross-modal generation, several generative methods have been investigated for ZSL to classify unseen categories with synthetic samples. However, these GAN-based ZSL approaches still struggle to generate samples with semantic consistency and significant between-class discrepancy while preserving within-class diversity, which are vital to building classifiers for unseen classes. Accordingly, in this paper, we propose a robust dual-stream GAN to synthesize satisfactory samples for zero-shot visual classification. In more detail, the inter-class discrepancy is maximized by a backbone compatibility loss, which drives the center of the synthesized samples to move towards the center of real samples of the same class while moving further away from samples of different classes. Secondly, in order to preserve the intra-class diversity ignored by most extant paradigms, we propose a stochastic dispersion regularization to encourage the synthesized samples to be distributed at arbitrary points in the visual space of their categories. Finally, unlike previous methods that project visual samples back into semantic space and consequently cause an information degradation problem, we design a dual-stream generator to synthesize visual samples and reconstruct semantic embedding simultaneously, thereby ensuring semantic consistency. Our model outperforms the state-of-the-arts by 4.7% and 3.0% on average in two metrics over four real-world datasets, demonstrating its effectiveness and superiority.

Information Bottleneck and Selective Noise Supervision for Zero-Shot Learning

GENERATING MANIFOLD-ALIGNED SEMANTIC FEATURE FOR ZERO-SHOT LEARNING

Information Bottleneck Constrained Latent Bidirectional Embedding for Zero-Shot Learning

Unbiased Hybrid Generation Network for Zero-Shot Learning

OntoZSL: Ontology-enhanced Zero-shot Learning

Multi-modal Generative Adversarial Network for Zero-Shot Learning

Transductive Unbiased Embedding for Zero-Shot Learning

Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning

Bidirectional Generative Transductive Zero-Shot Learning.

Zero-Shot Learning with Few Seen Class Samples.

A Joint Generative Model For Zero-Shot Learning

Holistically Associated Transductive Zero-Shot Learning

A Probabilistic Zero-Shot Learning Method Via Latent Nonnegative Prototype Synthesis of Unseen Classes.

Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Zero-shot Learning Via the Fusion of Generation and Embedding for Image Recognition

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning

Joint Visual and Semantic Optimization for Zero-Shot Learning

Generalized Zero-Shot Recognition based on Visually Semantic Embedding

Zero-Shot Learning with Joint Generative Adversarial Networks

Learning Modality-Consistent Latent Representations for Generalized Zero-Shot Learning

Dual-stream Generative Adversarial Networks for Distributionally Robust Zero-Shot Learning.