Abstract:Zero-Shot Learning (ZSL) aims to employ seen images and their related semantics to identify unseen images through knowledge transfer. Among past numerous methods, the generative methods are more prominent and achieve better results than other methods. However, we find the input for generating samples is too monotonous, there are only semantics of each class and artificially defined noise, which makes the generated visual features non-discriminative and the classifier cannot effectively distinguish them. In order to solve this problem, we propose a novel approach with cascade Generative Adversarial Network (GAN) to generate discriminative and representative features. In this method, we define a latent space where the features from different categories are orthogonal to each other and the generator for this latent space is learned with a Wasserstein GAN. In addition, in order to make up for the deficiency that the features in this latent space cannot accurately simulate the true distribution of species, we utilize another Wasserstein GAN or Cramér GAN cascaded with the previous one to generate more discriminative and representative visual features. In this way, we can not only expand the content used as input in the generation process, but also make the final generated visual features clear and separable under the influence of latent spatial orthogonality. Extensive experiments on five benchmark datasets, i.e., AWA1, AWA2, CUB, SUN and APY, demonstrate that our proposed method can outperform most of the state-of-the-art methods on both conventional and generalized zero-shot learning settings.

Semantic Enhanced Cross-modal GAN for Zero-shot Learning

GENERATING MANIFOLD-ALIGNED SEMANTIC FEATURE FOR ZERO-SHOT LEARNING

Zero-Shot Learning with Generative Latent Prototype Model.

OntoZSL: Ontology-enhanced Zero-shot Learning

Multi-modal Generative Adversarial Network for Zero-Shot Learning

Zero-Shot Learning with Joint Generative Adversarial Networks

Semantics-Guided Intra-Category Knowledge Transfer for Generalized Zero-Shot Learning

SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning

Generative-based hybrid model with semantic representations for generalized zero-shot learning

Manifold Regularized Cross-Modal Embedding for Zero-Shot Learning

A Joint Generative Model For Zero-Shot Learning

Learning Modality-Consistent Latent Representations for Generalized Zero-Shot Learning

Learning MLatent Representations for Generalized Zero-Shot Learning

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning

Semantic Compression Embedding for Generative Zero-Shot Learning

Visual Data Synthesis Via GAN for Zero-Shot Video Classification

Generation-based Contrastive Model with Semantic Alignment for Generalized Zero-Shot Learning

Visual feature synthesis with semantic reconstructor for traditional and generalized zero‐shot object classification

Learning discriminative and representative feature with cascade GAN for generalized zero-shot learning

Fine-Grained Generation for Zero-Shot Learning.