Abstract: Zero-Shot Learning (ZSL) targets at recognizing unseen categories by leveraging auxiliary information, such as attribute embedding. Despite the encouraging results achieved, prior ZSL approaches focus on improving the discriminant power of seen-class features, yet have largely overlooked the geometric structure of the samples and the prototypes. The subsequent attribute-based generative adversarial network (GAN), as a result, also neglects the topological information in sample generation and further yields inferior performances in classifying the visual features of unseen classes. In this paper, we introduce a novel structure-aware feature generation scheme, termed as SA-GAN, to explicitly account for the topological structure in learning both the latent space and the generative networks. Specifically, we introduce a constraint loss to preserve the initial geometric structure when learning a discriminative latent space, and carry out our GAN training with additional supervising signals from a structure-aware discriminator and a reconstruction module. The former supervision distinguishes fake and real samples based on their affinity to class prototypes, while the latter aims to reconstruct the original feature space from the generated latent space. This topology-preserving mechanism enables our method to significantly enhance the generalization capability on unseen-classes and consequently improve the classification performance. Experiments on four benchmarks demonstrate that the proposed approach consistently outperforms the state of the art. Our code can be found in the supplementary material and will also be made publicly available.

Zero-Shot Learning via Structure-Aligned Generative Adversarial Network

GENERATING MANIFOLD-ALIGNED SEMANTIC FEATURE FOR ZERO-SHOT LEARNING

Zero-Shot Learning with Generative Latent Prototype Model.

Joint Learning of Attended Zero-Shot Features and Visual-Semantic Mapping.

Multi-modal Generative Adversarial Network for Zero-Shot Learning

Structure-Aware Feature Generation for Zero-Shot Learning

OntoZSL: Ontology-enhanced Zero-shot Learning

Visual-Semantic Aligned Bidirectional Network for Zero-Shot Learning

Zero-Shot Learning with Joint Generative Adversarial Networks

Unbiased Hybrid Generation Network for Zero-Shot Learning

Multi-Label Zero-Shot Learning with Structured Knowledge Graphs

Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder

Manifold Regularized Cross-Modal Embedding for Zero-Shot Learning

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning

Zero-Shot Visual Recognition Using Semantics-Preserving Adversarial Embedding Networks

SR-GAN: Semantic Rectifying Generative Adversarial Network for Zero-shot Learning

Bi-Adversarial Auto-Encoder for Zero-Shot Learning

Zero-shot Learning Via Shared-Reconstruction-Graph Pursuit

Visual feature synthesis with semantic reconstructor for traditional and generalized zero‐shot object classification

Visual Data Synthesis Via GAN for Zero-Shot Video Classification

A Structure-Enhanced Generative Adversarial Network for Knowledge Graph Zero-Shot Relational Learning.