Abstract: Generalized Zero-Shot Learning (GZSL) aims to recognize both seen and unseen categories by establishing relations between visual and semantic information. Recently, generation-based methods that synthesize visual features from the corresponding attributes have gained significant attention. However, these generated features often lack discriminative capability because the generative model is inadequately trained. To address this issue, we propose a novel Discriminative Enhanced Network (DENet) that harnesses the potential of the generative model by adapting the training features and imposing constraints on the generated features. Our approach incorporates three pivotal modules: (1) Before the generative network is trained, a Pre-Tuning Module (PTM) eliminates irrelevant background noise from the raw features extracted by a fixed CNN backbone, so that the generative model receives tuned training features without redundant noise. (2) During generative network training, we propose an Asymmetry Cross-authenticity Contrastive (AC2) loss that groups visual features of the same category while repelling features from different categories by optimizing a large number of sample pairs. Additionally, we incorporate intra-class and relation-specific inter-class boundaries within the AC2 loss to enrich sample diversity and preserve valid semantic information. (3) Also during generative network training, a Dual-semantic Alignment Module (DAM) aligns visual features with both attributes and label embeddings, enabling the model to learn attribute-related information and discriminative extended semantics. Experiments on four standard benchmarks demonstrate that our approach learns more discriminative features and surpasses existing methods.
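The grouping and repelling behaviour described for the AC2 loss can be illustrated with a generic supervised contrastive loss. The sketch below is not the AC2 loss itself: the abstract does not give its formula, and the asymmetry, cross-authenticity pairing, and boundary terms are omitted. The function name `supervised_contrastive_loss`, the `temperature` parameter, and the toy features are all assumptions made for illustration.

```python
import math


def supervised_contrastive_loss(features, labels, temperature=0.1):
    """Generic supervised contrastive loss (illustrative, NOT the AC2 loss).

    features: list of non-zero feature vectors (lists of floats).
    labels:   list of class labels, one per feature vector.
    """
    # L2-normalise so that dot products are cosine similarities.
    normed = []
    for f in features:
        norm = math.sqrt(sum(x * x for x in f))
        normed.append([x / norm for x in f])

    n = len(normed)
    total, anchors = 0.0, 0
    for i in range(n):
        # Scaled similarity of anchor i to every other sample.
        sims = {
            j: sum(a * b for a, b in zip(normed[i], normed[j])) / temperature
            for j in range(n)
            if j != i
        }
        positives = [j for j in sims if labels[j] == labels[i]]
        if not positives:
            continue  # anchors without a same-class partner are skipped

        # Numerically stable log-sum-exp over all other samples.
        m = max(sims.values())
        log_denom = m + math.log(sum(math.exp(s - m) for s in sims.values()))

        # Negative mean log-probability of the same-class samples.
        total += -sum(sims[j] - log_denom for j in positives) / len(positives)
        anchors += 1

    return total / anchors if anchors else 0.0


# Toy features: two tight clusters along different axes.
feats = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
clustered = supervised_contrastive_loss(feats, [0, 0, 1, 1])
mixed = supervised_contrastive_loss(feats, [0, 1, 0, 1])
```

With these toy inputs, labels that match the clusters yield a lower loss than labels that cut across them, which is the sense in which such a loss rewards features that are compact within a class and separated across classes.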

Towards Discriminative Feature Generation for Generalized Zero-Shot Learning
