Abstract:Unsupervised domain adaptation has limitations when encountering label discrepancy between the source and target domains. While open-set domain adaptation approaches can address situations when the target domain has additional categories, these methods can only detect them but not further classify them. In this paper, we focus on a more challenging setting dubbed Domain Adaptive Zero-Shot Learning (DAZSL), which uses semantic embeddings of class tags as the bridge between seen and unseen classes to learn the classifier for recognizing all categories in the target domain when only the supervision of seen categories in the source domain is available. The main challenge of DAZSL is to perform knowledge transfer across categories and domain styles simultaneously. To this end, we propose a novel end-to-end learning mechanism dubbed Three-way Semantic Consistent Embedding (TSCE) to embed the source domain, target domain, and semantic space into a shared space. Specifically, TSCE learns domain-irrelevant categorical prototypes from the semantic embedding of class tags and uses them as the pivots of the shared space. The source domain features are aligned with the prototypes via their supervised information. On the other hand, the mutual information maximization mechanism is introduced to push the target domain features and prototypes towards each other. By this way, our approach can align domain differences between source and target images, as well as promote knowledge transfer towards unseen classes. Moreover, as there is no supervision in the target domain, the shared space may suffer from the catastrophic forgetting problem. Hence, we further propose a ranking-based embedding alignment mechanism to maintain the consistency between the semantic space and the shared space. Experimental results on both I2AwA and I2WebV clearly validate the effectiveness of our method. Code is available at https://github.com/tiggers23/TSCE-Domain-Adaptive-Zero-Shot-Learning.

ZeroAE: Pre-trained Language Model Based Autoencoder for Transductive Zero-shot Text Classification

Zero-Shot Learning with Generative Latent Prototype Model.

Zero-shot Text Classification via Reinforced Self-training

Transductive Unbiased Embedding for Zero-Shot Learning

Transductive discriminative dictionary learning approach for zero-shot classification

Semantic Autoencoder for Zero-Shot Learning

Zero-VAE-GAN: Generating Unseen Features for Generalized and Transductive Zero-Shot Learning

Transductive Zero-Shot Learning with a Self-Training Dictionary Approach

OntoZSL: Ontology-enhanced Zero-shot Learning

Transformer-Based Approach Via Contrastive Learning for Zero-Shot Detection.

A Semantic Similarity Supervised Autoencoder for Zero-Shot Learning.

Zero-Shot Text Classification via Self-Supervised Tuning

Class label autoencoder for zero-shot learning

Retrieval Augmented Zero-Shot Text Classification

Semantic Consistent Embedding for Domain Adaptive Zero-Shot Learning

Transductive Zero-Shot Learning With Adaptive Structural Embedding

Bi-Adversarial Auto-Encoder for Zero-Shot Learning

A Distance-Constrained Semantic Autoencoder for Zero-Shot Remote Sensing Scene Classification

TransZero: Attribute-guided Transformer for Zero-Shot Learning

Liberating Seen Classes: Boosting Few-Shot and Zero-Shot Text Classification via Anchor Generation and Classification Reframing

Zero-Shot Learning via Discriminative Dual Semantic Auto-Encoder