Abstract:The vanilla Few-shot Learning (FSL) learns to build a classifier for a new concept from one or very few target examples, with the general assumption that source and target classes are sampled from the same domain. Recently, the task of Cross-Domain Few-Shot Learning (CD-FSL) aims at tackling the FSL where there is a huge domain shift between the source and target datasets. Extensive efforts on CD-FSL have been made via either directly extending the meta-learning paradigm of vanilla FSL methods, or employing massive unlabeled target data to help learn models. In this paper, we notice that in the CD-FSL task, the few labeled target images have never been explicitly leveraged to inform the model in the training stage. However, such a labeled target example set is very important to bridge the huge domain gap. Critically, this paper advocates a more practical training scenario for CD-FSL. And our key insight is to utilize a few labeled target data to guide the learning of the CD-FSL model. Technically, we propose a novel Generalized Meta-learning based Feature-Disentangled Mixup network, namely GMeta-FDMixup. We make three key contributions of utilizing GMeta-FDMixup to address CD-FSL. Firstly, we present two mixup modules - mixup-P and mixup-M that help facilitate utilizing the unbalanced and disjoint source and target datasets. These two novel modules enable diverse image generation for training the model on the source domain. Secondly, to narrow the domain gap explicitly, we contribute a novel feature disentanglement module that learns to decouple the domain-irrelevant and domain-specific features. By stripping the domain-specific features, we alleviate the negative effects caused by the domain inductive bias. Finally, we repurpose a new contrastive learning module, dubbed ConL. ConL prevents the model from only capturing category-related features via introducing contrastive loss. Thus, the generalization ability on novel categories is improved. Extensive experimental results on two benchmarks show the superiority of our setting and the effectiveness of our method. Code and models will be released.

DisRot: boosting the generalization capability of few-shot learning via knowledge distillation and self-supervised learning

Knowledge Distillation-based Domain-invariant Representation Learning for Domain Generalization

Boosting Generalized Few-Shot Learning by Scattering Intra-class Distribution

Multi-directional Knowledge Transfer for Few-Shot Learning

Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning

Self-supervised Knowledge Distillation for Few-shot Learning

Progressive Network Grafting for Few-Shot Knowledge Distillation

Enhanced ProtoNet With Self-Knowledge Distillation for Few-Shot Learning

Reweighting and Information-Guidance Networks for Few-Shot Learning

Generalized Meta-FDMixup: Cross-Domain Few-Shot Learning Guided by Labeled Target Data

Knowledge Transduction for Cross-Domain Few-Shot Learning

Self-Supervision Can Be a Good Few-Shot Learner

Supervised Masked Knowledge Distillation for Few-Shot Transformers

Teachers cooperation: team-knowledge distillation for multiple cross-domain few-shot learning

Knowledge Distillation Meets Self-Supervision

One‐stage self‐distillation guided knowledge transfer for long‐tailed visual recognition

Multi-domain few-shot image recognition with knowledge transfer

Hierarchical Knowledge Propagation and Distillation for Few-Shot Learning.

Research on a Cross-Domain Few-Shot Adaptive Classification Algorithm Based on Knowledge Distillation Technology

Multistage feature fusion knowledge distillation

Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms