Abstract: Knowledge transfer-based few-shot learning (FSL) aims to improve the recognition of novel objects from limited training samples by transferring relevant potential knowledge from other data. Most related methods compute such knowledge to refine the representation of a novel sample or to enrich the supervision of a classifier during the transfer procedure. However, the transfer computation can easily introduce noise because (1) the imbalance in sample counts between the known (base) and the novel categories biases how the content of novel objects is captured, and (2) the semantic gaps between modalities weaken knowledge interaction during training. To reduce the influence of these issues in knowledge transfer-based FSL, this paper proposes multi-directional knowledge transfer (MDKT). Specifically, (1) we use two independent unidirectional knowledge self-transfer strategies to calibrate the distributions of the novel categories from the base categories in the visual and the textual space, with the aim of yielding transferable knowledge from the base categories to describe a novel category. (2) To reduce the influence of semantic gaps, we first use a bidirectional knowledge connection to exchange knowledge between the visual and the textual space. We then adopt an online fusion strategy to enhance the expressiveness of the textual knowledge and to improve the prediction accuracy on the novel categories by combining knowledge from different modalities. Empirical studies on three FSL benchmark datasets demonstrate the effectiveness of MDKT, which improves the recognition accuracy on novel categories under limited samples, especially on $1$-shot and $2$-shot training tasks.
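
The abstract does not give formulas for the calibration or fusion steps, so the following is only a minimal sketch of the general idea under stated assumptions, not the MDKT method itself. It assumes that calibration means averaging a novel-class prototype with the means of its k nearest base classes, and that fusion is a fixed convex combination of per-class scores from the two modalities. The names `calibrate_prototype`, `fuse_scores`, `k`, and `alpha` are hypothetical and chosen for illustration.

```python
import numpy as np


def calibrate_prototype(support, base_means, k=2):
    """Shift a novel-class prototype toward its k nearest base-class means.

    Assumption: "calibration" here is a plain average; the paper may differ.
    support:    array [n_shots, d] of novel-class features
    base_means: array [n_base, d] of base-class mean features
    """
    # Prototype estimated from the few labelled novel samples.
    proto = support.mean(axis=0)
    # Euclidean distance from the prototype to every base-class mean.
    dists = np.linalg.norm(base_means - proto, axis=1)
    # Indices of the k closest base classes.
    nearest = np.argsort(dists)[:k]
    # Average the prototype together with the selected base means.
    calibrated = (proto + base_means[nearest].sum(axis=0)) / (k + 1)
    return calibrated, nearest


def fuse_scores(visual_scores, text_scores, alpha=0.5):
    """Combine per-class scores from two modalities (assumed convex mix)."""
    visual_scores = np.asarray(visual_scores, dtype=float)
    text_scores = np.asarray(text_scores, dtype=float)
    return alpha * visual_scores + (1.0 - alpha) * text_scores


if __name__ == "__main__":
    # One-shot example with three base classes in a 2-D feature space.
    support = np.array([[1.0, 0.0]])
    base_means = np.array([[0.0, 0.0], [2.0, 0.0], [10.0, 10.0]])
    calibrated, nearest = calibrate_prototype(support, base_means, k=2)
    print("calibrated prototype:", calibrated)
    print("fused scores:", fuse_scores([0.2, 0.8], [0.6, 0.4]))
```

In the same spirit, the function would be applied once to visual features and once to textual embeddings, which mirrors the two independent unidirectional transfers the abstract describes. The bidirectional connection between modalities is not modelled in this sketch.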

Cross-Modal Knowledge Distillation For Fine-Grained One-Shot Classification

Knowledge-Based Fine-Grained Classification for Few-Shot Learning

Target Guided Knowledge Distillation for Cross-Domain Few-Shot Learning

Cross-Modal Knowledge Enhancement Mechanism for Few-Shot Learning

Research on a Cross-Domain Few-Shot Adaptive Classification Algorithm Based on Knowledge Distillation Technology

Cross-Domain and Cross-Modal Knowledge Distillation in Domain Adaptation for 3D Semantic Segmentation

Cross-Level Distillation and Feature Denoising for Cross-Domain Few-Shot Classification

Multi-domain few-shot image recognition with knowledge transfer

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

Multi-directional Knowledge Transfer for Few-Shot Learning

A Dimensional Structure based Knowledge Distillation Method for Cross-Modal Learning

Knowledge Transduction for Cross-Domain Few-Shot Learning

Multispectral Scene Classification via Cross-Modal Knowledge Distillation

SemCKD: Semantic Calibration for Cross-Layer Knowledge Distillation

Knowledge Distillation from Single to Multi Labels: an Empirical Study

Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

Knowledge Graph Enhanced Multimodal Learning for Few-shot Visual Recognition

Cross Modality Knowledge Distillation for Multi-modal Aerial View Object Classification

Integrating Knowledge Distillation with Learning to Rank for Few-Shot Scene Classification

Low-resolution Few-Shot Learning Via Multi-Space Knowledge Distillation