Abstract: Few-shot learning in image classification aims to learn a classifier that can classify images when only a few training examples are available for each class. Recent work has achieved promising classification performance, usually by using an image-level feature based measure. In this paper, we argue that a measure at such a level may not be effective enough in light of the scarcity of examples in few-shot learning. Instead, we think a local descriptor based image-to-class measure should be taken, inspired by its surprising success in the heyday of local invariant features. Specifically, building upon the recent episodic training mechanism, we propose a Deep Nearest Neighbor Neural Network (DN4 for short) and train it in an end-to-end manner. Its key difference from the literature is the replacement of the image-level feature based measure in the final layer by a local descriptor based image-to-class measure. This measure is computed online via a k-nearest neighbor search over the deep local descriptors of convolutional feature maps. The proposed DN4 not only learns the optimal deep local descriptors for the image-to-class measure, but also utilizes the higher efficiency of such a measure in the case of example scarcity, thanks to the exchangeability of visual patterns across the images in the same class. Our work leads to a simple, effective, and computationally efficient framework for few-shot learning. Experimental study on benchmark datasets consistently shows its superiority over the related state-of-the-art, with the largest absolute improvement of 17% over the next best. The source code is available at https://github.com/WenbinLee/DN4.git.
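
The image-to-class measure described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (see the repository linked above for that). It assumes that each position of a convolutional feature map is one local descriptor, that similarity is cosine similarity, and that a class score is the sum of each query descriptor's k largest similarities to the pooled descriptors of that class. The function names, the default `k=3`, and the use of fixed rather than learned features are all illustrative choices.

```python
import numpy as np


def feature_map_to_descriptors(fmap):
    """Turn a (C, H, W) feature map into H*W local descriptors of dimension C."""
    c, h, w = fmap.shape
    return fmap.reshape(c, h * w).T


def l2_normalize(x, eps=1e-12):
    """Scale each row to unit length so a dot product equals cosine similarity."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def image_to_class_score(query_desc, class_desc, k=3):
    """Sum, over all query descriptors, of their k largest cosine similarities
    to the descriptors pooled from every support image of one class.

    query_desc: (m, d) descriptors of the query image
    class_desc: (n, d) descriptors pooled over the class (assumed layout)
    """
    sim = l2_normalize(query_desc) @ l2_normalize(class_desc).T  # (m, n)
    k = min(k, sim.shape[1])
    # np.partition moves the k largest values of each row into the last k columns.
    topk = np.partition(sim, -k, axis=1)[:, -k:]
    return float(topk.sum())


def classify(query_desc, class_pools, k=3):
    """Return the index of the best-scoring class and the list of all scores."""
    scores = [image_to_class_score(query_desc, pool, k) for pool in class_pools]
    return int(np.argmax(scores)), scores
```

Because the descriptors of all support images in a class are pooled before the search, a query descriptor can match a visual pattern from any image of that class, which is one way to read the "exchangeability of visual patterns" mentioned above. In the end-to-end setting the abstract describes, these scores would be computed on learned features inside a differentiable framework rather than on fixed NumPy arrays.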

Local descriptor-based spatial cross attention network for few-shot learning

Imposing Semantic Consistency of Local Descriptors for Few-Shot Learning

Local Spatial Alignment Network for Few-Shot Learning

Spatial Attention Network for Few-Shot Learning

In defense of local descriptor-based few-shot object detection

A Simple Task-aware Contrastive Local Descriptor Selection Strategy for Few-shot Learning between inter class and intra class

TALDS-Net: Task-Aware Adaptive Local Descriptors Selection for Few-shot Image Classification

Cross Attention Network for Few-shot Classification

Revisiting Local Descriptor Based Image-To-Class Measure for Few-Shot Learning

SpatialFormer: Semantic and Target Aware Attentions for Few-Shot Learning

LGSim: local task-invariant and global task-specific similarity for few-shot classification

LDCA: Local Descriptors with Contextual Augmentation for Few-Shot Learning

Learning more discriminative local descriptors with parameter-free weighted attention for few-shot learning

Research on Cross Domain Few-shot Learning Method Based on Local Feature Association

Channel-spatial attention network for fewshot classification

Learning Task-aware Local Representations for Few-shot Learning

Local Feature Semantic Alignment Network for Few-Shot Image Classification

BDLA: Bi-directional local alignment for few-shot learning

Local feature graph neural network for few-shot learning

Boosting Few-Shot Segmentation via Instance-Aware Data Augmentation and Local Consensus Guided Cross Attention