Supervised Contrastive Representation Embedding Based on Transformer for Few-Shot Classification

Jianxin Wu,Xiangtao Tian,Guoqiang Zhong
DOI: https://doi.org/10.1088/1742-6596/2278/1/012022
2022-06-04
Journal of Physics: Conference Series
Abstract:Few-shot learning as an emerging problem has attracted intensive attention in the data-limited situations of Computer Vision. Recently, massive works concentrate on exploiting meta learning to improve performance of recognition. However, most of them overlook the intra-class and inter-class relationships within the support set during episodic training, which are significant and important for the tasks of downstream. To tackle this problem, we present an efficient supervised contrastive representation embedding for few-shot classification. Specially, (1) we employ Swin Transformer as the backbone to replace CNN architecture in order to explore the huge potential of transformer-based backbone for the field of few-shot learning. (2) We introduce supervised contrastive loss to meta learning to take good advantage of extremely limited relations for the first time. Combined the classifier based on metric learning, extensive experiments have demonstrated the efficiency of the representation embedding. We conduct experiments and establish competitive results on two widely-used few-shot classification benchmarks: Fewshot-CIFAR 100 (FC-100) and MiniImageNet.
What problem does this paper attempt to address?