Pseudo-Labeling Based Practical Semi-Supervised Meta-Training for Few-Shot Learning

Xingping Dong,Tianran Ouyang,Shengcai Liao,Bo Du,Ling Shao
2024-09-15
Abstract:Most existing few-shot learning (FSL) methods require a large amount of labeled data in meta-training, which is a major limit. To reduce the requirement of labels, a semi-supervised meta-training (SSMT) setting has been proposed for FSL, which includes only a few labeled samples and numbers of unlabeled samples in base classes. However, existing methods under this setting require class-aware sample selection from the unlabeled set, which violates the assumption of unlabeled set. In this paper, we propose a practical semi-supervised meta-training setting with truly unlabeled data to facilitate the applications of FSL in realistic scenarios. To better utilize both the labeled and truly unlabeled data, we propose a simple and effective meta-training framework, called pseudo-labeling based meta-learning (PLML). Firstly, we train a classifier via common semi-supervised learning (SSL) and use it to obtain the pseudo-labels of unlabeled data. Then we build few-shot tasks from labeled and pseudo-labeled data and design a novel finetuning method with feature smoothing and noise suppression to better learn the FSL model from noise labels. Surprisingly, through extensive experiments across two FSL datasets, we find that this simple meta-training framework effectively prevents the performance degradation of various FSL models under limited labeled data, and also significantly outperforms the state-of-the-art SSMT models. Besides, benefiting from meta-training, our method also improves two representative SSL algorithms as well.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is **to reduce the dependence on a large amount of labeled data in few - shot learning (FSL)**. Specifically, existing FSL methods require a large amount of labeled data in the meta - training phase, which has become a major limitation. To reduce the need for labeled data, the authors propose a semi - supervised meta - training (SSMT) setting, which contains only a small number of labeled samples and a large number of unlabeled samples in the base classes. However, existing methods need class - aware sample selection from the unlabeled set in this setting, which violates the assumptions of the unlabeled set. To solve these problems, the authors propose a new practical semi - supervised meta - training setting, using truly unlabeled data to promote the application of FSL in real - world scenarios. To this end, they design a simple and effective meta - training framework, called Pseudo - Labeling Based Meta - Learning (PLML). This framework is implemented through the following steps: 1. **Pre - training phase**: - Use a common semi - supervised learning (SSL) method to train a classifier and use it to obtain pseudo - labels for unlabeled data. 2. **Fine - tuning phase**: - Construct few - shot tasks from labeled data and pseudo - labeled data. - Design a new fine - tuning method, including feature smoothing and noise suppression, to better learn FSL models from noisy labels. Through extensive experiments, the authors find that this simple meta - training framework can effectively prevent the performance degradation of various FSL models under limited labeled data and significantly outperforms existing SSMT models. In addition, thanks to meta - training, their method also improves several representative SSL algorithms. ### Main contributions 1. **Propose a more practical semi - supervised meta - training setting** as a platform to promote the training of FSL models using a small amount of labeled base data. This is the first attempt to investigate and remove class - prior selection in SSFSL. 2. **Propose a simple and effective semi - supervised training algorithm**, called Pseudo - Labeling Based Meta - Learning (PLML), to reduce the need for labeled base data for most FSL models. This is the first time that the semi - supervised pre - training method has been introduced into SSFSL based on meta - training. 3. **Discover that the meta - learning strategy can further improve the performance of recent SSL models**, indicating that combining meta - learning may be a promising direction for developing new SSL methods. 4. **The method is model - independent**, successfully combining SSL with FSL and significantly improving the performance of FSL models (such as EP and Proto) under limited labeled base training data.