A Survey of Class-Imbalanced Semi-Supervised Learning

Qian Gui, Hong Zhou, Na Guo, Baoning Niu
DOI: https://doi.org/10.1007/s10994-023-06344-7
IF: 5.414
2024-01-01
Machine Learning
Abstract: Semi-supervised learning (SSL) can substantially improve the performance of deep neural networks by utilizing unlabeled data when labeled data is scarce. State-of-the-art (SOTA) semi-supervised algorithms implicitly assume that the class distributions of the labeled and unlabeled datasets are balanced, meaning that every class has the same number of training samples. When the class distribution of the training data is imbalanced, however, these algorithms perform poorly on minority classes. Recent work has found several ways to mitigate the degradation of semi-supervised learning models in class-imbalanced settings. In this article, we comprehensively review class-imbalanced semi-supervised learning (CISSL), starting with an introduction to the field, followed by a realistic evaluation of existing CISSL algorithms and a brief summary of them.