OCI-SSL: Open Class-Imbalanced Semi-Supervised Learning with Contrastive Learning

Yuting Zhou,Can Gao,Jie Zhou,Weiping Ding,Linlin Shen,Zhihui Lai
DOI: https://doi.org/10.1109/tetci.2024.3372436
2024-01-01
IEEE Transactions on Emerging Topics in Computational Intelligence
Abstract:Semi-supervised learning (SSL) is a powerful technique that leverages unlabeled data to improve model performance. Conventional SSL algorithms generally make the assumption that the unlabeled data are derived from approximately balanced known classes. However, in real-world scenarios, the unlabeled data may come from imbalanced known classes and out-of-distribution (OOD) unknown classes, which significantly impacts the performance of SSL algorithms. In this study, a more realistic framework for open class-imbalanced semi-supervised learning (OCI-SSL) is presented to address the challenges posed by imbalanced class distribution and OOD novel classes. To alleviate the adverse effects caused by the OOD data, an improved one-vs-all classifier that incorporates the strategies of hard-negative sampling and Bernoulli sampling is proposed to identify OOD samples. Then an auxiliary balancing classifier is designed to improve both the supervised loss of labeled data and the consistency regularization loss of unlabeled data in the presence of class imbalance by introducing a mask of rebalancing class distribution. Moreover, a weight-parameterized semi-supervised contrastive learning method is developed to enhance feature learning for all in-distribution data. Extensive experiments demonstrate that our method outperforms state-of-the-art methods and achieves an average accuracy improvement of 11.7% and AUROC improvement of 50.3% on CIFAR-10.
What problem does this paper attempt to address?