Abstract:Semi-supervised Learning (SSL) reduces the need for extensive annotations in deep learning, but the more realistic challenge of imbalanced data distribution in SSL remains largely unexplored. In Class Imbalanced Semi-supervised Learning (CISSL), the bias introduced by unreliable pseudo-labels can be exacerbated by imbalanced data distributions. Most existing methods address this issue at instance-level through reweighting or resampling, but the performance is heavily limited by their reliance on biased backbone representation. Some other methods do perform feature-level adjustments like feature blending but might introduce unfavorable noise. In this paper, we discuss the bonus of a more balanced feature distribution for the CISSL problem, and further propose a Balanced Feature-Level Contrastive Learning method (BaCon). Our method directly regularizes the distribution of instances' representations in a well-designed contrastive manner. Specifically, class-wise feature centers are computed as the positive anchors, while negative anchors are selected by a straightforward yet effective mechanism. A distribution-related temperature adjustment is leveraged to control the class-wise contrastive degrees dynamically. Our method demonstrates its effectiveness through comprehensive experiments on the CIFAR10-LT, CIFAR100-LT, STL10-LT, and SVHN-LT datasets across various settings. For example, BaCon surpasses instance-level method FixMatch-based ABC on CIFAR10-LT with a 1.21% accuracy improvement, and outperforms state-of-the-art feature-level method CoSSL on CIFAR100-LT with a 0.63% accuracy improvement. When encountering more extreme imbalance degree, BaCon also shows better robustness than other methods.

DeCAB: Debiased Semi-supervised Learning for Imbalanced Open-Set Data.

CDMAD: Class-Distribution-Mismatch-Aware Debiasing for Class-Imbalanced Semi-Supervised Learning

ABC: Auxiliary Balanced Classifier for Class-imbalanced Semi-supervised Learning

An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning

BaCon: Boosting Imbalanced Semi-supervised Learning via Balanced Feature-Level Contrastive Learning

OCI-SSL: Open Class-Imbalanced Semi-Supervised Learning with Contrastive Learning

Towards the Mitigation of Confirmation Bias in Semi-supervised Learning: a Debiased Training Perspective

SCD:Sampling-based Class Distribution for Imbalanced Semi-Supervised Learning

BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning

DCRP: Class-Aware Feature Diffusion Constraint and Reliable Pseudo-Labeling for Imbalanced Semi-Supervised Learning

Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning

A Survey of Class-Imbalanced Semi-Supervised Learning

Improving Barely Supervised Learning by Discriminating Unlabeled Samples with Super-Class

Relieving Long-tailed Instance Segmentation via Pairwise Class Balance

Learning Label Refinement and Threshold Adjustment for Imbalanced Semi-Supervised Learning

Class-Aware Contrastive Semi-Supervised Learning

Class-Imbalanced Semi-Supervised Learning with Adaptive Thresholding.

DeLaLA: Semisupervised Learning via Determinately Labeling and Kernelized Large Margin Projection

DC-SSL: Addressing Mismatched Class Distribution in Semi-supervised Learning

DHC: Dual-debiased Heterogeneous Co-training Framework for Class-imbalanced Semi-supervised Medical Image Segmentation

CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning