Abstract: Semi-Supervised Learning (SSL) has proven to be an effective way to leverage labeled and unlabeled data at the same time. Recent semi-supervised approaches focus on deep neural networks and have achieved promising results on several benchmarks: CIFAR10, CIFAR100 and SVHN. However, most of their experiments are based on models trained from scratch rather than on pre-trained models. Meanwhile, transfer learning has demonstrated its value when the target domain has limited labeled data. This raises a natural question: is it possible to incorporate SSL when fine-tuning a pre-trained model? We comprehensively study how SSL methods that start from pre-trained models perform under varying conditions, including training strategies, architecture choices and datasets. Practitioners may already have an intuitive understanding of the resulting observations, but we provide a comprehensive empirical analysis and demonstrate that: (1) the gains from SSL techniques over a fully-supervised baseline are smaller when training starts from a pre-trained model than when it starts from random initialization, (2) when the domain of the source data used to train the pre-trained model differs significantly from the domain of the target task, the gains from SSL are significantly higher, and (3) some SSL methods (like Pseudo-Label) are able to improve on fully-supervised baselines. We hope our study can deepen the understanding of SSL and facilitate the development of more effective SSL methods that utilize pre-trained models. Code is available on GitHub.

DP-SSL: Towards Robust Semi-supervised Learning with A Few Labeled Samples

DeLaLA: Semisupervised Learning via Determinately Labeling and Kernelized Large Margin Projection

LaSSL: Label-Guided Self-Training for Semi-supervised Learning

Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data

Robust Pseudo-Label Selection for Holistic Semi-Supervised Learning

Improving Barely Supervised Learning by Discriminating Unlabeled Samples with Super-Class

Robust Deep Semi-Supervised Learning: A Brief Introduction

Rethinking Pseudo-labeled Sample Mining for Semi-Supervised Object Detection

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

On Pseudo-Labeling for Class-Mismatch Semi-Supervised Learning

Semi-Supervised Learning via Weight-aware Distillation under Class Distribution Mismatch

Robust Semi-Supervised Learning when Not All Classes have Labels

Boosting Semi-Supervised Learning with Dual-Threshold Screening and Similarity Learning

An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning

Leveraging Local Variance for Pseudo-Label Selection in Semi-supervised Learning

DC-SSL: Addressing Mismatched Class Distribution in Semi-supervised Learning

Enhancing Semi-Supervised Learning via Representative and Diverse Sample Selection

Semi-Supervised Learning in the Few-Shot Zero-Shot Scenario

When Semi-Supervised Learning Meets Transfer Learning: Training Strategies, Models and Datasets

Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation