Abstract:Co-training, an advanced form of self-training, allows multiple base models to learn collaboratively, leading to superior performance in semi-supervised learning tasks. However, its widespread adoption is hindered by high computational costs and intricate design choices. To address these challenges, we present Multi-Head Co-Training, a streamlined and efficient framework that consolidates individual models into a multi-head structure, adding minimal extra parameters. Each classification head in this unified model collaborates with others via a “Weak and Strong Augmentation” strategy, with diversity organically introduced through robust data augmentation. Consequently, our approach implicitly promotes diversity while incurring only a minor increase in computational overhead, making co-training more accessible. We validate the effectiveness of Multi-Head Co-Training through an empirical study on standard semi-supervised learning benchmarks. For example, our method achieves up to a 3.1% accuracy improvement on the semi-supervised CIFAR dataset compared to recent methods.Recognizing the necessity for more practical performance metrics beyond accuracy, we assess our framework from three additional perspectives: robust generalization, uncertainty, and computational efficiency. To evaluate robust generalization, we expand the conventional SSL experimental setting to a more comprehensive open-set semi-supervised learning scenario. For uncertainty assessment, we conduct experiments on model calibration and selective classification benchmarks. For example, our method achieves up to a 4.3% accuracy improvement on the open-set semi-supervised CIFAR dataset. Our extensive experiments confirm that our proposed framework better captures prediction confidence and uncertainty, rendering it more suitable for SSL deployment in open environments. The code is available at https://github.com/chenmc1996/Multi-Head-Co-Training.

Towards Making Co-Training Suffer Less from Insufficient Views

Co-Training with Insufficient Views

An improved co-training style algorithm:Compatible Co-training

Learning with Weak Views Based on Dependence Maximization Dimensionality Reduction.

When Does Co-Training Work in Real Data?

Analyzing Co-training Style Algorithms

A New Analysis of Co-Training

Safe co-training for semi-supervised regression

Theoretical Foundation of Co-Training and Disagreement-Based Algorithms.

Semi-Supervised Learning with Multi-Head Co-Training

CoTrade: Confident Co-Training With Data Editing.

Enhancing Web Page Classification via Local Co-training

Self-paced Multi-view Co-training.

Inter-training: Exploiting Unlabeled Data in Multi-Classifier Systems

View Consistency based Co-training style MultiView Learning : A Privacy-Preserving Approach

A Co-training Approach for Noisy Time Series Learning

Diverse Cotraining Makes Strong Semi-Supervised Segmentor

Co-learning: Learning from Noisy Labels with Self-supervision

When less is more: on the value of "co-training" for semi-supervised software defect predictors

Hessian-regularized Co-Training for Social Activity Recognition.

Multi-head Co-Training: an Uncertainty-Aware and Robust Semi-Supervised Learning Framework