Abstract: Co-training, an advanced form of self-training, allows multiple base models to learn collaboratively, leading to superior performance in semi-supervised learning tasks. However, its widespread adoption is hindered by high computational costs and intricate design choices. To address these challenges, we present Multi-Head Co-Training, a streamlined and efficient framework that consolidates individual models into a multi-head structure, adding minimal extra parameters. Each classification head in this unified model collaborates with others via a “Weak and Strong Augmentation” strategy, with diversity organically introduced through robust data augmentation. Consequently, our approach implicitly promotes diversity while incurring only a minor increase in computational overhead, making co-training more accessible. We validate the effectiveness of Multi-Head Co-Training through an empirical study on standard semi-supervised learning benchmarks. For example, our method achieves up to a 3.1% accuracy improvement on the semi-supervised CIFAR dataset compared to recent methods. Recognizing the necessity for more practical performance metrics beyond accuracy, we assess our framework from three additional perspectives: robust generalization, uncertainty, and computational efficiency. To evaluate robust generalization, we expand the conventional SSL experimental setting to a more comprehensive open-set semi-supervised learning scenario. For uncertainty assessment, we conduct experiments on model calibration and selective classification benchmarks. For example, our method achieves up to a 4.3% accuracy improvement on the open-set semi-supervised CIFAR dataset. Our extensive experiments confirm that our proposed framework better captures prediction confidence and uncertainty, rendering it more suitable for SSL deployment in open environments. The code is available at https://github.com/chenmc1996/Multi-Head-Co-Training.
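
The abstract describes classification heads that collaborate through weak and strong augmentation but gives no formulas. The sketch below is one plausible reading, not the authors' implementation: it assumes each head is supervised on strongly augmented inputs with pseudo-labels taken from the other heads' predictions on weakly augmented inputs, and only where those other heads agree. The function names, the agreement rule, and the array shapes are all illustrative assumptions; the linked repository is the authoritative source.

```python
import numpy as np

def cross_head_pseudo_labels(weak_logits):
    """Hypothetical helper. weak_logits has shape (heads, batch, classes).

    For each head, the pseudo-label is the class predicted by the *other*
    heads on the weak view; it is kept only where all other heads agree
    (an assumed agreement rule, not taken from the abstract).
    """
    preds = weak_logits.argmax(axis=-1)            # (heads, batch)
    labels = np.zeros_like(preds)
    mask = np.zeros(preds.shape, dtype=bool)
    for h in range(preds.shape[0]):
        others = np.delete(preds, h, axis=0)       # predictions of the peers
        labels[h] = others[0]
        mask[h] = (others == others[0]).all(axis=0)
    return labels, mask

def masked_cross_entropy(strong_logits, labels, mask):
    """Cross-entropy of each head's strong-view logits against its
    pseudo-labels, averaged over the positions selected by mask."""
    shifted = strong_logits - strong_logits.max(axis=-1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))
    picked = np.take_along_axis(log_probs, labels[..., None], axis=-1)[..., 0]
    return float((-picked * mask).sum() / max(mask.sum(), 1))

# Toy example: 3 heads, 2 unlabeled samples, 2 classes.
weak = np.array([
    [[2.0, 0.0], [0.0, 2.0]],   # head 0 predicts classes 0, 1
    [[2.0, 0.0], [2.0, 0.0]],   # head 1 predicts classes 0, 0
    [[2.0, 0.0], [0.0, 2.0]],   # head 2 predicts classes 0, 1
])
strong = np.zeros((3, 2, 2))    # placeholder strong-view logits
labels, mask = cross_head_pseudo_labels(weak)
loss = masked_cross_entropy(strong, labels, mask)
```

In this toy batch, head 1 receives pseudo-labels for both samples because heads 0 and 2 agree on each, while heads 0 and 2 receive one only for the first sample. Because all heads would share a single backbone in a multi-head model, the extra cost over a single classifier is limited to the additional head parameters.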

Inter-training: Exploiting Unlabeled Data in Multi-Classifier Systems

Multi-head Co-Training: an Uncertainty-Aware and Robust Semi-Supervised Learning Framework

Semi-Supervised Learning with Multi-Head Co-Training

Tri-Training: Exploiting Unlabeled Data Using Three Classifiers

CoTrade: Confident Co-Training With Data Editing

Theoretical Foundation of Co-Training and Disagreement-Based Algorithms

An improved co-training style algorithm: Compatible Co-training

Self-paced Multi-view Co-training

A Hybrid Generative/discriminative Method for Semi-Supervised Classification

Co-Training with Insufficient Views

Analyzing Co-training Style Algorithms

Diverse Cotraining Makes Strong Semi-Supervised Segmentor

Multi-view Collaborative Semi-Supervised Classification Algorithm Based on Diversity Measurers of Classifier with the Combination of Agreement and Disagreement Label Rules

Stacked Co-Training for Semi-Supervised Multi-Label Learning

Learning with Weak Views Based on Dependence Maximization Dimensionality Reduction

Hessian-regularized Co-Training for Social Activity Recognition

Towards Making Co-Training Suffer Less from Insufficient Views

A Co-Training Approach for Sequential Three-Way Decisions

DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation

Cross-to-merge training with class balance strategy for learning with noisy labels

A New Analysis of Co-Training