Abstract:In this paper, we study weakly supervised learning where a large amount of data supervision is not accessible. This includes i) incomplete supervision, where only a small subset of labels is given, such as semi-supervised learning and domain adaptation; ii) inexact supervision, where only coarse-grained labels are given, such as multi-instance learning and iii) inaccurate supervision, where the given labels are not always ground-truth, such as label noise learning. Unlike supervised learning which typically achieves performance improvement with more labeled examples, weakly supervised learning may sometimes even degenerate performance with more weakly supervised data. Such deficiency seriously hinders the deployment of weakly supervised learning to real tasks. It is thus highly desired to study safe weakly supervised learning, which never seriously hurts performance. To this end, we present a generic ensemble learning scheme to derive a safe prediction by integrating multiple weakly supervised learners. We optimize the worst-case performance gain and lead to a maximin optimization. This brings multiple advantages to safe weakly supervised learning. First, for many commonly used convex loss functions in classification and regression, it is guaranteed to derive a safe prediction under a mild condition. Second, prior knowledge related to the weight of the base weakly supervised learners can be flexibly embedded. Third, it can be globally and efficiently addressed by simple convex quadratic or linear program. Finally, it is in an intuitive geometric interpretation with the least square loss. Extensive experiments on various weakly supervised learning tasks, including semi-supervised learning, domain adaptation, multi-instance learning and label noise learning demonstrate our effectiveness.

Training Subset Selection for Weak Supervision

Data Consistency for Weakly Supervised Learning

A Brief Introduction to Weakly Supervised Learning

Towards a statistical theory of data selection under weak supervision

Universalizing Weak Supervision

Weak Supervision Performance Evaluation via Partial Identification

End-to-End Weak Supervision

Constrained Labeling for Weakly Supervised Learning

Data Subset Selection with Imperfect Multiple Labels

A Survey on Programmatic Weak Supervision

Creating Training Sets via Weak Indirect Supervision

AutoWS: Automated Weak Supervision Framework for Text Classification

Understanding temporally weakly supervised training: A case study for keyword spotting

Bandit Label Inference for Weakly Supervised Learning

Lifting Weak Supervision To Structured Prediction

Multiple weak supervision for short text classification

Weak Supervision Learning for Object Co-Segmentation

Towards Safe Weakly Supervised Learning

Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts

A Weak Supervision Approach for Few-Shot Aspect Based Sentiment

Reliable Weakly Supervised Learning: Maximize Gain and Maintain Safeness