A General Framework for Learning from Weak Supervision

Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj
2024-06-06
Abstract: Weakly supervised learning generally faces challenges in applicability to various scenarios with diverse weak supervision and in scalability due to the complexity of existing algorithms, thereby hindering the practical deployment. This paper introduces a general framework for learning from weak supervision (GLWS) with a novel algorithm. Central to GLWS is an Expectation-Maximization (EM) formulation, adeptly accommodating various weak supervision sources, including instance partial labels, aggregate statistics, pairwise observations, and unlabeled data. We further present an advanced algorithm that significantly simplifies the EM computational demands using a Non-deterministic Finite Automaton (NFA) along with a forward-backward algorithm, which effectively reduces time complexity from quadratic or factorial often required in existing solutions to linear scale. The problem of learning from arbitrary weak supervision is therefore converted to the NFA modeling of them. GLWS not only enhances the scalability of machine learning models but also demonstrates superior performance and versatility across 11 weak supervision scenarios. We hope our work paves the way for further advancements and practical deployment in this field.
Machine Learning, Artificial Intelligence
### What problems does this paper attempt to solve?

This paper addresses two major challenges in weakly supervised learning (WSL): **generality and scalability**.

1. **Generality challenge**:
   - In practice, weak supervision comes in many forms, including partial labels, aggregate statistics, pairwise observations, and unlabeled data. Existing methods usually handle only one specific type and lack a general framework that treats all of these forms uniformly.
   - The paper proposes a General Framework for Learning from Weak Supervision (GLWS), which can flexibly handle these types of weak supervision.
2. **Scalability challenge**:
   - Existing methods have high computational complexity on complex weak supervision, which limits their use on large-scale datasets. For example, some methods require quadratic or even factorial time when dealing with long sequences.
   - GLWS reduces the time complexity from quadratic or factorial to linear by modeling the weak supervision as a Non-deterministic Finite Automaton (NFA) and applying a forward-backward algorithm, which improves scalability.

### Main contributions

1. **Proposing the GLWS framework**:
   - GLWS is built on an Expectation-Maximization (EM) formulation and accommodates various forms of weak supervision, including partial labels, aggregate statistics, pairwise observations, and unlabeled data.
   - The framework contains two learning objectives: an unsupervised instance-consistency term and a supervised term, which together keep the model's predictions consistent with the weak supervision.
2. **Designing an efficient forward-backward algorithm**:
   - The weak supervision is modeled as an NFA, and the forward-backward algorithm computes the quantities required by the EM formulation in linear time, which significantly reduces the computational cost.
   - This lets GLWS run efficiently on large-scale datasets and makes it suitable for more practical scenarios.
3. **Extensive experimental verification**:
   - Across 11 weak supervision settings, GLWS achieves state-of-the-art performance, demonstrating its generality and effectiveness.
   - The experiments cover multiple datasets, including CIFAR-10, CIFAR-100, STL-10, and ImageNet-100, which supports the broad applicability of GLWS.

By addressing these challenges, GLWS paves the way for the practical application of weakly supervised learning and promotes further development of the field.
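To make the NFA and forward-backward idea concrete, here is a minimal sketch for one simple kind of weak supervision: a bag of binary-labeled instances known to contain at least one positive. It is an illustration under stated assumptions, not the authors' implementation. The names `forward_backward`, `brute_force`, `MIL_TRANSITIONS`, and `PROBS` are hypothetical, and the automaton used here happens to be deterministic, which is a special case of an NFA. The function computes the per-instance label posteriors that an EM-style E-step would need, and a brute-force enumeration over all labelings is included only as a reference for comparison.

```python
from itertools import product

def forward_backward(probs, transitions, n_states, start, accepting):
    """Posterior P(y_i = c | weak supervision) for each instance i and class c.

    probs[i][c]  : model probability that instance i has label c
    transitions  : dict mapping (state, label) -> tuple of next states
    Assumption: every label sequence has at most one accepting path,
    so summing over paths equals summing over label sequences.
    """
    n, n_classes = len(probs), len(probs[0])
    # alpha[i][s]: probability mass of prefixes y_1..y_i that end in state s
    alpha = [[0.0] * n_states for _ in range(n + 1)]
    alpha[0][start] = 1.0
    for i in range(n):
        for s in range(n_states):
            for c in range(n_classes):
                for t in transitions.get((s, c), ()):
                    alpha[i + 1][t] += alpha[i][s] * probs[i][c]
    # beta[i][s]: probability mass of suffixes y_{i+1}..y_n that lead
    # from state s to an accepting state
    beta = [[0.0] * n_states for _ in range(n + 1)]
    for s in accepting:
        beta[n][s] = 1.0
    for i in range(n - 1, -1, -1):
        for s in range(n_states):
            for c in range(n_classes):
                for t in transitions.get((s, c), ()):
                    beta[i][s] += probs[i][c] * beta[i + 1][t]
    z = beta[0][start]  # total probability of satisfying the weak supervision
    post = [[0.0] * n_classes for _ in range(n)]
    for i in range(n):
        for s in range(n_states):
            for c in range(n_classes):
                for t in transitions.get((s, c), ()):
                    post[i][c] += alpha[i][s] * probs[i][c] * beta[i + 1][t] / z
    return post

def brute_force(probs, accepts):
    """Reference posterior by enumerating all 2^n binary labelings."""
    n = len(probs)
    post = [[0.0, 0.0] for _ in range(n)]
    z = 0.0
    for ys in product(range(2), repeat=n):
        if not accepts(ys):
            continue
        p = 1.0
        for i, y in enumerate(ys):
            p *= probs[i][y]
        z += p
        for i, y in enumerate(ys):
            post[i][y] += p
    return [[v / z for v in row] for row in post]

# Two-state automaton for "at least one positive in the bag":
# state 0 = no positive seen yet, state 1 = a positive has been seen.
MIL_TRANSITIONS = {(0, 0): (0,), (0, 1): (1,), (1, 0): (1,), (1, 1): (1,)}

# Hypothetical model outputs [P(y=0), P(y=1)] for a bag of three instances.
PROBS = [[0.9, 0.1], [0.6, 0.4], [0.7, 0.3]]

if __name__ == "__main__":
    fast = forward_backward(PROBS, MIL_TRANSITIONS, n_states=2, start=0, accepting={1})
    slow = brute_force(PROBS, lambda ys: any(ys))
    for row_fast, row_slow in zip(fast, slow):
        print(row_fast, row_slow)
```

The forward-backward pass visits each instance once per transition, so its cost grows linearly with the bag size for a fixed automaton, while the enumeration grows exponentially. In this sketch, other forms of weak supervision would be expressed by swapping in a different transition table and set of accepting states.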