Co-Training with Insufficient Views

Wei Wang,Zhi-Hua Zhou
2013-01-01
Abstract:Co-training is a famous semi-supervised learning paradigm exploiting unlabeled data with two views. Most previous theoretical analyses on co-training are based on the assumption that each of the views is sucient to correctly predict the label. However, this assumption can hardly be met in real applications due to feature corruption or various feature noise. In this paper, we present the theoretical analysis on co-training when neither view is sucient. We dene the diversity between the two views with respect to the condence of prediction and prove that if the two views have large diversity, co-training is able to improve the learning performance by exploiting unlabeled data even with insucient views. We also discuss the relationship between view insuciency and diversity, and give some implications for understanding of the dierence between co-training and co-regularization.
What problem does this paper attempt to address?