SIVLC: Improving the Performance of Co-Training by Sufficient-Irrelevant Views and Label Consistency

Yanlu Gong,Quanwang Wu
DOI: https://doi.org/10.1007/s10489-023-04611-7
IF: 5.3
2023-01-01
Applied Intelligence
Abstract:As one of the most successful paradigms for semi-supervised learning, co-training trains two classifiers through two views, and it selects unlabeled samples and adds them to the labeled sample set iteratively. The traditional co-training method requires two natural self-sufficient and independent views, which is too strict and limits the applicability of co-training. Although several methods have been proposed to relax this requirement via manual view partition, they only consider difference of generated views and neglect relationship among them. Moreover, sample labels predicted by different classifiers in co-training may be not consistent but this consistency has not been fully exploited by existing methods. To solve these issues, we design a method that uses sufficient-irrelevant views and label consistency (SIVLC) to improve the performance of co-training. SIVLC can manually generate any number of views through the concept of sufficient-irrelevant views to expand the application scope of co-training. Moreover, it takes into account performance difference of classifiers and hence the label consistency of labeled data is used to measure the weight of classifiers while that of unlabeled data is applied to select unlabeled samples with high confidence. Since the performance of classifiers during training may fluctuate, a decision strategy of verification set is set in SIVLC. Four groups of experiments with 18 data sets are conducted and the results reveal that the proposed method is more effective than some state-of-the-art ones.
What problem does this paper attempt to address?