Tri-Training Algorithm with Data Editing

ZHANG Yan,LIN Ying
DOI: https://doi.org/10.3969/j.issn1672-9722.2013.10.011
2013-01-01
Abstract:Tri-Training is a semi-supervised learning algorithm in which three learners keep on labeling unlabeled examples and retraining themselves on an enlarged training set.Since the Tri-training process may erroneously label some unlabeled examples,introduce the noise data and degrade the performance of classification.This paper utilizes the data editing methods including the DE-KNN,DE-BKNN and DENED to identify and remove the mislabeled examples from the labeled data based on Tri-Training algorithm.Some experiments are carried out on the six UCI data sets.The results of experiments show that the introduction of data editing is beneficial,and the learned hypotheses of data editing combination with Tri-Training outperform those learned by the standard Tri-training algorithm.Especially,the DE-NED method is better than others.
What problem does this paper attempt to address?