Semi-Supervised Classification with Universum

Dan Zhang, Jingdong Wang, Fei Wang, Changshui Zhang
DOI: https://doi.org/10.1137/1.9781611972788.29
2008-01-01
Abstract: The Universum data, defined as a collection of "non-examples" that do not belong to any class of interest, have been shown to encode some prior knowledge by representing meaningful concepts in the same domain as the problem at hand. In this paper, we address a novel semi-supervised classification problem, called semi-supervised Universum, that can simultaneously utilize the labeled data, unlabeled data and the Universum data to improve the classification performance. We propose a graph-based method to make use of the Universum data to help depict the prior information for possible classifiers. Like conventional graph-based semi-supervised methods, the graph regularization is also utilized to favor the consistency between the labels. Furthermore, since the proposed method is a graph-based one, it can be easily extended to the multi-class case. The empirical experiments on the USPS and MNIST datasets are presented to show that the proposed method can obtain superior performance over conventional supervised and semi-supervised methods.