High Reliable Multi-View Semi-Supervised Learning with Extremely Sparse Labeled Data

Shiliang Sun
DOI: https://doi.org/10.1109/HIS.2008.12
2008-01-01
Abstract:Most semi-supervised learning methods assume there are a number of labeled data available in order to learn a classifier which then exploits a large set of unlabeled data. However, for some applications, there are only extremely spare labeled examples attainable (say, one example per category). In this case, these semi-supervised learning methods can not work. In this paper, a new method for seeking more examples with high reliable labels based on the limited labeled data is proposed. By investigating the correlation between different views through canonical correlation analysis, our method can launch semi-supervised learning using only one labeled example from each class. Experiments on text classification show the effectiveness of the proposed method.
What problem does this paper attempt to address?