A Survey of Semi-supervised Text Categorization

NIU Gang, LUO Aibao, SHANG Lin
DOI: https://doi.org/10.3778/j.issn.1673-9418.2011.04.003
2011-01-01
Abstract: Text categorization is a common problem in people's daily work and an interesting research area in machine learning. Semi-supervised learning algorithms, which use both labeled and unlabeled data, can improve learning effectiveness significantly. This paper gives the definition and characteristics of text categorization and introduces the traditional supervised learning algorithms and evaluation metrics. It then analyzes the characteristics and basic theory of semi-supervised text categorization and discusses several semi-supervised text categorization algorithms, such as Bayesian methods and regularization methods.