Cost-Driven Active Learning with Semi-Supervised Cluster Tree for Text Classification

zhaocai sun,yunming ye,yan li,shengchun deng,xiaolin du
DOI: https://doi.org/10.1007/978-3-319-05503-9_5
2014-01-01
Abstract:The key idea of active learning is that it can perform better with less data or costs if a machine learner is allowed to choose the data actively. However, the relation between labeling cost and model performance is seldom studied in the literature. In this paper, we thoroughly study this problem and give a criterion called as cost-performance to balance this relation. Based on the criterion, a cost-driven active SSC algorithm is proposed, which can stop the active process automatically. Empirical results show that our method outperforms active SVM and co-EMT. © Springer International Publishing Switzerland 2014.
What problem does this paper attempt to address?