Active Semi-supervised Text Clustering Based on Pairwise Constraints

ZHONG Jiang,LIU Long-hai,LIANG Chuan-wei
DOI: https://doi.org/10.3969/j.issn.1000-3428.2011.13.059
2011-01-01
Abstract:An active method which can effectively select pairwise constraints is constructed.By using this method,an active semi-supervised text clustering method based on pairwise constraints is proposed.Latent Semantic Index(LSI) is used to reduce the dimension of text features.In the clustering process,it uses the proposed method to actively select pairwise constraints,and then uses these pairwise constraints to steer the clustering process towards an appropriate partition.Experimental results show that the proposed method can effectively improve the text clustering results by using a small amount of pairwise constraints.
What problem does this paper attempt to address?