A Comparative Study on Text Clustering Methods

Yan Zheng, Xiaochun Cheng, Ronghuai Huang, Yi Man
DOI: https://doi.org/10.1007/11811305_71
2006-01-01
Abstract: Text clustering is one of the most important research areas in text mining, a field that processes text automatically to discover implicit knowledge. It groups texts into clusters by content, without a priori knowledge. In this paper, we study different text clustering methods, along with three text clustering validation criteria that we use to evaluate the experimental results. We compare the effectiveness of the k-means and FIHC text clustering methods experimentally and discuss the differing quality of the resulting text clusters.