Text Clustering Based on Automatic Partition of Feature Item Weight

YU Yong-hong,BAI Wen-yang
DOI: https://doi.org/10.3969/j.issn.1000.3842.2011.11.009
2011-01-01
Abstract: This paper introduces a novel automatic text clustering method. A Genetic Algorithm (GA) is applied to feature selection, using its global optimization capability and high search efficiency to achieve dimensionality reduction. An appropriate number of partitions of the document set is then created according to the different combinations of feature weights. Each document partition is clustered into initial clusters using a dynamic programming technique, and all initial clusters are then clustered by the same method into the final text clusters. Experimental results show that the method achieves dimensionality reduction efficiently, improves text clustering precision, and decreases clustering time.
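The abstract does not specify the GA's encoding, operators, or fitness function, so the following is only a minimal sketch of GA-based feature selection under stated assumptions, not the authors' implementation. It assumes a bit-mask chromosome (one bit per feature), one-point crossover, bit-flip mutation, and a toy fitness that rewards high-variance feature weights and penalizes subset size. The names `fitness`, `ga_select`, and all parameters are hypothetical.

```python
import random

def fitness(mask, weights, penalty=0.1):
    """Toy fitness (an assumption, not from the paper): sum of the variance
    of each kept feature's weights minus a fixed penalty per kept feature."""
    n_docs = len(weights)
    score = 0.0
    for j, keep in enumerate(mask):
        if keep:
            col = [row[j] for row in weights]
            mean = sum(col) / n_docs
            score += sum((x - mean) ** 2 for x in col) / n_docs - penalty
    return score

def ga_select(weights, pop_size=20, generations=30, p_mut=0.05, seed=0):
    """Return a 0/1 mask over features. Needs at least 2 features and
    pop_size >= 4 so that crossover and parent sampling are defined."""
    rng = random.Random(seed)
    n_feat = len(weights[0])
    pop = [[rng.randint(0, 1) for _ in range(n_feat)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda m: fitness(m, weights), reverse=True)
        elite = pop[: pop_size // 2]            # the better half survives
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)         # pick two distinct parents
            cut = rng.randrange(1, n_feat)      # one-point crossover
            child = a[:cut] + b[cut:]
            # bit-flip mutation: each bit flips with probability p_mut
            child = [bit ^ (rng.random() < p_mut) for bit in child]
            children.append(child)
        pop = elite + children
    return max(pop, key=lambda m: fitness(m, weights))

# Hypothetical document-by-feature weight matrix (4 documents, 4 features).
toy_weights = [
    [0.0, 0.5, 1.0, 0.2],
    [1.0, 0.5, 0.0, 0.2],
    [0.0, 0.5, 0.0, 0.2],
    [1.0, 0.5, 1.0, 0.2],
]
selected_mask = ga_select(toy_weights)
```

In this toy data, features 1 and 3 are constant across documents, so the assumed fitness gives no reason to keep them. A real system would replace the fitness with a measure tied to clustering quality.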