An Incremental Algorithm of Text Clustering Based on Semantic Sequences

Feng Zhonghui,Shen Junyi,Bao Junpeng
DOI: https://doi.org/10.1007/bf02829263
2006-01-01
Wuhan University Journal of Natural Sciences
Abstract:This paper proposed an incremental textclustering algorithm based on semantic sequence. Using similarity relation of semantic sequences and calculating the cover of similarity semantic sequences set, the candidate cluster with minimum entropy overlap value was selected as a result cluster every time in this algorithm. The comparison of experimental results shows that the precision of the algorithm is higher than other algorithms under same conditions and this is obvious especially on long documents set.
What problem does this paper attempt to address?