Text Clustering Algorithm Based on Spectral Graph Seriation

Guo Wensheng, Li Guohe
DOI: https://doi.org/10.1109/ccdc.2009.5192371
2009-01-01
Abstract: In information processing, most existing text clustering algorithms are based on the Vector Space Model (VSM). However, VSM cannot effectively express the structure of a text, and therefore cannot fully capture its semantic information. To improve the expression of semantic information, this paper presents a new text structure graph model, which uses a weighted graph to represent the characteristic terms of a text and their associated location information. On the basis of spectral graph seriation, a spectral clustering algorithm is then put forward. This algorithm replaces the search for common subgraphs with matrix computation, thereby reducing the computational complexity of graph clustering. The paper also includes an analysis of the algorithm and experiments. The results show that the text clustering algorithm based on spectral graph seriation is effective and feasible.
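The abstract does not spell out how the graph is built or how it is seriated, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a hypothetical co-occurrence window, an edge weight of 1/distance between term positions, and Fiedler-vector ordering as the seriation step. All function names and parameters here are illustrative.

```python
import numpy as np

def build_term_graph(tokens, window=2):
    """Build a weighted term graph (assumed construction, not from the paper).

    Nodes are distinct terms; an edge links two terms that occur within
    `window` positions of each other, weighted by 1/distance.
    """
    vocab = sorted(set(tokens))
    index = {term: i for i, term in enumerate(vocab)}
    A = np.zeros((len(vocab), len(vocab)))
    for i, term in enumerate(tokens):
        for j in range(i + 1, min(i + window + 1, len(tokens))):
            u, v = index[term], index[tokens[j]]
            if u != v:
                weight = 1.0 / (j - i)  # closer terms get heavier edges
                A[u, v] += weight
                A[v, u] += weight
    return vocab, A

def seriate(A):
    """Order nodes by the Fiedler vector of the graph Laplacian.

    This is one common spectral seriation technique, used here as an
    assumption; it expects a connected graph.
    """
    laplacian = np.diag(A.sum(axis=1)) - A
    _, eigenvectors = np.linalg.eigh(laplacian)  # eigenvalues ascending
    fiedler = eigenvectors[:, 1]  # eigenvector of 2nd-smallest eigenvalue
    return np.argsort(fiedler)

vocab, A = build_term_graph("a b c d e".split(), window=1)
order = seriate(A)
print([vocab[i] for i in order])
```

For a chain of distinct terms such as this one, the ordering recovers the chain or its reverse, since the sign of an eigenvector is arbitrary. The point of such a step is that each graph is reduced to a sequence by matrix computation alone, so texts can be compared without searching for common subgraphs.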