VSM-based Text Clustering Algorithm

姚清耘,刘功申,李翔
DOI: https://doi.org/10.3969/j.issn.1000-3428.2008.18.014
2008-01-01
Abstract:Text clustering, one of the most important research braches of clustering, is the application of clustering algorithm in text processing.This paper discusses different Vector Space Model(VSM)-based clustering algorithms and presents an improved text clustering algorithm——Level-Panel(LP) algorithm.In addition, according to the effects of clustering for the corpus, it presents optimizations of clustering algorithm, including dimension determining, feature selection, etc.It is proved that LP algorithm can effectively reduce the time spending in clustering process.It is high in practicability and flexibility.
What problem does this paper attempt to address?