A Dynamic Som Algorithm For Clustering Large-Scale Document Collection

Kegang Luo,Yuanchao Liu,Xiaolong Wang
DOI: https://doi.org/10.1109/ALPIT.2007.55
2007-01-01
Abstract:A dynamic SOM algorithm of incremental gradient descent to cluster large-scale document collection is proposed in this paper. In comparison with other SOM algorithms (e.g. GHSOM), the size of output layer our algorithm can be gradually reduced and dynamically by inserting suitable number of neurons, thus the number of underutilized neurons can be reduced greatly and the training results of this algorithm can fully represent the distribution of topics in document collection. 177 addition, When Using this algorithm to cluster large-scale documents the computation cost can also be shortened remarkably. The overused neurons have been split again to optimize the cluster results further. A good result of cluster can be gained. Experiments results proved the effectiveness of this algorithm.
What problem does this paper attempt to address?