Chinese Auto-Clustering of Oral Conversation Corpus Based on Contextual Features

Yue Chen, Qi Chen, Minghu Jiang
DOI: https://doi.org/10.14355/spr.2015.04.004
2015-01-01
Signal Processing Research
Abstract: Chinese text clustering requires more linguistic knowledge in order to understand and analyze natural language accurately. To improve clustering accuracy, this article uses the self-organizing map (SOM) algorithm together with a contextual dictionary to add contextual features to the automatic clustering of a Chinese oral conversation corpus, and tests the effect of this pragmatic application.
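The abstract does not describe how utterances are encoded or how the SOM is configured, so the following is only a minimal sketch of the general idea, under stated assumptions: each utterance is a lexical feature vector concatenated with a contextual feature vector, and a small SOM groups the combined vectors. The toy vectors, the 2x2 map size, the decay schedule, and the names `train_som` and `best_matching_unit` are all hypothetical and are not taken from the paper.

```python
import math
import random


def best_matching_unit(x, weights):
    """Return the index of the map unit whose weight vector is closest to x."""
    dists = [sum((xi - wi) ** 2 for xi, wi in zip(x, w)) for w in weights]
    return dists.index(min(dists))


def train_som(data, rows=2, cols=2, epochs=50, lr0=0.5, sigma0=1.0, seed=0):
    """Train a small SOM and return one weight vector per map unit."""
    rng = random.Random(seed)
    # Grid position of every unit, used by the neighborhood function.
    grid = [(r, c) for r in range(rows) for c in range(cols)]
    # Initialize each unit from a randomly chosen sample.
    weights = [list(rng.choice(data)) for _ in grid]
    total_steps = epochs * len(data)
    step = 0
    for _ in range(epochs):
        order = list(range(len(data)))
        rng.shuffle(order)
        for i in order:
            # Learning rate and neighborhood radius decay linearly (an assumption).
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)
            sigma = sigma0 * (1.0 - frac) + 1e-3
            x = data[i]
            bmu = best_matching_unit(x, weights)
            for u, (r, c) in enumerate(grid):
                d2 = (r - grid[bmu][0]) ** 2 + (c - grid[bmu][1]) ** 2
                h = math.exp(-d2 / (2.0 * sigma ** 2))
                # Pull the unit toward the sample, scaled by grid proximity to the BMU.
                weights[u] = [w + lr * h * (xi - w) for w, xi in zip(weights[u], x)]
            step += 1
    return weights


# Hypothetical toy data: 3 lexical features and 2 contextual features per utterance.
lexical = [[1, 0, 1], [1, 0, 0], [0, 1, 1], [0, 1, 0]]
contextual = [[1, 0], [1, 0], [0, 1], [0, 1]]
features = [lex + ctx for lex, ctx in zip(lexical, contextual)]

weights = train_som(features)
labels = [best_matching_unit(x, weights) for x in features]
print(labels)
```

Each entry of `labels` is the index of the map unit an utterance falls on; utterances mapped to the same unit form one cluster. Concatenation is only one way to combine the two feature sets, and weighting the contextual part differently would change the resulting grouping.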