Text Clustering on Oral Conversation Corpus.

Ding Liu, Minghu Jiang
DOI: https://doi.org/10.1007/978-3-642-30732-4_22
2012-01-01
Abstract: This article describes a method that uses context information terms in text clustering based on an oral conversation corpus. We tested the method with various distance measures in experiments using the SOM algorithm and the K-means algorithm. The experimental results showed that the context information terms have an effect on text clustering because of their high occurrence frequency. We also found that the Hamming distance is the best choice of distance measure for the SOM algorithm.
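To make the role of the distance measure concrete, the following is a minimal sketch of how Hamming and Euclidean distances could be swapped inside the step that assigns a document to its closest SOM unit or K-means centroid. It is not the authors' implementation: the binary term-presence representation, the toy vectors, and the function names (`hamming_distance`, `euclidean_distance`, `best_matching_unit`) are all assumptions made for illustration.

```python
import math

def hamming_distance(a, b):
    """Count the positions at which two equal-length vectors differ."""
    return sum(x != y for x, y in zip(a, b))

def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def best_matching_unit(doc, units, distance):
    """Return the index of the unit closest to doc under the given distance."""
    return min(range(len(units)), key=lambda i: distance(doc, units[i]))

# Hypothetical data: each position marks whether a term occurs in a conversation.
# Some positions could stand for context information terms (an assumption).
docs = [
    [1, 1, 0, 0],
    [0, 0, 1, 1],
]
units = [
    [1, 0, 0, 0],
    [0, 1, 1, 1],
]

# Assign each document to its closest unit under each distance measure.
for measure in (hamming_distance, euclidean_distance):
    assignments = [best_matching_unit(d, units, measure) for d in docs]
    print(measure.__name__, assignments)
```

Passing the distance as a parameter keeps the assignment step unchanged while the measure varies, which is one way such a comparison of distance measures could be organized.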