Semi-Supervised Semantic Dynamic Text Clustering Algorithm

QIAN Zhi-sen,HUANG Rui-zhang,WEI Qin,QIN Yong-bin,CHEN Yan-ping
DOI: https://doi.org/10.3969/j.issn.1001-0548.2019.06.001
2019-01-01
Abstract:In the traditional dynamic text clustering, the similar texts with different descriptions are divided into different groups; and the difference between the number of cluster categories and the number of real categories is obvious. Aiming at these problems, this paper proposes a semi-supervised semantic dynamic text clustering algorithm (SDCS). The algorithm captures the semantic relationship between texts by semantically representing the text, and dynamically learns the category semantics during the clustering process, so that the text can be accurately clustered according to semantics. At the same time, the algorithm uses the semi-supervised clustering algorithm to supervise the generation of new classes, and produces clustering results that are consistent with the actual situation. The experimental results show that the proposed algorithm is effective and feasible.
What problem does this paper attempt to address?