Researching on A Method of Topic Detection Based on Word Correlation Graph

Wenxuan ZHOU,Zengzhuang XU,Yu HONG,Guodong ZHOU
DOI: https://doi.org/10.13451/j.cnki.shanxi.univ(nat.sci.).2018.01.005
2018-01-01
Abstract:Online news topic detection aims to grouping news that discusses about the same topic automatically from large-scale online news.Since the type and scale of topics are not pre-defined and there is not any prior knowledge available,the existing researches often use the clustering algorithm to realize the automatic detection of topics.However,clustering algorithm is weak at distinguishing on similar news which actually belongs to different topics.In order to solve the problem above,we propose a text merging algorithm based on the structure of "Community".The method goes into the specific cluster and performs a second division on them.In general,the method combines text content and social network as characteristic information to determine degree of the internal and external relevance of the topic,and form a joint judge ment model.The results show that the proposed method can improve the purity by 11% and reduce the entropy of clustering results by 0.258.
What problem does this paper attempt to address?