Hot topic detection based on combined content and time similarity

Yi Zhao, Kun Zhang, Hong Zhang, Xia Yan, Ying Cai
DOI: https://doi.org/10.1109/PIC.2017.8359580
2017-01-01
Abstract: Hot topic detection is a long-standing research area with many real-world applications. Most previous work, however, considers only the textual content of a news item and ignores its other attributes, such as publication time, which also carries information about the topic being described. Other approaches rely on a single text similarity measure, each of which has its own drawbacks. To address these problems, we propose a topic detection algorithm that accounts for the difference in information between the title and the body text, combines several text similarity measures, and merges text similarity with time similarity. We evaluate the combined text similarity measures and compare several time similarity equations. We then compute the combined similarity with three different models: a linear model, a quadratic polynomial model, and a neural network model. Finally, we present and analyze the experimental results.
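The linear variant of the combined similarity can be illustrated with a minimal sketch. The abstract does not give the paper's equations, so everything specific here is an assumption made for illustration: bag-of-words cosine as the text measure, exponential decay as the time similarity, and the names and default values of `title_weight`, `content_weight`, and `scale`.

```python
import math
from collections import Counter


def cosine(a: str, b: str) -> float:
    """Cosine similarity between whitespace-tokenized bag-of-words vectors."""
    va, vb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(va[w] * vb[w] for w in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def time_similarity(t1_hours: float, t2_hours: float, scale: float = 24.0) -> float:
    """Assumed form: exponential decay in the publication-time gap."""
    return math.exp(-abs(t1_hours - t2_hours) / scale)


def combined_similarity(doc_a: dict, doc_b: dict,
                        title_weight: float = 0.6,
                        content_weight: float = 0.7) -> float:
    """Linear combination of text and time similarity (hypothetical weights)."""
    # Title and body are scored separately, then mixed, so the title can
    # be weighted differently from the body text.
    text_sim = (title_weight * cosine(doc_a["title"], doc_b["title"])
                + (1.0 - title_weight) * cosine(doc_a["body"], doc_b["body"]))
    time_sim = time_similarity(doc_a["hours"], doc_b["hours"])
    return content_weight * text_sim + (1.0 - content_weight) * time_sim


if __name__ == "__main__":
    a = {"title": "city flood warning", "body": "heavy rain floods the city", "hours": 0.0}
    b = {"title": "flood warning issued", "body": "rain floods streets in the city", "hours": 6.0}
    print(round(combined_similarity(a, b), 3))
```

With these assumed forms, two reports with the same wording score lower the further apart they were published. The quadratic polynomial and neural network models mentioned in the abstract would replace the final linear mix.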