Internet Popular Topics Extraction of Traffic Content Words Correlation

Yadong Zhou,Qindong Sun,Xiaohong Guan,Wei Li,Jing Tao
DOI: https://doi.org/10.3321/j.issn:0253-987x.2007.10.004
2007-01-01
Abstract:Aiming at the requirements of network public feeling analysis, the formal definition and description of the popular topic on Internet is presented, the relationship between hot words and popular topics is analyzed, and finally a hotpoint words correlation computing approach for extracting popular topics on Internet is introduced in traffic contents. Based on that, DBSCAN (Density-Based Spatical Clustering of Application with Noise) clustering algorithm is adopted to extract popular topics and formalized results are given. The test results show that this method has an availability of 16.7% in extracting Internet popular topics, which, compared to web mining and TDT (Topic Detection and Tracking), can provide a more suitable data source for effective recovery of Internet public opinions.
What problem does this paper attempt to address?