Describing Web Topics Meticulously through Word Graph Analysis

Bai Sun, Lei Shi, Liang Kong, Yan Zhang
DOI: https://doi.org/10.1109/CIT.2009.55
2009-01-01
Abstract: Topic description is as important as topic detection. In this paper, we propose a novel method to describe Web topics with topic words. Under the assumption that representative words appear in important sentences and have a high probability of co-occurring with other representative words, two graphs are built: one represents the relationships among sentences, the other the relationships among words. Because a topic cluster contains a set of different Web pages, sentence clusters are also introduced. Experimental results on a real data set show that our method performs well in both precision and efficiency, especially when real Web data contain a large amount of noise.