A Novel Method For Clustering Web Search Results With Wikipedia Disambiguation Pages

Zhi Huang,Zhendong Niu,Donglei Liu,Wenjuan Niu,Wei Wang
DOI: https://doi.org/10.1007/978-3-319-22324-7_1
2015-01-01
Abstract:Organizing search results of an ambiguous query into topics can facilitate information search on the Web. In this paper, we propose a novel method to cluster search results of ambiguous query into topics about the query constructed from Wikipedia disambiguation pages (WDP). To improve the clustering result, we propose a concept filtering method to filter semantically unrelated concepts in each topic. Also, we propose the top K full relations (TKFR) algorithm to assign search results to relevant topics based on the similarities between concepts in the results and topics. Comparing with the clustering methods whose topic labels are extracted from search results, the topics of WDP which are edited by human are much more helpful for navigation. The experiment results show that our method can work for ambiguous queries with different query lengths and highly improves the clustering result of method using WDP.
What problem does this paper attempt to address?