Categorizing and Ranking Search Engine's Results by Semantic Similarity

Tianyong Hao,Zhi Lu,Shitong Wang,Tiansong Zou,Shenhua Gu,Liu Wenyin
DOI: https://doi.org/10.1145/1352793.1352854
2008-01-01
Abstract:An automatic method for text categorizing and ranking search engine's results by semantic similarity is proposed in this paper. We first obtain nouns and verbs from snippets obtained from search engine using Name Entity Recognition and part-of speech. A semantic similarity algorithm based on WordNet is proposed to calculate the similarity of each snippet to each of the pre-defined categories. A balanced similarity ranking method combined with Google's rank and timeliness of the pages is proposed to rank these snippets. Preliminary experiments with 500 labeled questions from TREC03 show that 72.7% are correctly categorized.
What problem does this paper attempt to address?