Exploiting Multiple Resources For Word-Phrase Semantic Similarity Evaluation

Xiaoqiang Jin,Chengjie Sun,Lei Lin,Xiaolong Wang
DOI: https://doi.org/10.1007/978-3-319-12277-9_5
2014-01-01
Abstract:Previous researches on semantic similarity calculating have been mainly focused on documents, sentences or concepts. In this paper, we study the semantic similarity of words and compositional phrases. The task is to judge the semantic similarity of a word and a short sequence of words. Based on structured resource (WordNet), semi-structured resource (Wikipedia) and unstructured resource (Web), this paper extracts rich effective features to represent the word-phrase pair. The task can be treated as a binary classification problem and we employ Support Vector Machine to estimate whether the word and phrase is similar given a word-phrase pair. Experiments are conducted on SemEval 2013 Task5a. Our method achieves 82.9% in accuracy, and outperforms the best system (80.3%) that participates in the task. Experimental results demonstrate the effectiveness of our proposed approach.
What problem does this paper attempt to address?