Computing Semantic Relatedness Using Structured Information of Wikipedia

WANG Rui-qin,KONG Fan-sheng
DOI: https://doi.org/10.3785/j.issn.1008-973x.2009.02.022
2009-01-01
Abstract:A novel semantic relatedness measurement technique based on the link information of Wikipedia was presented.Comparing with WordNet repository,Wikipedia has wider range,more comprehensive knowledge and faster update speed,which makes it become an ideal resource in semantic management.Unlike other Wikipedia based semantic relatedness computing approaches,the new technique uses only Wikipedia's link structures rather than its full text content,which avoids from burdensome text processing.During the process of relatedness computation,the positive effects of incoming links and outcoming links were taken into account,meanwhile the link number adjustment factor was considered to eliminate the bias.Using several widely used test sets of manual defined measures of semantic relatedness as bench-mark,the proposed method resulted in substantial improvement in the correlation of computed relatedness score with the human judgments comparing with the previous WordNet-based methods and other Wikipedia-based methods.
What problem does this paper attempt to address?