Extracting Domain-Relevant Term Using Wikipedia Based on Random Walk Model

Wenjuan Wu,Tao Liu,He Hu,Xiaoyong Du
DOI: https://doi.org/10.1109/chinagrid.2012.20
2012-01-01
Abstract:In this paper we present a new approach for the automatic identification of domain-relevant concepts and entities of a given domain using the category and page structures of the Wikipedia in a language independent way. By applying Markov random walk algorithm on the weighted Wikipedia link graph, our approach can identify large quantities of domain-relevant concepts and entities with very little human effort. Experimental results show that our method achieves high accuracy and acceptable efficiency in domain-relevant term extraction.
What problem does this paper attempt to address?