Automatic Domain Terminology Extraction Using Graph Mutual Reinforcement

Jingjing Kang,Xiaoyong Du,Tao Liu,He Hu
DOI: https://doi.org/10.1007/978-3-642-14246-8_63
2010-01-01
Abstract:Information Extraction (IE) aims at mining knowledge from unstructured data. Terminology extraction is one of crucial subtasks in IE. In this paper, we propose a novel approach of domain terminology extraction based on ranking, according to linkage of authors, papers and conferences in domain proceedings. Candidate terms are extracted by statistical methods and then ranked by the values of importance derived from mutual reinforcement result in the author-paper-conference graph. Furthermore, we integrate our approach with several classical termhood-based methods including C-value and inverse document frequency. The presented approach does not require any training data, and can be extended to other domains. Experimental results show that our approach outperforms several competitive methods.
What problem does this paper attempt to address?