Chinese Webpage Keyword Extraction Based on Semantics Extension Model

WANG Yang,SHUAI Jian-mei
DOI: https://doi.org/10.3969/j.issn.1000-3428.2012.22.040
2012-01-01
Abstract:This paper presents a Chinese Webpage keyword extraction algorithm based on word extension model.It creates an evaluation function to transform term-document matrix by scoring candidate keyword based on its Web structure,part-of-speech,length,TF-IDF value,and uses the word extension model to extend the candidate keywords into key phrases which is based on the n-gram language model.Experimental results show that the proposed algorithm has better performance compared with the traditional keyword extraction algorithms.
What problem does this paper attempt to address?