Automatic Keyword Extraction Based on Phrase Network

Li Guangyi,Wang Houfeng
2014-01-01
Abstract:Keyword extraction is an important task for Information Retrieval. For the task of keyword extraction on Chinese theses, this paper presents a ranking method based on phrase network. First, extract keyword candidates by DF-AV. Second, build phrase network based on the abstract of the thesis and extract keywords by TextRank. Finally, improve keyword extraction with Perceptron reranking. Our experiments on various categories of theses prove our method effective. The top 5, top 10, top 15 F values on test data are 27.96%、27.22%、24.07% respectively.
What problem does this paper attempt to address?