Graph Based Collaborative Extraction Method for Keywords and Summary from Documents

余秦勇,黄少滨,毛湘科
DOI: https://doi.org/10.11896/jsjkx.200900082
2021-01-01
Computer Science
Abstract:The purpose of keywords extraction and summary extraction is to select key content from the original document to express the main meaning of the original document.The evaluation of keywords and summarization quality mainly depends on whether it can cover the main topics of the document.In the existing methods of keywords extraction and summary extraction based on graph models,it rarely involves the task of keywords extraction and summary extraction collaboratively.The article proposes a method based on a graph model for simultaneous keywords extraction and summary extraction.The method first uses the six relationships among words,topics,and sentences in the document,including words-words,topics-topics,sentences-sentences,words-topics,topics-sentences,words-sentences,to construct the graph;then uses the statistical characteristics of the words and sentences in the document to evaluate the prior importance of each vertex in the graph;next,it uses an iterative way to score words and sentences;finally,we get the final keywords and summary based on the scores of words and sentences.In order to verify the effectiveness of the proposed method,keywords extraction and summary extraction experiments are carried out on Chinese and English datasets.It is found that the proposed method achievs good results in both keywords extraction and summary extraction tasks.
What problem does this paper attempt to address?