Automatic Abstracting System Based on Improved LexRank Algorithm

纪文倩,李舟军,巢文涵,陈小明
DOI: https://doi.org/10.3969/j.issn.1002-137X.2010.05.036
2010-01-01
Computer Science
Abstract: Automatic abstracting has long been a key research topic in computational linguistics, and the study and application of automatic summarization have attracted wide attention from related disciplines such as computer science, linguistics, and informatics. This article first describes how the LexRank algorithm works in automatic summarization, then improves the method in three respects: sentence similarity computation, sentence weight computation, and redundancy resolution. The influence factors can be adjusted dynamically according to the content of the documents. The system described here handles both single-document and multi-document summarization in English and Chinese. In evaluations on two corpora, our methods produce somewhat better summaries than the original LexRank algorithm. We also show that the system is quite insensitive to noise in the data that may result from imperfect topical clustering of documents. Finally, existing problems and development trends in automatic summarization technology are discussed.
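
For orientation, the three stages named in the abstract (sentence similarity, sentence weighting, redundancy resolution) can be illustrated with a minimal sketch of the baseline, thresholded LexRank idea. This is not the authors' improved method: the tokenizer, the smoothed idf, the threshold and damping values, and the greedy redundancy filter are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
from collections import Counter


def tokenize(sentence):
    # Naive whitespace tokenizer (an assumption); Chinese text would need word segmentation.
    words = (w.strip(".,;:!?").lower() for w in sentence.split())
    return [w for w in words if w]


def tfidf_vectors(sentences):
    # Treat each sentence as a document; smoothed idf is an assumption.
    docs = [Counter(tokenize(s)) for s in sentences]
    n = len(docs)
    df = Counter(w for d in docs for w in d)
    idf = {w: math.log(n / df[w]) + 1.0 for w in df}
    return [{w: tf * idf[w] for w, tf in d.items()} for d in docs]


def cosine(a, b):
    dot = sum(v * b.get(w, 0.0) for w, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def lexrank(sentences, threshold=0.1, damping=0.85, iters=100, tol=1e-8):
    """Return (centrality scores, similarity matrix) for the sentences."""
    n = len(sentences)
    if n == 0:
        return [], []
    vecs = tfidf_vectors(sentences)
    # Stage 1: pairwise sentence similarity.
    sim = [[cosine(vecs[i], vecs[j]) for j in range(n)] for i in range(n)]
    # Stage 2: sentence weights from a thresholded, row-normalized graph.
    rows = []
    for i in range(n):
        row = [1.0 if sim[i][j] > threshold else 0.0 for j in range(n)]
        deg = sum(row)
        rows.append([x / deg for x in row] if deg else [1.0 / n] * n)
    scores = [1.0 / n] * n
    for _ in range(iters):  # power iteration with damping
        new = [
            (1 - damping) / n
            + damping * sum(scores[i] * rows[i][j] for i in range(n))
            for j in range(n)
        ]
        delta = sum(abs(a - b) for a, b in zip(new, scores))
        scores = new
        if delta < tol:
            break
    return scores, sim


def summarize(sentences, k=2, redundancy=0.5):
    # Stage 3: greedy redundancy filter (an assumption, not the paper's method):
    # skip a sentence if it is too similar to one already selected.
    scores, sim = lexrank(sentences)
    order = sorted(range(len(sentences)), key=lambda i: -scores[i])
    chosen = []
    for i in order:
        if len(chosen) == k:
            break
        if all(sim[i][j] < redundancy for j in chosen):
            chosen.append(i)
    return [sentences[i] for i in sorted(chosen)]
```

Calling `summarize(sentences, k=2)` on a list of sentence strings returns at most two sentences in their original order. The paper's contribution lies in replacing each of these three stages with an improved, dynamically weighted variant, which this sketch does not attempt to reproduce.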