LDA-Based Topic Formation and Topic-Sentence Reinforcement for Graph-Based Multi-document Summarization

Dehong Gao,Wenjie Li,You Ouyang,Renxian Zhang
DOI: https://doi.org/10.1007/978-3-642-35341-3_33
2012-01-01
Abstract:In recent years graph-based ranking algorithms have attracted much attention in document summarization. This paper introduces our recent work on applying a topic model, namely LDA, in graph-based summarization. In the proposed approach, LDA is used to automatically identify a set of semantic topics from the documents to be summarized. The identified topics are then used to construct a bipartite graph to represent the documents. Topic-sentence reinforcement is implemented to calculate the salience scores of topics and sentences simultaneously. By incorporating the information embedded in the topics, the sentence ranking result can be improved. Experiments are conducted on the DUC 2004 data set to evaluate the effectiveness of the proposed approach.
What problem does this paper attempt to address?