Using a Double Clustering Approach to Build Extractive Multi-document Summaries

Sara Botelho Silveira,António Branco
DOI: https://doi.org/10.1007/978-3-642-32790-2_36
2012-01-01
Abstract:This paper presents a method for extractive multi-document summarization that explores a two-phase clustering approach. First, sentences are clustered by similarity, and one sentence per cluster is selected, to reduce redundancy. Then, in order to group them according to topics, those sentences are clustered considering the collection of keywords that represent the topics in the set of texts. Evaluation reveals that the approach pursued produces highly informative summaries, containing many relevant data and no repeated information.
What problem does this paper attempt to address?