Abstract: Multi-document summarization aims to create a condensed summary while retaining the main characteristics of the original set of documents. In this setting, sentence ranking has been the central concern. Since documents often cover a number of topic themes, each represented by a cluster of highly related sentences, sentence clustering has been explored in the literature as a way to produce more informative summaries. For each topic theme, the ranking of terms conditional on that theme should be distinctive and quite different from the ranking of terms in other themes. Existing cluster-based summarization approaches apply clustering and ranking in isolation, which leads to incomplete, or sometimes rather biased, analytical results. A recently proposed framework uses sentence clustering results to improve or refine the sentence ranking results. Under this framework, we propose a novel approach that directly generates clusters integrated with ranking. The basic idea is that the ranking distributions of sentences in different clusters should differ markedly from one another, so they can serve as cluster features from which new clustering measures for sentences are calculated. Meanwhile, better clustering results lead to better ranking results. As a result, ranking and clustering mutually and simultaneously update each other, so that the performance of both improves. The effectiveness of the proposed approach is demonstrated by both the cluster quality analysis and the summarization evaluation conducted on the DUC 2004–2007 datasets.
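The mutual-update idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a precomputed sentence similarity matrix `sim`, uses a simple similarity-mass score as a stand-in for the conditional ranking, and reassigns each sentence to the cluster in which it ranks highest. The function names and parameters are hypothetical.

```python
import numpy as np


def rank_within_cluster(sim, members):
    """Rank distribution over all sentences, conditional on one cluster.

    Assumption: a sentence's score is its total similarity to the
    cluster's current members (a stand-in for a real ranking function).
    """
    scores = sim[:, members].sum(axis=1)
    total = scores.sum()
    if total > 0:
        return scores / total
    return np.full(sim.shape[0], 1.0 / sim.shape[0])


def cluster_with_ranking(sim, k, n_iter=20, seed=0):
    """Alternate between ranking per cluster and re-clustering by rank."""
    n = sim.shape[0]
    rng = np.random.default_rng(seed)
    labels = rng.integers(0, k, size=n)  # random initial clusters
    ranks = np.zeros((k, n))
    for _ in range(n_iter):
        # Ranking step: one conditional rank distribution per cluster.
        for c in range(k):
            members = np.flatnonzero(labels == c)
            if members.size == 0:
                # Reseed an empty cluster with one random sentence.
                members = rng.integers(0, n, size=1)
            ranks[c] = rank_within_cluster(sim, members)
        # Clustering step: each sentence joins the cluster in which
        # its conditional rank is highest.
        new_labels = ranks.argmax(axis=0)
        if np.array_equal(new_labels, labels):
            break
        labels = new_labels
    return labels, ranks
```

Under these assumptions, a summary could then be assembled by taking the top-ranked sentence of each cluster, for example `ranks[c].argmax()` for cluster `c`.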

A spectral analysis approach to document summarization: Clustering and ranking sentences simultaneously.

Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization

Simultaneous Ranking and Clustering of Sentences: A Reinforcement Approach to Multi-Document Summarization.

Enhancing sentence-level clustering with ranking-based clustering framework for theme-based summarization

Enhancing diversity and coverage of document summaries through subspace clustering and clustering-based optimization

Exploring hypergraph-based semi-supervised ranking for query-oriented summarization

Enhancing sentence-level clustering with integrated and interactive frameworks for theme-based summarization

Combining co-clustering with noise detection for theme-based summarization

Semi-supervised co-clustering for query-oriented theme-based summarization

Simultaneous Clustering and Noise Detection for Theme-based Summarization.

An Approach To Automatic Summarization For Chinese Text Based On The Combination Of Spectral Clustering And Lexrank

A Context-Sensitive Manifold Ranking Approach To Query-Focused Multi-Document Summarization

Multi-document Summarization via LDA and Density Peaks Based Sentence-Level Clustering

A Novel Chinese Multi-Document Summarization Using Clustering Based Sentence Extraction

Clustering Sentences with Density Peaks for Multi-document Summarization

Sentences clustering based automatic summarization

Multi-document summarization using cluster-based link analysis.

A Supervised Aggregation Framework for Multi-Document Summarization.

Multi-document Summarization Via Sentence-Level Semantic Analysis and Symmetric Matrix Factorization

Applying Two-Level Reinforcement Ranking in Query-Oriented Multidocument Summarization

Density Peaks Clustering Based Integrate Framework for Multi-Document Summarization.