Abstract:Topic-focused multi-document summarization aims to produce a summary over a set of documents and conveys the most important aspects of a given topic. Most existing extractive methods view the task as a multi-criteria ranking problem over sentences, where relevance, salience and diversity are three typical requirements. However, diversity is a challenging problem as it involves modeling the relationship between sentences during ranking, where traditional methods usually tackle it in a heuristic or implicit way. In this paper, we propose a novel relational learning-to-rank approach (R-LTR) to solve this problem. Relational learning-to-rank is a new learning framework which further incorporates relationships into traditional learning-to-rank in an elegant way. Specifically, the ranking function is defined as the combination of content-based score of individual sentence, and relation-based score between the current sentence and those already selected. On this basis, we propose to learn the ranking function by minimizing the likelihood loss based on Plackett-Luce model, which can naturally model the sequential ranking procedure of candidate sentences. Stochastic gradient descent is then employed to conduct the learning process, and the summary is predicted by the greedy selection procedure based on the learned ranking function. Finally, we conduct extensive experiments on benchmark data sets TAC2008 and TAC2009. Experimental results show that our approach can significantly outperform the state-of-the-art methods from both quantitative and qualitative aspects.

SentTopic-MultiRank: a Novel Ranking Model for Multi-Document Summarization.

Query-focused Multi-document Summarization: Combining a Novel Topic Model with Graph-based Semi-supervised Learning

Exploring hypergraph-based semi-supervised ranking for query-oriented summarization

A Novel Feature-based Bayesian Model for Query Focused Multi-document Summarization

Generic multi-document summarization using topic-oriented information

A Novel Relational Learning-To-Rank Approach For Topic-Focused Multi-Document Summarization

Exploring simultaneous keyword and key sentence extraction: improve graph-based ranking using wikipedia.

Exploring Simultaneous Keyword and Key Sentence Extraction

Double-Hypergraph Based Sentence Ranking for Query-Focused Multi-document Summarizaton

Manifold-Ranking Based Topic-Focused Multi-Document Summarization

Unsupervised Summarization by Jointly Extracting Sentences and Keywords

A Topic-sensitive Extractive Method for Multi-document Summarization

A Supervised Aggregation Framework for Multi-Document Summarization.

A comparative study on ranking and selection strategies for multi-document summarization

Subtopic-Based Multimodality Ranking for Topic-Focused Multidocument Summarization.

A Novel Biased Diversity Ranking Model for Query-Oriented Multi-Document Summarization

Query-focused Multi-Document Summarization: Combining a Topic Model with Graph-based Semi-supervised Learning.

Automatic Topic-oriented Multi-document Summarization with Combination of Query-dependent and Query-independent Rankers

Topic Analysis for Topic-Focused Multi-Document Summarization

RelationListwise for Query-Focused Multi-Document Summarization.

SRRank: Leveraging Semantic Roles for Extractive Multi-Document Summarization