Abstract:Automatic summarization plays an important role in the exponential document growth on the Web. On content websites such as <a class="link-external link-http" href="http://CNN.com" rel="external noopener nofollow">this http URL</a> and <a class="link-external link-http" href="http://WikiHow.com" rel="external noopener nofollow">this http URL</a>, there often exist various kinds of side information along with the main document for attention attraction and easier understanding, such as videos, images, and queries. Such information can be used for better summarization, as they often explicitly or implicitly mention the essence of the article. However, most of the existing side-aware summarization methods are designed to incorporate either single-modal or multi-modal side information, and cannot effectively adapt to each other. In this paper, we propose a general summarization framework, which can flexibly incorporate various modalities of side information. The main challenges in designing a flexible summarization model with side information include: (1) the side information can be in textual or visual format, and the model needs to align and unify it with the document into the same semantic space, (2) the side inputs can contain information from various aspects, and the model should recognize the aspects useful for summarization. To address these two challenges, we first propose a unified topic encoder, which jointly discovers latent topics from the document and various kinds of side information. The learned topics flexibly bridge and guide the information flow between multiple inputs in a graph encoder through a topic-aware interaction. We secondly propose a triplet contrastive learning mechanism to align the single-modal or multi-modal information into a unified semantic space, where the summary quality is enhanced by better understanding the document and side information. Results show that our model significantly surpasses strong baselines on three public single-modal or multi-modal benchmark summarization datasets.

A Supervised Aggregation Framework for Multi-Document Summarization.

Exploring hypergraph-based semi-supervised ranking for query-oriented summarization

Query-focused Multi-document Summarization: Combining a Novel Topic Model with Graph-based Semi-supervised Learning

Automatic Document Summarization Via Deep Neural Networks

Towards A Unified Approach Based On Affinity Graph To Various Multi-Document Summarizations

Manifold-Ranking Based Topic-Focused Multi-Document Summarization

SentTopic-MultiRank: a Novel Ranking Model for Multi-Document Summarization.

Towards a Unified Approach to Simultaneous Single-Document and Multi-Document Summarizations

Automatic Topic-oriented Multi-document Summarization with Combination of Query-dependent and Query-independent Rankers

HyperSum: hypergraph based semi-supervised sentence ranking for query-oriented summarization.

A Topic-aware Summarization Framework with Different Modal Side Information

Automatic multi-document summarization based on new sentence similarity measures

A New Approach for Multi-Document Update Summarization

Large-Scale Multi-Document Summarization with Information Extraction and Compression

CollabSum: exploiting multiple document clustering for collaborative single document summarizations.

Query-oriented unsupervised multi-document summarization via deep learning model

A Topic-sensitive Extractive Method for Multi-document Summarization

SgSum: Transforming Multi-document Summarization into Sub-graph Selection

A Novel Relational Learning-To-Rank Approach For Topic-Focused Multi-Document Summarization

Multi-Document Summarization Based On Two-Level Sparse Representation Model

Mining Both Commonality and Specificity From Multiple Documents for Multi-Document Summarization