Abstract:Automatic summarization plays an important role in the exponential document growth on the Web. On content websites such as <a class="link-external link-http" href="http://CNN.com" rel="external noopener nofollow">this http URL</a> and <a class="link-external link-http" href="http://WikiHow.com" rel="external noopener nofollow">this http URL</a>, there often exist various kinds of side information along with the main document for attention attraction and easier understanding, such as videos, images, and queries. Such information can be used for better summarization, as they often explicitly or implicitly mention the essence of the article. However, most of the existing side-aware summarization methods are designed to incorporate either single-modal or multi-modal side information, and cannot effectively adapt to each other. In this paper, we propose a general summarization framework, which can flexibly incorporate various modalities of side information. The main challenges in designing a flexible summarization model with side information include: (1) the side information can be in textual or visual format, and the model needs to align and unify it with the document into the same semantic space, (2) the side inputs can contain information from various aspects, and the model should recognize the aspects useful for summarization. To address these two challenges, we first propose a unified topic encoder, which jointly discovers latent topics from the document and various kinds of side information. The learned topics flexibly bridge and guide the information flow between multiple inputs in a graph encoder through a topic-aware interaction. We secondly propose a triplet contrastive learning mechanism to align the single-modal or multi-modal information into a unified semantic space, where the summary quality is enhanced by better understanding the document and side information. Results show that our model significantly surpasses strong baselines on three public single-modal or multi-modal benchmark summarization datasets.

Topic-Aware Modeling for Unsupervised Extractive Summarization

Query-focused Multi-document Summarization: Combining a Novel Topic Model with Graph-based Semi-supervised Learning

An Unsupervised Video Summarization Method Based on Multimodal Representation.

Topic-Aware Abstractive Text Summarization

GATSum: Graph-Based Topic-Aware Abstract Text Summarization

A Topic-sensitive Extractive Method for Multi-document Summarization

A Topic-aware Summarization Framework with Different Modal Side Information

Topic-Centric Unsupervised Multi-Document Summarization of Scientific and News Articles

Topic-Guided Abstractive Text Summarization: a Joint Learning Approach

Towards A Unified Approach Based On Affinity Graph To Various Multi-Document Summarizations

Topic Modeling Based Text Summarization Approach

T-BERTSum: Topic-Aware Text Summarization Based on BERT

Topic Analysis for Topic-Focused Multi-Document Summarization

The THU Summarization Systems at TAC 2010.

Topic Aspect-Oriented Summarization Via Group Selection

A Supervised Aggregation Framework for Multi-Document Summarization.

Query-Focused Summarization by Combining Topic Model and Affinity Propagation

Automatic Labeling of Topic Models Using Text Summaries

A New Sentence Extraction Strategy for Unsupervised Extractive Summarization Methods

AttSum: Joint Learning of Focusing and Summarization with Neural Attention.

Query-focused Multi-Document Summarization: Combining a Topic Model with Graph-based Semi-supervised Learning.