Abstract:Automatic summarization plays an important role in the exponential document growth on the Web. On content websites such as <a class="link-external link-http" href="http://CNN.com" rel="external noopener nofollow">this http URL</a> and <a class="link-external link-http" href="http://WikiHow.com" rel="external noopener nofollow">this http URL</a>, there often exist various kinds of side information along with the main document for attention attraction and easier understanding, such as videos, images, and queries. Such information can be used for better summarization, as they often explicitly or implicitly mention the essence of the article. However, most of the existing side-aware summarization methods are designed to incorporate either single-modal or multi-modal side information, and cannot effectively adapt to each other. In this paper, we propose a general summarization framework, which can flexibly incorporate various modalities of side information. The main challenges in designing a flexible summarization model with side information include: (1) the side information can be in textual or visual format, and the model needs to align and unify it with the document into the same semantic space, (2) the side inputs can contain information from various aspects, and the model should recognize the aspects useful for summarization. To address these two challenges, we first propose a unified topic encoder, which jointly discovers latent topics from the document and various kinds of side information. The learned topics flexibly bridge and guide the information flow between multiple inputs in a graph encoder through a topic-aware interaction. We secondly propose a triplet contrastive learning mechanism to align the single-modal or multi-modal information into a unified semantic space, where the summary quality is enhanced by better understanding the document and side information. Results show that our model significantly surpasses strong baselines on three public single-modal or multi-modal benchmark summarization datasets.

Improving Unsupervised Extractive Summarization with Facet-Aware Modeling

Improving Unsupervised Extractive Summarization by Jointly Modeling Facet and Redundancy

Topic-Aware Modeling for Unsupervised Extractive Summarization

A Novel Feature-based Bayesian Model for Query Focused Multi-document Summarization

Query-focused Multi-document Summarization: Combining a Novel Topic Model with Graph-based Semi-supervised Learning

Exploring hypergraph-based semi-supervised ranking for query-oriented summarization

Improving Sentence Similarity Estimation for Unsupervised Extractive Summarization

An Efficient Coarse-to-Fine Facet-Aware Unsupervised Summarization Framework Based on Semantic Blocks.

Rethinking Scientific Summarization Evaluation: Grounding Explainable Metrics on Facet-aware Benchmark

A spectral analysis approach to document summarization: Clustering and ranking sentences simultaneously.

A Supervised Aggregation Framework for Multi-Document Summarization.

StarSum: A Star Architecture Based Model for Extractive Summarization

SEASum: Syntax-Enriched Abstractive Summarization

Towards A Unified Approach Based On Affinity Graph To Various Multi-Document Summarizations

A Topic-aware Summarization Framework with Different Modal Side Information

A Document-Sensitive Graph Model for Multi-Document Summarization

A Topic-sensitive Extractive Method for Multi-document Summarization

Automatically Extracting Summaries with a Novel Unsupervised Framework

Modeling Endorsement for Multi-Document Abstractive Summarization

SgSum: Transforming Multi-document Summarization into Sub-graph Selection

An Integrated Graph Model For Document Summarization