Abstract: One key challenge in multi-document summarization (MDS) is capturing the relations among input documents, which is what distinguishes MDS from single-document summarization (SDS). Few existing MDS works address this issue. One effective approach is to encode document positional information so that models can capture cross-document relations. However, existing MDS models, such as Transformer-based models, consider only token-level positional information. Moreover, these models fail to capture the linguistic structure of sentences, which inevitably causes confusion in the generated summaries. In this paper, we therefore propose document-aware positional encoding and linguistic-guided encoding, both of which can be fused with the Transformer architecture for MDS. For document-aware positional encoding, we introduce a general protocol to guide the selection of document encoding functions. For linguistic-guided encoding, we propose embedding syntactic dependency relations into a dependency relation mask, using a simple but effective non-linear encoding learner for feature learning. Extensive experiments show that the proposed model generates high-quality summaries.
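
To make the idea of document-aware positional encoding concrete, here is a minimal Python sketch. It is not the authors' method: the abstract does not say which document encoding function the protocol selects or how the two encodings are combined. The sketch assumes the standard Transformer sinusoidal encoding for both the token position and the document index, assumes the two vectors are summed, and assumes token positions run continuously across the concatenated documents. The names `sinusoid` and `document_aware_encoding` are hypothetical.

```python
import math


def sinusoid(pos: int, d_model: int) -> list:
    """Standard Transformer sinusoidal encoding for one integer position."""
    vec = []
    for i in range(d_model):
        angle = pos / (10000 ** (2 * (i // 2) / d_model))
        vec.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return vec


def document_aware_encoding(doc_lengths: list, d_model: int) -> list:
    """Return one encoding vector per token of the concatenated input.

    Assumptions (not taken from the paper):
      * the document encoding function is a sinusoid of the document index;
      * token-level and document-level vectors are combined by addition;
      * token positions are global over the concatenated documents.
    """
    out = []
    token_pos = 0
    for doc_idx, length in enumerate(doc_lengths):
        doc_vec = sinusoid(doc_idx, d_model)  # shared by all tokens of this document
        for _ in range(length):
            tok_vec = sinusoid(token_pos, d_model)
            out.append([t + d for t, d in zip(tok_vec, doc_vec)])
            token_pos += 1
    return out


# Two input documents with 3 and 2 tokens respectively.
enc = document_aware_encoding([3, 2], d_model=8)
```

With this layout, tokens 2 and 3 are adjacent in the input but receive different document components, which is the kind of signal a token-only encoding cannot provide.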

What problem does this paper attempt to address?

This paper attempts to solve two main problems in multi-document summarization:

1. **Capturing cross-document relationships**: In the multi-document summarization (MDS) task, the model must capture the relationships between different documents, a problem that single-document summarization (SDS) does not face. Existing MDS models, such as Transformer-based models, usually consider only word-level position information and ignore document-level position information. This makes it difficult for the model to detect cross-document relationships, which lowers the quality of the summary.
2. **Preserving syntactic structures**: Although existing MDS models explicitly compute the relationships between words through the self-attention mechanism, they lack explicit support for syntactic structures. As a result, the generated summary can contain irrelevant content or deviate from the original meaning. Preserving the syntactic structures of the source documents when generating summaries is therefore also an important issue.

To solve these problems, the authors propose two encoding mechanisms:

- **Document-aware Positional Encoding**: A general protocol guides the selection of document encoding functions, and document-level position information is combined with word-level position information to help the model capture cross-document relationships.
- **Linguistic-guided Encoding**: Dependency relations are embedded into a dependency relation mask, and a simple non-linear encoding learner is used for feature learning, to help the model preserve correct dependency structures and grammatical associations when generating summaries.

Together, these two encoding mechanisms aim to improve the quality of multi-document summarization, making the generated summaries more fluent, informative, and grammatically correct.
Experimental results show that the proposed model outperforms existing baseline models on multiple evaluation metrics.
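
The linguistic-guided encoding described above can be illustrated with a small Python sketch. It is an illustration under stated assumptions, not the paper's implementation: the relation inventory, the toy parse, the `dependency_mask` helper, and the `RelationEncoder` class are all hypothetical, and the encoder's parameters are random rather than trained. The sketch stores a relation id in each (head, dependent) cell of a token-by-token mask, then maps every non-empty cell through an embedding lookup followed by a `tanh` non-linearity.

```python
import math
import random

# Hypothetical relation inventory; id 0 is reserved for "no dependency arc".
RELATIONS = {"none": 0, "nsubj": 1, "obj": 2, "det": 3}


def dependency_mask(n_tokens, arcs):
    """Build an n x n matrix whose (head, dependent) cell holds the relation id."""
    mask = [[RELATIONS["none"]] * n_tokens for _ in range(n_tokens)]
    for head, dep, rel in arcs:
        mask[head][dep] = RELATIONS[rel]
    return mask


class RelationEncoder:
    """Toy non-linear learner: relation id -> embedding -> tanh(linear) -> scalar.

    Parameters are randomly initialised here; in a real model they would be
    learned jointly with the summarizer (an assumption, not from the source).
    """

    def __init__(self, n_relations, dim=4, seed=0):
        rng = random.Random(seed)
        self.emb = [[rng.uniform(-1, 1) for _ in range(dim)]
                    for _ in range(n_relations)]
        self.w = [rng.uniform(-1, 1) for _ in range(dim)]
        self.b = 0.0

    def score(self, rel_id):
        z = sum(w * e for w, e in zip(self.w, self.emb[rel_id])) + self.b
        return math.tanh(z)

    def encode(self, mask):
        # Cells without an arc stay at 0.0, so unrelated token pairs are unaffected.
        return [[self.score(r) if r else 0.0 for r in row] for row in mask]


# Toy parse of "the cat chased mice" as (head index, dependent index, relation).
tokens = ["the", "cat", "chased", "mice"]
arcs = [(2, 1, "nsubj"), (2, 3, "obj"), (1, 0, "det")]
mask = dependency_mask(len(tokens), arcs)
features = RelationEncoder(len(RELATIONS)).encode(mask)
```

How such features are fused with the Transformer is not stated in the summary above; one plausible option, offered only as an assumption, is to add them to the self-attention scores for the corresponding token pairs.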

Document-aware Positional Encoding and Linguistic-guided Encoding for Abstractive Multi-document Summarization
