Abstract:Compared to traditional RNN-based models, abstractive summarization systems based on Pre-trained Language Models (PTMs) achieve dramatic improvements in readability. Thus, in the field of abstractive summarization, more attention should be devoted to the faithfulness issue that predicted summaries are not factually consistent with source texts. To alleviate this disadvantage, we propose a novel Syntax-Enriched Abstractive Summarization (SEASum) framework, which utilizes graph attention networks (GATs) to introduce syntactic features of source texts to generate faithful summaries. In the SEASum framework, the PTM-based semantic encoder encodes word sequence, while the GAT-based syntactic encoder captures explicit syntax, i.e., part-of-speech tags, parse trees, and dependency-based relative positions of source documents. A feature fusion module is introduced to incorporate encoded syntactic features into the summarization framework. Based on the proposed SEASum framework, we develop two summarization models: 1) parallel SEASum model, in which the semantic encoder and syntactic encoder work in parallel, a multi-head attention module fused two-stream features for the following decoding process; 2) cascaded SEASum model, which takes contextual word embeddings from semantic encoder as node embeddings for the syntactic encoder and employs highway networks to regulate information flow. Experimental results on CNN/DailyMail and Reddit-TIFU (short) datasets show our parallel SEASum model and cascaded SEASum model outperform state-of-the-art abstractive summarization approaches in the faithfulness measurement. The results also demonstrate that cascaded SEASum model is more effective than parallel SEASum model in boosting faithfulness.

COVIDSum: A Linguistically Enriched SciBERT-based Summarization Model for COVID-19 Scientific Papers.

Amplifying Scientific Paper's Abstract by Leveraging Data-Weighted Reconstruction

Automatic Text Summarization of COVID-19 Medical Research Articles using BERT and GPT-2

CovSumm: an unsupervised transformer-cum-graph-based hybrid document summarization model for CORD-19

HetTreeSum: A Heterogeneous Tree Structure-based Extractive Summarization Model for Scientific Papers

Enhancing Scientific Papers Summarization with Citation Graph

Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers

Continual BERT: Continual Learning for Adaptive Extractive Summarization of COVID-19 Literature

SEASum: Syntax-Enriched Abstractive Summarization

HITS-based attentional neural model for abstractive summarization

CiteSum: Citation Text-guided Scientific Extreme Summarization and Domain Adaptation with Limited Supervision

Synthesizing Scientific Summaries: An Extractive and Abstractive Approach

uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

SciSummPip: An Unsupervised Scientific Paper Summarization Pipeline

ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks.

Enhancing Abstractive Summarization of Scientific Papers Using Structure Information

Biomedical-domain Pre-Trained Language Model for Extractive Summarization

Scientific document summarization via citation contextualization and scientific discourse

Automated Lay Language Summarization of Biomedical Scientific Reviews

CTRLsum: Towards Generic Controllable Text Summarization