Abstract: Automatic summarization generates concise summaries that contain the key ideas of source documents. As the most mainstream datasets for the news sub-domain, CNN/DailyMail and BBC XSum have been widely used for performance benchmarking. However, the reference summaries of these datasets turn out to be noisy, mainly in terms of factual hallucination and information redundancy. To address this challenge, we first annotate new expert-written Element-aware test sets following the "Lasswell Communication Model" proposed by Lasswell (1948), allowing reference summaries to cover more fine-grained news elements objectively and comprehensively. Using the new test sets, we observe surprisingly strong zero-shot summarization ability in LLMs, which resolves the inconsistency reported in prior work between human preference and automatic evaluation metrics for LLMs' zero-shot summaries. Further, we propose a Summary Chain-of-Thought (SumCoT) technique that elicits LLMs to generate summaries step by step, helping them integrate more fine-grained details of source documents into final summaries that align with the human writing mindset. Experimental results show that our method outperforms state-of-the-art fine-tuned PLMs and zero-shot LLMs by +4.33/+4.77 in ROUGE-L on the two datasets, respectively. Dataset and code are publicly available at https://github.com/Alsace08/SumCoT.
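
The step-by-step idea described in the abstract can be illustrated with a minimal two-stage prompting sketch. This is not the authors' implementation: the guiding questions, the prompt wording, the function names, and the `llm` callable (any function that maps a prompt string to a completion string) are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Callable, Sequence

# Hypothetical guiding questions for fine-grained news elements.
# The abstract does not list the elements; these are illustrative only.
ELEMENT_QUESTIONS: Sequence[str] = (
    "Who are the important entities in this document?",
    "What are the important dates in this document?",
    "What events are happening in this document?",
    "What is the result of these events?",
)


def build_extraction_prompt(article: str,
                            questions: Sequence[str] = ELEMENT_QUESTIONS) -> str:
    """Stage 1 prompt: ask the model to extract the news elements."""
    numbered = "\n".join(f"{i}. {q}" for i, q in enumerate(questions, 1))
    return (
        f"Article: {article}\n\n"
        f"Answer the following questions in order:\n{numbered}"
    )


def build_summary_prompt(article: str, elements: str) -> str:
    """Stage 2 prompt: ask for a summary that integrates the extracted elements."""
    return (
        f"Article: {article}\n\n"
        f"Extracted elements:\n{elements}\n\n"
        "Using the article and the extracted elements above, "
        "write a concise summary."
    )


def summarize_step_by_step(article: str, llm: Callable[[str], str]) -> str:
    """Run both stages: extract elements first, then summarize with them."""
    elements = llm(build_extraction_prompt(article))
    return llm(build_summary_prompt(article, elements))
```

Passing the model in as a plain callable keeps the sketch independent of any particular LLM API; in practice `llm` would wrap whichever client is in use.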

Balancing Lexical and Semantic Quality in Abstractive Summarization

Abstractive Summarization Improved by WordNet-based Extractive Sentences

Dual-Level Contrastive Learning for Improving Conciseness of Summarization

Summary-Sentence Level Hierarchical Supervision for Re-Ranking Model of Two-Stage Abstractive Summarization Framework

Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss

Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting

Monotonic Alignments for Summarization

A New Approach to Overgenerating and Scoring Abstractive Summaries

Improving Sequence-to-Sequence Models for Abstractive Text Summarization Using Meta Heuristic Approaches

Sentence salience contrastive learning for abstractive text summarization

DCDSum: An interpretable extractive summarization framework based on contrastive learning method

Exploring Explainable Selection to Control Abstractive Summarization

A Syntax-Augmented and Headline-Aware Neural Text Summarization Method

RetrievalSum: A Retrieval Enhanced Framework for Abstractive Summarization

AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference

Abstractive text summarization model combining a hierarchical attention mechanism and multiobjective reinforcement learning

AugSumm: towards generalizable speech summarization using synthetic labels from large language model

What Have We Achieved on Text Summarization?

Topic-Guided Abstractive Text Summarization: a Joint Learning Approach

Analysis of Multidomain Abstractive Summarization Using Salience Allocation