Abstract:A proficient summarization model should exhibit both flexibility -- the capacity to handle a range of in-domain summarization tasks, and adaptability -- the competence to acquire new knowledge and adjust to unseen out-of-domain tasks. Unlike large language models (LLMs) that achieve this through parameter scaling, we propose a more parameter-efficient approach in this study. Our motivation rests on the principle that the general summarization ability to capture salient information can be shared across different tasks, while the domain-specific summarization abilities need to be distinct and tailored. Concretely, we propose MoeSumm, a Mixture-of-Expert Summarization architecture, which utilizes a main expert for gaining the general summarization capability and deputy experts that selectively collaborate to meet specific summarization task requirements. We further propose a max-margin loss to stimulate the separation of these abilities. Our model's distinct separation of general and domain-specific summarization abilities grants it with notable flexibility and adaptability, all while maintaining parameter efficiency. MoeSumm achieves flexibility by managing summarization across multiple domains with a single model, utilizing a shared main expert and selected deputy experts. It exhibits adaptability by tailoring deputy experts to cater to out-of-domain few-shot and zero-shot scenarios. Experimental results on 11 datasets show the superiority of our model compared with recent baselines and LLMs. We also provide statistical and visual evidence of the distinct separation of the two abilities in MoeSumm (<a class="link-external link-https" href="https://github.com/iriscxy/MoE_Summ" rel="external noopener nofollow">this https URL</a>).

UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization

UniSumm: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning

UniSumm and SummZoo: Unified Model and Diverse Benchmark for Few-Shot Summarization

UniSumEval: Towards Unified, Fine-Grained, Multi-Dimensional Summarization Evaluation for LLMs

UserSumBench: A Benchmark Framework for Evaluating User Summarization Approaches

InheritSumm: A General, Versatile and Compact Summarizer by Distilling from GPT

Multisumm: Towards A Unified Model For Multi-Lingual Abstractive Summarization

TGSum: Build Tweet Guided Multi-Document Summarization Dataset

UniMS: A Unified Framework for Multimodal Summarization with Knowledge Distillation

Information-Theoretic Distillation for Reference-less Summarization

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

Summ^N: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents

LMGQS: A Large-scale Dataset for Query-focused Summarization

GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization

Flexible and Adaptable Summarization via Expertise Separation

CDEvalSumm: an Empirical Study of Cross-Dataset Evaluation for Neural Summarization Systems

SummScore: A Comprehensive Evaluation Metric for Summary Quality Based on Cross-Encoder

SummEval: Re-evaluating Summarization Evaluation

QTSumm: Query-Focused Summarization over Tabular Data

GUMSum: Multi-Genre Data and Evaluation for English Abstractive Summarization