Abstract:Text summarization is a downstream natural language processing (NLP) task that challenges the understanding and generation capabilities of language models. Considerable progress has been made in automatically summarizing short texts, such as news articles, often leading to satisfactory results. However, summarizing long documents remains a major challenge. This is due to the complex contextual information in the text and the lack of open-source benchmarking datasets and evaluation frameworks that can be used to develop and test model performance. In this work, we use ChatGPT, the latest breakthrough in the field of large language models (LLMs), together with the extractive summarization model C2F-FAR (Coarse-to-Fine Facet-Aware Ranking) to propose a hybrid extraction and summarization pipeline for long documents such as business articles and books. We work with the world-renowned company getAbstract AG and leverage their expertise and experience in professional book summarization. A practical study has shown that machine-generated summaries can perform at least as well as human-written summaries when evaluated using current automated evaluation metrics. However, a closer examination of the texts generated by ChatGPT through human evaluations has shown that there are still critical issues in terms of text coherence, faithfulness, and style. Overall, our results show that the use of ChatGPT is a very promising but not yet mature approach for summarizing long documents and can at best serve as an inspiration for human editors. We anticipate that our work will inform NLP researchers about the extent to which ChatGPT's capabilities for summarizing long documents overlap with practitioners' needs. Further work is needed to test the proposed hybrid summarization pipeline, in particular involving GPT-4, and to propose a new evaluation framework tailored to the task of summarizing long documents.

Chinese Judicial Summarising Based On Short Sentence Extraction And Gpt-2

Abstractive Automatic Summarizing Model for Legal Judgment Documents

Low-Resource Court Judgment Summarization for Common Law Systems

Legal Case Document Summarization: Extractive and Abstractive Methods and their Evaluation

How Ready are Pre-trained Abstractive Models and LLMs for Legal Case Judgement Summarization?

Legal Summarization for Multi-role Debate Dialogue via Controversy Focus Mining and Multi-task Learning

LawSum: A weakly supervised approach for Indian Legal Document Summarization

Applicability of Large Language Models and Generative Models for Legal Case Judgement Summarization

Legal Extractive Summarization of U.S. Court Opinions

Extractive Summarization via ChatGPT for Faithful Summary Generation

How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing

Hybrid Long Document Summarization using C2F-FAR and ChatGPT: A Practical Study

Exploring the Limits of ChatGPT for Query or Aspect-based Text Summarization

Indian Legal Text Summarization: A Text Normalisation-based Approach

AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation

On the Evaluation of Neural Code Summarization

Question-Answering Approach to Evaluating Legal Summaries

CJRC: A Reliable Human-Annotated Benchmark DataSet for Chinese Judicial Reading Comprehension

Unlocking Legal Knowledge: A Multilingual Dataset for Judicial Summarization in Switzerland

LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English

An Efficient Approach to Learning Chinese Judgment Document Similarity Based on Knowledge Summarization