Abstract: Summarisation of research results in plain language is crucial for promoting public understanding of research findings. The use of Natural Language Processing to generate lay summaries has the potential to relieve researchers' workload and bridge the gap between science and society. The aim of this narrative literature review is to describe and compare the different text summarisation approaches used to generate lay summaries. We searched the databases Web of Science, Google Scholar, IEEE Xplore, Association for Computing Machinery Digital Library and arXiv for articles published until 6 May 2022. We included original studies on automatic text summarisation methods to generate lay summaries. We screened 82 articles and included eight relevant papers published between 2020 and 2021, all using the same dataset. The results show that transformer-based methods such as Bidirectional Encoder Representations from Transformers (BERT) and Pre-training with Extracted Gap-sentences for Abstractive Summarization (PEGASUS) dominate the landscape of lay text summarisation, with all but one study using these methods. A combination of extractive and abstractive summarisation methods in a hybrid approach was found to be most effective. Furthermore, pre-processing approaches to input text (e.g. applying extractive summarisation) or determining which sections of a text to include, appear critical. Evaluation metrics such as Recall-Oriented Understudy for Gisting Evaluation (ROUGE) were used, which do not consider readability. To conclude, automatic lay text summarisation is under-explored. Future research should consider long document lay text summarisation, including clinical trial reports, and the development of evaluation metrics that consider readability of the lay summary.

What problem does this paper attempt to address?

The paper examines how well automatic text summarization can generate concise, easy-to-understand summaries of research for non-expert readers. Specifically, it describes and compares the natural language processing (NLP) techniques, especially transformer-based methods, that have been used to generate lay summaries. The review focuses on four core questions:

1. **What NLP techniques have been applied to text summarization for the general public?** The review covers extractive, abstractive, and hybrid methods, and how effective each is at generating lay summaries.
2. **How is the performance of lay summarization models evaluated?** The review discusses the metrics used to assess automatic summaries, such as ROUGE, and points out that these metrics do not consider the readability of the summaries.
3. **Which methods are the most effective?** Hybrid approaches that combine extractive and abstractive methods were found to be most effective. Pre-trained transformer models such as BERT and PEGASUS dominate the field, with all but one of the included studies using them.
4. **What are the main challenges and future research directions?** The review highlights limitations in current research, such as small datasets that may lead to overfitting and the lack of evaluation metrics that consider readability. Future research should explore lay summarization of long documents, including clinical trial reports, and develop evaluation metrics that account for readability.

By addressing these questions, the review aims to support the effective dissemination of research results to the public, helping non-experts understand complex scientific concepts and findings and thereby strengthening public awareness of and support for scientific research.
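
To make the point about ROUGE and readability concrete, here is a minimal sketch of ROUGE-1 recall (unigram overlap) in plain Python. It is an illustration under stated assumptions, not the evaluation code used by any of the reviewed studies: the function names `tokenize` and `rouge1_recall` and the example sentences are hypothetical, and published ROUGE implementations add options such as stemming and also report ROUGE-2 and ROUGE-L.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def rouge1_recall(candidate, reference):
    """Fraction of reference unigrams that also appear in the candidate.

    Counts are clipped, so a word repeated in the candidate cannot be
    credited more often than it occurs in the reference.
    """
    cand_counts = Counter(tokenize(candidate))
    ref_counts = Counter(tokenize(reference))
    total = sum(ref_counts.values())
    if total == 0:
        return 0.0
    overlap = sum(min(n, cand_counts[tok]) for tok, n in ref_counts.items())
    return overlap / total


# Hypothetical reference summary and two candidates (illustrative only).
reference = "the drug lowers blood pressure"
readable = "the drug lowers blood pressure in adults"
scrambled = "pressure blood lowers drug the"

print(rouge1_recall(readable, reference))
print(rouge1_recall(scrambled, reference))
```

Both candidates receive the same ROUGE-1 recall, because unigram overlap ignores word order and sentence structure. This is one simple way to see why a high ROUGE score does not by itself indicate that a lay summary is readable.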

Lay Text Summarisation Using Natural Language Processing: A Narrative Literature Review
