Abstract:BACKGROUND Automatic text summarization (ATS) enables users to retrieve meaningful evidence from big data of biomedical repositories to make complex clinical decisions. Deep neural and recurrent networks outperform traditional machine-learning techniques in areas of natural language processing and computer vision; however, they are yet to be explored in the ATS domain, particularly for medical text summarization. OBJECTIVE Traditional approaches in ATS for biomedical text suffer from fundamental issues such as an inability to capture clinical context, quality of evidence, and purpose-driven selection of passages for the summary. We aimed to circumvent these limitations through achieving precise, succinct, and coherent information extraction from credible published biomedical resources, and to construct a simplified summary containing the most informative content that can offer a review particular to clinical needs. METHODS In our proposed approach, we introduce a novel framework, termed Biomed-Summarizer, that provides quality-aware Patient/Problem, Intervention, Comparison, and Outcome (PICO)-based intelligent and context-enabled summarization of biomedical text. Biomed-Summarizer integrates the prognosis quality recognition model with a clinical context–aware model to locate text sequences in the body of a biomedical article for use in the final summary. First, we developed a deep neural network binary classifier for quality recognition to acquire scientifically sound studies and filter out others. Second, we developed a bidirectional long-short term memory recurrent neural network as a clinical context–aware classifier, which was trained on semantically enriched features generated using a word-embedding tokenizer for identification of meaningful sentences representing PICO text sequences. Third, we calculated the similarity between query and PICO text sequences using Jaccard similarity with semantic enrichments, where the semantic enrichments are obtained using medical ontologies. Last, we generated a representative summary from the high-scoring PICO sequences aggregated by study type, publication credibility, and freshness score. RESULTS Evaluation of the prognosis quality recognition model using a large dataset of biomedical literature related to intracranial aneurysm showed an accuracy of 95.41% (2562/2686) in terms of recognizing quality articles. The clinical context–aware multiclass classifier outperformed the traditional machine-learning algorithms, including support vector machine, gradient boosted tree, linear regression, K-nearest neighbor, and naïve Bayes, by achieving 93% (16127/17341) accuracy for classifying five categories: aim, population, intervention, results, and outcome. The semantic similarity algorithm achieved a significant Pearson correlation coefficient of 0.61 (0-1 scale) on a well-known BIOSSES dataset (with 100 pair sentences) after semantic enrichment, representing an improvement of 8.9% over baseline Jaccard similarity. Finally, we found a highly positive correlation among the evaluations performed by three domain experts concerning different metrics, suggesting that the automated summarization is satisfactory. CONCLUSIONS By employing the proposed method Biomed-Summarizer, high accuracy in ATS was achieved, enabling seamless curation of research evidence from the biomedical literature to use for clinical decision-making.

Methodical Systematic Review of Abstractive Summarization and Natural Language Processing Models for Biomedical Health Informatics: Approaches, Metrics and Challenges

An Overview of Natural Language Processing Models for Abstractive Text Summarization.

Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation (Preprint)

Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review

A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models

Automated Lay Language Summarization of Biomedical Scientific Reviews

A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models

Clinical Context–Aware Biomedical Text Summarization Using Deep Neural Network: Model Development and Validation

Text Summarization Techniques Using Natural Language Processing: A Systematic Literature Review

Improving Biomedical Abstractive Summarisation with Knowledge Aggregation from Citation Papers

Synthesizing Scientific Summaries: An Extractive and Abstractive Approach

A survey of automated methods for biomedical text simplification

A Survey of the State-of-the-Art Models in Neural Abstractive Text Summarization

Abstractive summarization: An overview of the state of the art

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts

Text Summarization in the Biomedical Domain

A Survey on Neural Network-Based Summarization Methods

Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data

Automatic Text Summarization Methods: A Comprehensive Review