Abstract: Summarization is an important natural language processing (NLP) task that identifies key information in text. For conversations, a summarization system needs to extract salient content from spontaneous utterances by multiple speakers. In a special task-oriented scenario, namely medical conversations between patients and doctors, the symptoms, diagnoses, and treatments are especially important, because the purpose of such a conversation is to find a medical solution to the problem raised by the patient. Current online medical platforms provide millions of publicly available conversations between real patients and doctors, in which patients describe their medical problems and registered doctors offer diagnoses and treatments; in most cases, however, a conversation can be too long for its key information to be located easily. Summaries of the patients’ problems and the doctors’ treatments in these conversations can therefore be highly useful, giving other patients with similar problems a precise reference for potential medical solutions. In this paper, we focus on medical conversation summarization, using a dataset of medical conversations and corresponding summaries crawled from a well-known online healthcare service provider in China. We propose a hierarchical encoder-tagger model (HET) that generates summaries by identifying important utterances (with respect to problem proposing and solving) in the conversations. For the particular dataset used in this study, we show that high-quality summaries can be generated by extracting two types of utterances, namely problem statements and treatment recommendations. Experimental results demonstrate that HET outperforms strong baselines and models from previous studies, and that adding conversation-related features further improves system performance.

Summarizing Medical Conversations via Identifying Important Utterances
