Abstract:Smart healthcare systems that make use of abundant health data can improve access to healthcare services, reduce medical costs and provide consistently high-quality patient care. Medical dialogue systems that generate medically appropriate and human-like conversations have been developed using various pre-trained language models and a large-scale medical knowledge base based on Unified Medical Language System (UMLS). However, most of the knowledge-grounded dialogue models only use local structure in the observed triples, which suffer from knowledge graph incompleteness and hence cannot incorporate any information from dialogue history while creating entity embeddings. As a result, the performance of such models decreases significantly. To address this problem, we propose a general method to embed the triples in each graph into large-scalable models and thereby generate clinically correct responses based on the conversation history using the recently recently released MedDialog(EN) dataset. Given a set of triples, we first mask the head entities from the triples overlapping with the patient's utterance and then compute the cross-entropy loss against the triples' respective tail entities while predicting the masked entity. This process results in a representation of the medical concepts from a graph capable of learning contextual information from dialogues, which ultimately aids in leading to the gold response. We also fine-tune the proposed Masked Entity Dialogue (MED) model on smaller corpora which contain dialogues focusing only on the Covid-19 disease named as the Covid Dataset. In addition, since UMLS and other existing medical graphs lack data-specific medical information, we re-curate and perform plausible augmentation of knowledge graphs using our newly created Medical Entity Prediction (MEP) model. Empirical results on the MedDialog(EN) and Covid Dataset demonstrate that our proposed model outperforms the state-of-the-art methods in terms of both automatic and human evaluation metrics.

Knowledge Distillation with Metric Learning for Medical Dialogue Generation

Bidirectional Distillation for Multi-Guidance Medical Dialogue Generation

Data Distillation for Controlling Specificity in Dialogue Generation.

Dynamic Curriculum Learning with Co-training for Medical Dialogue Generation

Combining Curriculum Learning and Knowledge Distillation for Dialogue Generation

PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning

Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation

MHKD-MVQA: Multimodal Hierarchical Knowledge Distillation for Medical Visual Question Answering.

Meta-Learning Adaptive Knowledge Distillation for Efficient Biomedical Natural Language Processing

Knowledge graph assisted end-to-end medical dialog generation

Research on Medical Dialogue Generation of External Knowledge

Knowledge grounded medical dialogue generation using augmented graphs

Distinct but correct: generating diversified and entity-revised medical response

MedChatZH: a Better Medical Adviser Learns from Better Instructions

MSKD: Structured knowledge distillation for efficient medical image segmentation

Medical Dialogue Generation via Dual Flow Modeling

Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation

MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding

DSG-KD: Knowledge Distillation from Domain-Specific to General Language Models

MKA: A Scalable Medical Knowledge Assisted Mechanism for Generative Models on Medical Conversation Tasks

Prompt-based Generative Approach towards Multi-Hierarchical Medical Dialogue State Tracking