RealMedDial: A Real Telemedical Dialogue Dataset Collected from Online Chinese Short-Video Clips.

Bo Xu,Hongtong Zhang,Jian Wang,Xiaokun Zhang,Dezhi Hao,Linlin Zong,Hongfei Lin,Fenglong Ma
2022-01-01
Abstract:Intelligent medical services have attracted great research interests for providing automated medical consultation. However, the lack of corpora becomes a main obstacle to related research, particularly data from real scenarios. In this paper, we construct RealMedDial, a Chinese medical dialogue dataset based on real medical consultation. RealMedDial contains 2,637 medical dialogues and 24,255 utterances obtained from Chinese short-video clips of real medical consultations. We collected and annotated a wide range of meta-data with respect to medical dialogue including doctor profiles, hospital departments, diseases and symptoms for fine-grained analysis on language usage pattern and clinical diagnosis. We evaluate the performance of medical response generation, department routing and doctor recommendation on RealMedDial. Results show that RealMedDial are applicable to a wide range of NLP tasks with respect to medical dialogue.
What problem does this paper attempt to address?