DoctorGPT: A Large Language Model with Chinese Medical Question-Answering Capabilities

Min Wu,Jingyi Liu,Yanjie Li,Wenqiang Li,Meilan Hao,Lina Yu
DOI: https://doi.org/10.1109/HDIS60872.2023.10499472
2023-12-06
Abstract:Large Language Models (LLMs) have made incredible strides recently in understanding and reacting to user intents. However, these models typically excel in English and have not been specifically trained for medical applications, leading to suboptimal performance in responding to medical inquiries such as diagnostic queries and drug recommendations. In this paper, we propose DoctorGPT, a domain-specific large language model tailored for medical question-answering tasks. DoctorGPT leverages the open-source Baichuan2 as its foundational model, undergoes extensive pre-training on medical encyclopedic data to incorporate medical knowledge, and subsequently undergoes fine-tuning on a dataset consisting of two million medical instruction-dialogue pairs to enhance its question-answering capabilities. When compared to general-purpose large models, DoctorGPT demonstrates significant advantages in Chinese medical question-answerinz (O&A) tasks.
Computer Science,Medicine
What problem does this paper attempt to address?