DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task

Honglin Xiong,Sheng Wang,Yitao Zhu,Zihao Zhao,Yuxiao Liu,Linlin Huang,Qian Wang,Dinggang Shen
2023-04-18
Abstract:The recent progress of large language models (LLMs), including ChatGPT and GPT-4, in comprehending and responding to human instructions has been remarkable. Nevertheless, these models typically perform better in English and have not been explicitly trained for the medical domain, resulting in suboptimal precision in diagnoses, drug recommendations, and other medical advice. Additionally, training and deploying a dialogue model is still believed to be impossible for hospitals, hindering the promotion of LLMs. To tackle these challenges, we have collected databases of medical dialogues in Chinese with ChatGPT's help and adopted several techniques to train an easy-deploy LLM. Remarkably, we were able to fine-tune the ChatGLM-6B on a single A100 80G in 13 hours, which means having a healthcare-purpose LLM can be very affordable. DoctorGLM is currently an early-stage engineering attempt and contain various mistakes. We are sharing it with the broader community to invite feedback and suggestions to improve its healthcare-focused capabilities: <a class="link-external link-https" href="https://github.com/xionghonglin/DoctorGLM" rel="external noopener nofollow">this https URL</a>.
Computation and Language
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Improving the accuracy of language models in the medical field**: Existing large language models (such as ChatGPT and GPT-4) perform excellently in natural language processing but have shortcomings in medical diagnosis, drug recommendations, and other medical advice, especially in non-English contexts. 2. **Achieving multilingual support**: Most existing models are primarily trained for English, limiting their application in non-English-speaking countries, particularly in the medical field. 3. **Reducing costs and technical barriers**: Researchers aim to optimize model architecture and training methods so that medical institutions can deploy customized medical dialogue models at lower costs and with simpler technical means. By constructing a database specifically for Chinese medical dialogues and fine-tuning pre-trained models based on this dataset, researchers hope to develop a model—DoctorGLM—that can both understand complex medical knowledge and operate efficiently. This would improve the accuracy of medical advice and expand its application range. Additionally, the paper explores how parameter-efficient fine-tuning techniques and quantization methods can enable the model to run on low-cost hardware, further lowering the technical barriers in the medical field.