Enhancing Healthcare through Large Language Models: A Study on Medical Question Answering

Haoran Yu, Chang Yu, Zihan Wang, Dongxian Zou, Hao Qin
2024-08-08
Abstract: In recent years, the application of Large Language Models (LLMs) in healthcare has shown significant promise in improving the accessibility and dissemination of medical knowledge. This paper presents a detailed study of various LLMs trained on the MedQuAD medical question-answering dataset, with a focus on identifying the most effective model for providing accurate medical information. Among the models tested, the Sentence-t5 combined with Mistral 7B demonstrated superior performance, achieving a precision score of 0.762. This model's enhanced capabilities are attributed to its advanced pretraining techniques, robust architecture, and effective prompt construction methodologies. By leveraging these strengths, the Sentence-t5 + Mistral 7B model excels in understanding and generating precise medical answers. Our findings highlight the potential of integrating sophisticated LLMs in medical contexts to facilitate efficient and accurate medical knowledge retrieval, thus significantly enhancing patient education and support.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper asks how large language models (LLMs) can improve the accuracy and accessibility of medical information, particularly when answering patients' medical questions. It focuses on the medical question-answering task and aims to determine which model configuration is best suited to providing accurate answers. To that end, the researchers trained and evaluated several LLMs on the MedQuAD dataset, including Gemma 2b combined with LoRA, Phi-2, and Sentence-t5 combined with Mistral 7B. Each configuration went through data preprocessing, prompt construction, and fine-tuning to optimize performance. The Sentence-t5 + Mistral 7B configuration performed best, achieving a precision score of 0.762, which indicates that it can understand medical questions and generate precise answers. The study therefore demonstrates the potential of integrating advanced LLMs into medical settings to support efficient and accurate medical knowledge retrieval and, in turn, better patient education and support.
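The summary does not spell out how Sentence-t5 and Mistral 7B are combined. One plausible reading is a retrieve-then-generate pipeline: a sentence encoder finds the stored question closest to the user's query, and the matching answer is placed in the prompt given to the generator. The sketch below illustrates only that idea and is not the authors' implementation. The three question-answer pairs are invented stand-ins for MedQuAD entries, `embed` is a toy bag-of-words substitute for a real sentence encoder, and the prompt wording is hypothetical. No model is called; the output is the prompt string a generator would receive.

```python
import math
from collections import Counter

# Invented examples standing in for MedQuAD question-answer pairs (assumption).
QA_PAIRS = [
    ("What are the symptoms of diabetes?",
     "Common symptoms include increased thirst, frequent urination, and fatigue."),
    ("How is high blood pressure treated?",
     "Treatment often combines lifestyle changes with prescribed medication."),
    ("What causes asthma?",
     "Asthma is linked to genetic and environmental factors such as allergens."),
]


def embed(text):
    """Toy bag-of-words vector; a stand-in for a real sentence encoder."""
    for ch in "?.,":
        text = text.replace(ch, "")
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question, k=1):
    """Return the k stored pairs whose questions are most similar to the query."""
    query_vec = embed(question)
    ranked = sorted(
        QA_PAIRS,
        key=lambda pair: cosine(query_vec, embed(pair[0])),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, k=1):
    """Assemble a generator prompt from retrieved context (hypothetical wording)."""
    context = "\n".join(f"Q: {q}\nA: {a}" for q, a in retrieve(question, k))
    return (
        "Use the reference below to answer the medical question.\n\n"
        f"{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


if __name__ == "__main__":
    print(build_prompt("What symptoms does diabetes cause?"))
```

In a real system, `embed` would be replaced by an actual sentence-embedding model and the resulting prompt would be passed to the fine-tuned generator; both steps are omitted here because the summary does not describe them.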