ThaiNutriChat: development of a Thai large language model-based chatbot for health food services

Thananan Luangaphirom,Lojrutai Jocknoi,Chalermchai Wunchum,Kittitee Chokerungreang,Thitirat Siriborvornratanakul
DOI: https://doi.org/10.1007/s00530-024-01495-6
IF: 3.9
2024-09-30
Multimedia Systems
Abstract:Thailand is facing a shortage of medical personnel to provide health consultations to the public, especially in the area of Non-Communicable Diseases (NCDs), which include diseases related to lifestyle and dietary habits. With advancements in technology, there have been efforts to develop Chatbots to assist in answering questions and providing health-related information. In this work, the ThaiNutriChat Chatbot was developed for health consultation on NCDs, and the effectiveness of the Large Language Model (LLM) was evaluated by two forms of dataset collection, academic journal data from Thai health organizations and a set of 1000 Q&A pairs using the text recognition method via Tesseract OCR and augmenting the dataset from Typologically Diverse Question Answering (TyDi QA). Following that, ThaiNutriChat incorporated the dataset into the LLM refinement process through three main stages: (1) Retrieval Augmented Generation (RAG), (2) Fine-tuning using the Low-Rank Adaptation of Large Language Models (LoRA) technique, and (3) Employing a combination of RAG and fine-tuning. Subsequently, the effectiveness of ThaiNutriChat in all three configurations was assessed by medical experts, and divided into two domains: general knowledge about NCDs and specific knowledge related to NCDs. The performance accuracy in answering questions was compared between ThaiNutriChat and other Thai language models that support the same set of questions. The results showed that ThaiNutriChat has higher accuracy in answering questions related to specific knowledge related to NCDs compared to other Thai language models.
computer science, information systems, theory & methods
What problem does this paper attempt to address?