BiMediX: Bilingual Medical Mixture of Experts LLM

Sara Pieri, Sahal Shaji Mullappilly, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman Khan, Timothy Baldwin, Hisham Cholakkal
DOI: https://doi.org/10.48550/arXiv.2402.13253
2024-02-20
Computation and Language
Abstract: In this paper, we introduce BiMediX, the first bilingual medical mixture of experts LLM designed for seamless interaction in both English and Arabic. Our model facilitates a wide range of medical interactions in English and Arabic, including multi-turn chats to inquire about additional details such as patient symptoms and medical history, multiple-choice question answering, and open-ended question answering. We propose a semi-automated English-to-Arabic translation pipeline with human refinement to ensure high-quality translations. We also introduce a comprehensive evaluation benchmark for Arabic medical LLMs. Furthermore, we introduce BiMed1.3M, an extensive Arabic-English bilingual instruction set covering 1.3 million diverse medical interactions, resulting in over 632 million healthcare specialized tokens for instruction tuning. Our BiMed1.3M dataset includes 250k synthesized multi-turn doctor-patient chats and maintains a 1:2 Arabic-to-English ratio. Our model outperforms state-of-the-art Med42 and Meditron by average absolute gains of 2.5% and 4.1%, respectively, computed across multiple medical evaluation benchmarks in English, while operating at 8-times faster inference. Moreover, our BiMediX outperforms the generic Arabic-English bilingual LLM, Jais-30B, by average absolute gains of 10% on our Arabic medical benchmark and 15% on bilingual evaluations across multiple datasets. Our project page with source code and trained model is available at https://github.com/mbzuai-oryx/BiMediX.
What problem does this paper attempt to address?
The paper addresses the lack of a medical language model that works seamlessly in both English and Arabic. It proposes BiMediX, a bilingual medical mixture of experts LLM, and tackles four related issues:
1. **Support for Bilingual Medical Conversations**: Most existing medical language models support a single language, usually English, and so cannot serve multilingual clinical settings. BiMediX supports multi-turn chats, multiple-choice question answering, and open-ended question answering in both English and Arabic.
2. **High-Quality Translation and Dataset Construction**: The paper proposes a semi-automated English-to-Arabic translation pipeline with human refinement to produce high-quality translated data. With it, the authors build BiMed1.3M, a bilingual instruction set of 1.3 million medical interactions (over 632 million tokens) that includes 250k synthesized multi-turn doctor-patient chats and keeps a 1:2 Arabic-to-English ratio.
3. **Performance Optimization and Evaluation**: The paper adapts the Mixtral model to medical tasks using Parameter-Efficient Finetuning (PEFT). On multiple English medical benchmarks, BiMediX outperforms Med42 and Meditron by average absolute gains of 2.5% and 4.1%, respectively, with 8-times faster inference. It also outperforms Jais-30B by average absolute gains of 10% on the paper's Arabic medical benchmark and 15% on bilingual evaluations.
4. **Support for Resource-Limited Languages**: Arabic has comparatively few resources in the medical domain. The paper narrows this gap by contributing an Arabic medical instruction dataset and a comprehensive evaluation benchmark for Arabic medical LLMs, giving future research a foundation to build on.
In summary, BiMediX targets medical conversation in English and Arabic, with the goal of making medical language models more practical and accurate in bilingual settings.
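The dataset figures quoted in the abstract can be turned into rough per-language and per-interaction numbers. The sketch below is a back-of-envelope calculation only: it assumes the 1:2 Arabic-to-English ratio applies to interaction counts (the abstract does not say whether it is measured in interactions or tokens), and it treats the rounded "1.3 million" and the "over 632 million" lower bound as exact. The function and variable names are illustrative and do not come from the paper.

```python
from fractions import Fraction

# Figures quoted in the abstract (rounded / lower-bound values).
TOTAL_INTERACTIONS = 1_300_000   # "1.3 million diverse medical interactions"
TOTAL_TOKENS = 632_000_000       # "over 632 million" tokens, used as a lower bound
ARABIC_PARTS, ENGLISH_PARTS = 1, 2  # "1:2 Arabic-to-English ratio"


def split_by_ratio(total: int, parts_a: int, parts_b: int) -> tuple[int, int]:
    """Split `total` into two integer shares in the ratio parts_a:parts_b."""
    share_a = Fraction(parts_a, parts_a + parts_b)
    a = round(total * share_a)
    return a, total - a


# Assumption: the ratio is applied to interaction counts, not tokens.
arabic, english = split_by_ratio(TOTAL_INTERACTIONS, ARABIC_PARTS, ENGLISH_PARTS)

# Average tokens per interaction, a lower bound given "over 632 million".
tokens_per_interaction = TOTAL_TOKENS / TOTAL_INTERACTIONS

print(f"Arabic interactions (approx.):  {arabic:,}")
print(f"English interactions (approx.): {english:,}")
print(f"Tokens per interaction (at least): {tokens_per_interaction:.0f}")
```

Under these assumptions, roughly one third of the interactions are Arabic and each interaction averages a little under 500 tokens; the exact split reported by the authors may differ.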