CollectiveSFT: Scaling Large Language Models for Chinese Medical Benchmark with Collective Instructions in Healthcare

Jingwei Zhu,Minghuan Tan,Min Yang,Ruixue Li,Hamid Alinejad-Rokny
2024-09-28
Abstract:The rapid progress in Large Language Models (LLMs) has prompted the creation of numerous benchmarks to evaluate their <a class="link-external link-http" href="http://capabilities.This" rel="external noopener nofollow">this http URL</a> study focuses on the Comprehensive Medical Benchmark in Chinese (CMB), showcasing how dataset diversity and distribution in supervised fine-tuning (SFT) may enhance LLM <a class="link-external link-http" href="http://performance.Remarkably" rel="external noopener nofollow">this http URL</a>, We successfully trained a smaller base model to achieve scores comparable to larger models, indicating that a diverse and well-distributed dataset can optimize performance regardless of model <a class="link-external link-http" href="http://size.This" rel="external noopener nofollow">this http URL</a> study suggests that even smaller models may reach high performance levels with carefully curated and varied datasets. By integrating a wide range of instructional content, our approach addresses potential issues such as data quality inconsistencies. Our results imply that a broader spectrum of training data may enhance a model's ability to generalize and perform effectively across different medical scenarios, highlighting the importance of dataset quality and diversity in fine-tuning processes. We open-source the model for future research at <a class="link-external link-https" href="https://github.com/CAS-SIAT-XinHai/CollectiveSFT" rel="external noopener nofollow">this https URL</a>
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?