Analyzing evaluation methods for large language models in the medical field: a scoping review

Junbok Lee,Sungkyung Park,Jaeyong Shin,Belong Cho
DOI: https://doi.org/10.1186/s12911-024-02709-7
IF: 3.298
2024-12-02
BMC Medical Informatics and Decision Making
Abstract:Owing to the rapid growth in the popularity of Large Language Models (LLMs), various performance evaluation studies have been conducted to confirm their applicability in the medical field. However, there is still no clear framework for evaluating LLMs.
medical informatics
What problem does this paper attempt to address?