Efficiency at Scale: Investigating the Performance of Diminutive Language Models in Clinical Tasks

Niall Taylor,Upamanyu Ghose,Omid Rohanian,Mohammadmahdi Nouriborji,Andrey Kormilitzin,David Clifton,Alejo Nevado-Holgado
2024-02-16
Abstract:The entry of large language models (LLMs) into research and commercial spaces has led to a trend of ever-larger models, with initial promises of generalisability, followed by a widespread desire to downsize and create specialised models without the need for complete fine-tuning, using Parameter Efficient Fine-tuning (PEFT) methods. We present an investigation into the suitability of different PEFT methods to clinical decision-making tasks, across a range of model sizes, including extremely small models with as few as $25$ million parameters.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?