A tutorial on open-source large language models for behavioral science

Zak Hussain,Marcel Binz,Rui Mata,Dirk U. Wulff
DOI: https://doi.org/10.3758/s13428-024-02455-8
IF: 5.953
2024-08-18
Behavior Research Methods
Abstract:Large language models (LLMs) have the potential to revolutionize behavioral science by accelerating and improving the research cycle, from conceptualization to data analysis. Unlike closed-source solutions, open-source frameworks for LLMs can enable transparency, reproducibility, and adherence to data protection standards, which gives them a crucial advantage for use in behavioral science. To help researchers harness the promise of LLMs, this tutorial offers a primer on the open-source Hugging Face ecosystem and demonstrates several applications that advance conceptual and empirical work in behavioral science, including feature extraction, fine-tuning of models for prediction, and generation of behavioral responses. Executable code is made available at github.com/Zak-Hussain/LLM4BeSci.git. Finally, the tutorial discusses challenges faced by research with (open-source) LLMs related to interpretability and safety and offers a perspective on future research at the intersection of language modeling and behavioral science.
psychology, experimental, mathematical
What problem does this paper attempt to address?