Studying large language models as compression algorithms for human culture

Nicholas Buttrick
DOI: https://doi.org/10.1016/j.tics.2024.01.001
IF: 19.9
2024-01-20
Trends in Cognitive Sciences
Abstract:Large language models (LLMs) extract and reproduce the statistical regularities in their training data. Researchers can use these models to study the conceptual relationships encoded in this training data (i.e., the open internet), providing a remarkable opportunity to understand the cultural distinctions embedded within much of recorded human communication.
behavioral sciences,psychology, experimental,neurosciences
What problem does this paper attempt to address?