Abstract: The human brain has long inspired the pursuit of artificial intelligence (AI). Recently, neuroimaging studies have provided compelling evidence of alignment between the computational representations of artificial neural networks (ANNs) and the neural responses of the human brain to stimuli, suggesting that ANNs may employ brain-like information processing strategies. While such alignment has been observed across sensory modalities (visual, auditory, and linguistic), much of the focus has been on the behavior of artificial neurons (ANs) at the population level, leaving the functional organization of individual ANs that facilitates such brain-like processes largely unexplored. In this study, we bridge this gap by directly coupling sub-groups of artificial neurons with functional brain networks (FBNs), the foundational organizational structure of the human brain. Specifically, we extract representative patterns from the temporal responses of ANs in large language models (LLMs), and use them as fixed regressors to construct voxel-wise encoding models that predict brain activity recorded by functional magnetic resonance imaging (fMRI). This framework links the AN sub-groups to FBNs, enabling the delineation of brain-like functional organization within LLMs. Our findings reveal that LLMs (BERT and Llama 1-3) exhibit brain-like functional architecture, with sub-groups of artificial neurons mirroring the organizational patterns of well-established FBNs. Notably, the brain-like functional organization of LLMs evolves with increasing model sophistication and capability, achieving an improved balance between the diversity of computational behaviors and the consistency of functional specializations. This research represents the first exploration of brain-like functional organization within LLMs, offering novel insights to inform the development of artificial general intelligence (AGI) with human brain principles.

What problem does this paper attempt to address?

This paper asks whether large language models (LLMs) contain functional organization similar to that of the human brain. Specifically, the researchers ask whether sub-groups of artificial neurons (ANs) in LLMs exhibit specific functional organization patterns, like the functional brain networks (FBNs) of the human brain, when processing language. By directly coupling sub-groups of ANs in LLMs with FBNs, they aim to reveal whether LLMs adopt information-processing strategies similar to those of the human brain, and thereby to inform the development of artificial general intelligence (AGI) based on principles of the human brain.

To achieve this goal, the researchers adopted the following methods:

1. **Define artificial neurons and their temporal responses**: Define the artificial neurons in LLMs and quantify their temporal responses to input text sequences.
2. **Learn representative temporal response patterns**: Use a sparse representation scheme to learn a set of representative temporal response patterns (the dictionary \( D_{AN} \)) from the temporal responses of a large number of ANs.
3. **Construct voxel-wise encoding models**: Use the atoms of the learned dictionary \( D_{AN} \) as fixed regressors in voxel-wise encoding models that predict brain activity recorded by fMRI.
4. **Infer the relationship between ANs and brain networks**: Analyze the fitted voxel-wise encoding models to infer the relationship between sub-groups of ANs and specific brain networks.

The study found that LLMs (BERT and Llama 1-3) do exhibit brain-like functional architecture, with sub-groups of artificial neurons mirroring the organizational patterns of well-established FBNs. As the sophistication and capability of the models increase, this brain-like functional organization evolves, achieving an improved balance between the diversity of computational behaviors and the consistency of functional specializations.

According to the authors, this study is the first to explore brain-like functional organization within LLMs, offering insights that may inform the future development of AI systems based on brain principles.
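As a rough illustration of the four-step pipeline summarized above, the following sketch runs it end to end on synthetic data. It is not the authors' implementation: the alternating ISTA/least-squares dictionary learner, the ridge penalty in the encoding model, the array sizes, and all function and variable names (`learn_dictionary`, `fit_encoding`, `voxelwise_r2`, `D_an`, and so on) are assumptions chosen for illustration, and the random arrays `X` and `Y` merely stand in for real AN responses and fMRI recordings.

```python
import numpy as np


def soft_threshold(x, lam):
    """Element-wise soft-thresholding, the proximal operator of the L1 norm."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)


def learn_dictionary(X, n_atoms, lam=0.02, n_iter=30, n_ista=20, seed=0):
    """Approximate X (T x N) as D @ A with atoms D (T x K) and sparse codes A (K x N)."""
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((X.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0, keepdims=True)
    A = np.zeros((n_atoms, X.shape[1]))
    for _ in range(n_iter):
        # Sparse coding: ISTA steps on 0.5 * ||X - D A||_F^2 + lam * ||A||_1.
        step = 1.0 / max(np.linalg.norm(D, 2) ** 2, 1e-12)
        for _ in range(n_ista):
            A = soft_threshold(A - step * (D.T @ (D @ A - X)), lam * step)
        # Dictionary update: least-squares fit, then renormalize each atom.
        D = X @ np.linalg.pinv(A)
        norms = np.linalg.norm(D, axis=0)
        norms = np.where(norms > 1e-12, norms, 1.0)  # leave unused atoms as-is
        D = D / norms
        A = A * norms[:, None]  # rescale codes so the product D @ A is unchanged
    return D, A


def fit_encoding(D, Y, alpha=1.0):
    """Ridge regression of fMRI data Y (T x V) on fixed regressors D (T x K)."""
    K = D.shape[1]
    return np.linalg.solve(D.T @ D + alpha * np.eye(K), D.T @ Y)


def voxelwise_r2(D, B, Y):
    """In-sample coefficient of determination for each voxel."""
    ss_res = ((Y - D @ B) ** 2).sum(axis=0)
    ss_tot = ((Y - Y.mean(axis=0)) ** 2).sum(axis=0)
    return 1.0 - ss_res / ss_tot


# Synthetic stand-ins (assumed sizes): T time points, N ANs, K atoms, V voxels.
rng = np.random.default_rng(0)
T, N, K, V = 120, 200, 5, 50
D_true = rng.standard_normal((T, K))
D_true /= np.linalg.norm(D_true, axis=0, keepdims=True)
A_true = np.zeros((K, N))
for j in range(N):
    # Each synthetic AN mixes two temporal patterns with random signs.
    active = rng.choice(K, size=2, replace=False)
    A_true[active, j] = rng.uniform(1.0, 2.0, size=2) * rng.choice([-1.0, 1.0], size=2)
X = D_true @ A_true + 0.01 * rng.standard_normal((T, N))  # steps 1-2 input
Y = D_true @ rng.standard_normal((K, V)) + 0.05 * rng.standard_normal((T, V))

D_an, A = learn_dictionary(X, n_atoms=K)  # step 2: representative patterns
B = fit_encoding(D_an, Y)                 # step 3: voxel-wise encoding models
r2 = voxelwise_r2(D_an, B, Y)

# Step 4: atom k links an AN sub-group (nonzero codes) to a brain map (weights).
k = 0
an_subgroup = np.flatnonzero(np.abs(A[k]) > 1e-8)
brain_map = B[k]
print("ANs loading on atom", k, ":", an_subgroup.size)
print("brain map shape:", brain_map.shape, "mean in-sample R^2:", float(r2.mean()))
```

In this sketch each dictionary atom plays a double role: the nonzero entries in row `k` of `A` identify the ANs that share that temporal pattern, while row `k` of `B` gives the spatial weight map over voxels, which is what would be compared against known FBNs. The R^2 values here are computed on the training data only; a real analysis would evaluate the encoding models on held-out fMRI data.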

Brain-like Functional Organization within Large Language Models

Brain-Like Language Processing via a Shallow Untrained Multihead Attention Network

Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

What Are Large Language Models Mapping to in the Brain? A Case Against Over-Reliance on Brain Scores

Unveiling A Core Linguistic Region in Large Language Models

Do Large Language Models Mirror Cognitive Language Processing?

On the Shape of Brainscores for Large Language Models (LLMs)

Coupling Artificial Neurons in BERT and Biological Neurons in the Human Brain

Aligning Brains into a Shared Space Improves Their Alignment to Large Language Models

Large language models surpass human experts in predicting neuroscience results

Visual representations in the human brain are aligned with large language models

Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models

BrainLM: A foundation model for brain activity recordings

Scale matters: Large language models with billions (rather than millions) of parameters better match neural representations of natural language

Human-like object concept representations emerge naturally in multimodal large language models

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

Language in Brains, Minds, and Machines

An Artificial Neuron for Enhanced Problem Solving in Large Language Models

LLM4Brain: Training a Large Language Model for Brain Video Understanding

Conceptual structure coheres in human cognition but not in large language models

Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs