Abstract:Within computational neuroscience, the algorithmic and neural basis of structure learning remains poorly understood. Concept learning is one primary example, which requires both a type of internal model expansion process (adding novel hidden states that explain new observations), and a model reduction process (merging different states into one underlying cause and thus reducing model complexity via meta-learning). Although various algorithmic models of concept learning have been proposed within machine learning and cognitive science, many are limited to various degrees by an inability to generalize, the need for very large amounts of training data, and/or insufficiently established biological plausibility. Using concept learning as an example case, we introduce a novel approach for modeling structure learning—and specifically state-space expansion and reduction—within the active inference framework and its accompanying neural process theory. Our aim is to demonstrate its potential to facilitate a novel line of active inference research in this area. The approach we lay out is based on the idea that a generative model can be equipped with extra (hidden state or cause) "slots" that can be engaged when an agent learns about novel concepts. This can be combined with a Bayesian model reduction process, in which any concept learning—associated with these slots—can be reset in favor of a simpler model with higher model evidence. We use simulations to illustrate this model's ability to add new concepts to its state space (with relatively few observations) and increase the granularity of the concepts it currently possesses. We also simulate the predicted neural basis of these processes. We further show that it can accomplish a simple form of "one-shot" generalization to new stimuli. Although deliberately simple, these simulation results highlight ways in which active inference could offer useful resources in developing neurocomputational models of structure learning. They provide a template for how future active inference research could apply this approach to real-world structure learning problems and assess the added utility it may offer.

Reconciling Shared versus Context-Specific Information in a Neural Network Model of Latent Causes

Towards Human-like Perception: Learning Structural Causal Model in Heterogeneous Graph

The Causal-Neural Connection: Expressiveness, Learnability, and Inference

Learning Latent Causal Structures with a Redundant Input Neural Network

Unleashing the Potential of Spiking Neural Networks for Sequential Modeling with Contextual Embedding.

Latent circuit inference from heterogeneous neural responses during cognitive tasks

Recognizing Cognitive Load by a Hybrid Spatio-Temporal Causal Model from Multivariate Physiological Data

UCLN: Toward the Causal Understanding of Brain Disorders With Temporal Lag Dynamics

The Ubiquity of Time in Latent-cause Inference

Event Causality Identification Via Competitive-Cooperative Cognition Networks

Amortized learning of neural causal representations

Latent Conjunctive Bayesian Network: Unify Attribute Hierarchy and Bayesian Network for Cognitive Diagnosis

Cause and Effect: Can Large Language Models Truly Understand Causality?

Enhancing the Performance of Neural Networks Through Causal Discovery and Integration of Domain Knowledge

Curriculum effects and compositionality emerge with in-context learning in neural networks

Latent representations in hippocampal network model co-evolve with behavioral exploration of task structure

Multilayer In-Place Learning Networks: Multitask Invariance And Adaptive Lateral Connections

Latent Cognizance: What Machine Really Learns

Generalized Independent Noise Condition for Estimating Causal Structure with Latent Variables

Deep Recurrent Modelling of Granger Causality with Latent Confounding

An Active Inference Approach to Modeling Structure Learning: Concept Learning as an Example Case