Abstract:Abstract State-space and action representations form the building blocks of decision-making processes in the brain; states map external cues to the current situation of the agent whereas actions provide the set of motor commands from which the agent can choose to achieve specific goals. Although these factors differ across environments, it is not currently known whether or how accurately state and action representations are acquired by the agent because previous experiments have typically provided this information a priori through instruction or pre-training. Here we show that, in the absence of such a priori knowledge, state and action representations adapt to reflect the structure of the world. We used a sequential decision-making task in rats in which they were required to pass through multiple states before reaching the goal, and for which the number of states and how they map onto external cues were not known a priori. We found that, early in training, animals selected actions as if the task was not sequential and outcomes were the immediate consequence of the most proximal action. During the course of training, however, rats recovered the true structure of the environment and made decisions based on the expanded state-space, reflecting the multiple stages of the task. We found a similar pattern with actions; early in training animals only considered the execution of single actions whereas, after training, they created useful action sequences that expanded the set of available actions. We conclude that the profile of choices shows a gradual shift from simple representations of actions and states to more complex structures compatible with the structure of the world.

States as goal-directed concepts: an epistemic approach to state-representation learning

Learning telic-controllable state representations

Learning the structure of the world: The adaptive nature of state-space and action representations in multi-stage decision-making

State Representations as Incentives for Reinforcement Learning Agents: A Sim2Real Analysis on Robotic Grasping

Learning Actionable Representations with Goal-Conditioned Policies

Internal states emerge early during learning of a perceptual decision-making task

Computational mechanisms of curiosity and goal-directed exploration

Learning Causal State Representations of Partially Observable Environments

Goal-oriented inference of environment from redundant observations

State-dependent Online Reactivations for Different Learning Strategies in Foraging

Modeling Complex Animal Behavior with Latent State Inverse Reinforcement Learning

A First-Occupancy Representation for Reinforcement Learning

State2Explanation: Concept-Based Explanations to Benefit Agent Learning and User Understanding

Unsupervised State Representation Learning in Atari

An Overview of Natural Language State Representation for Reinforcement Learning

Goal-Driven Cognition in the Brain: A Computational Framework

Bridging State and History Representations: Understanding Self-Predictive RL

Humans rationally balance detailed and temporally abstract world models

Information is Power: Intrinsic Control via Information Capture

Distinct Processing of the State Prediction Error Signals in Frontal and Parietal Correlates in Learning the Environment Model.

Using deep reinforcement learning to reveal how the brain encodes abstract state-space representations in high-dimensional environments