Abstract:Environmental cues, through Pavlovian learning, become conditioned stimuli that invigorate and guide animals toward acquisition of rewards. Dopamine neurons in the ventral tegmental area (VTA) and substantia nigra (SNC) are crucial for this process. Dopamine neurons are embedded in a reciprocally connected network with their striatal targets, the functional organization of which remains poorly understood. Here, we investigated how learning during optogenetic Pavlovian cue conditioning of VTA or SNC dopamine neurons directs cue-evoked behavior and shapes subregion-specific striatal dopamine dynamics. We used a fluorescent dopamine biosensor to monitor dopamine in the nucleus accumbens (NAc) core and shell, dorsomedial striatum (DMS), and dorsolateral striatum (DLS). We demonstrate spatially heterogeneous, learning-dependent dopamine changes across striatal regions. While VTA stimulation evoked robust dopamine release in NAc core, shell, and DMS, cues predictive of this activation preferentially recruited dopamine release in NAc core, starting early in training, and DMS, late in training. Corresponding negative prediction error signals, reflecting a violation in the expectation of dopamine neuron activation, only emerged in the NAc core and DMS, and not the shell. Despite development of vigorous movement late in training, conditioned dopamine signals did not similarly emerge in the DLS, even during Pavlovian conditioning with SNC dopamine neuron activation, which elicited robust DLS dopamine release. Together, our studies show broad dissociation in the fundamental prediction and reward-related information generated by different dopamine neuron populations and signaled by dopamine across the striatum. Further, they offer new insight into how larger-scale plasticity across the striatal network emerges during Pavlovian learning to coordinate behavior.

Vector-valued dopamine improves learning of continuous outputs in the striatum

Dopamine transients encode reward prediction errors independent of learning rates

Mesolimbic dopamine encodes reward prediction errors independent of learning rates

Dopamine neurons drive spatiotemporally heterogeneous striatal dopamine signals during learning

Striatal dopamine reflects individual long-term learning trajectories

Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time

An opponent striatal circuit for distributional reinforcement learning

Reinforcement Learning in a Neurally Controlled Robot Using Dopamine Modulated STDP

Dynamic shaping of dopamine signals during probabilistic Pavlovian conditioning

Dopamine transients follow a striatal gradient of reward time horizons

Dopamine, Updated: Reward Prediction Error and Beyond

Nigrostriatal Dopamine Signals Sequence-Specific Action-Outcome Prediction Errors

Dopamine Release Plateau and Outcome Signals in Dorsal Striatum Contrast with Classic Reinforcement Learning Formulations

Subsecond dopamine fluctuations in human striatum encode superposed error signals about actual and counterfactual reward

Action prediction error: a value-free dopaminergic teaching signal that drives stable learning

A causal link between prediction errors, dopamine neurons and learning

A feature-specific prediction error model explains dopaminergic heterogeneity

Rethinking dopamine as generalized prediction error

What is dopamine doing in model-based reinforcement learning?

The many worlds hypothesis of dopamine prediction error: implications of a parallel circuit architecture in the basal ganglia