Abstract: Only recently have the functional implications of the organization of the ventral striatum, amygdala, and related limbic-cortical structures, and of their neuroanatomical interactions, begun to be clarified. Processes of activation and reward have long been associated with the nucleus accumbens (NAcc) and its dopamine innervation, but the precise relationships between these constructs have remained elusive. We have sought to enrich our understanding of the special role of the ventral striatum in coordinating the contribution of different functional subsystems to confer flexibility, as well as coherence and vigor, to goal-directed behavior, through different forms of associative learning. Such appetitive behavior comprises many subcomponents, some of which we have isolated in these experiments to reveal that, not surprisingly, the mechanisms by which an animal sequences responding to reach a goal are complex. The data reveal how the different components, pavlovian approach (or sign-tracking), conditioned reinforcement (whereby pavlovian stimuli control goal-directed action), and also more general response-invigorating processes (often called "activation," "stress," or "drive"), may be integrated within the ventral striatum through convergent interactions of the amygdala, other limbic cortical structures, and the mesolimbic dopamine system to produce coherent behavior. The position is probably not far different when considering aversively motivated behavior. Although it may be necessary to employ simplified, even abstract, paradigms for isolating these mechanisms, their concerted action can readily be appreciated in an adaptive, functional setting, such as the responding by rats for intravenous cocaine under a second-order schedule of reinforcement. Here, the interactions of primary reinforcement, psychomotor activation, pavlovian conditioning, and the control that drug cues exert over the integrated drug-seeking response can be seen to operate both serially and concurrently.
The power of our analytic techniques for understanding complex motivated behavior has been evident for some time. However, the crucial point is that we are now able to map these components with increasing certainty onto discrete amygdaloid and other limbic cortical-ventral striatal subsystems. The neural dissection of these mechanisms also serves an important theoretical purpose in helping to validate the various hypothetical constructs and to develop theory further. Major challenges remain, not the least of which is an understanding of the operation of the ventral striatum together with its dopaminergic innervation and its interactions with the basolateral amygdala, hippocampal formation, and prefrontal cortex at a more mechanistic, neuronal level.

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types

Dopamine prediction error responses integrate subjective value from different reward dimensions

Neural mechanisms of adaptive value coding in the amygdala

Dopamine encoding of novelty facilitates efficient uncertainty-driven exploration

Value-Driven Adaptations of Mesolimbic Dopamine Release Are Governed by Both Model-Based and Model-Free Mechanisms

How cortico-basal ganglia-thalamic subnetworks can shift decision policies to maximize reward rate

A dopamine mechanism for reward maximization

Mesolimbic dopamine adapts the rate of learning from action

Dopamine neurons learn to encode the long-term value of multiple future rewards

Dopamine neurons encode trial-by-trial subjective reward value in an auction-like task

A selective role for dopamine in stimulus-reward learning

Dopamine-independent effect of rewards on choices through hidden-state inference

Mesolimbic dopamine encodes reward prediction errors independent of learning rates

The Dopamine Prediction Error: Contributions to Associative Models of Reward Learning

Associative Processes in Addiction and Reward: The Role of Amygdala‐Ventral Striatal Subsystems

A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells

A global dopaminergic learning rate enables adaptive foraging across many options

The many worlds hypothesis of dopamine prediction error: implications of a parallel circuit architecture in the basal ganglia

Dopamine neurons drive spatiotemporally heterogeneous striatal dopamine signals during learning