Abstract:Previous reports have described that neural activities in midbrain dopamine areas are sensitive to unexpected reward delivery and omission. These activities are correlated with reward prediction error in reinforcement learning models, the difference between predicted reward values and the obtained reward outcome. These findings suggest that the reward prediction error signal in the brain updates reward prediction through stimulus-reward experiences. It remains unknown, however, how sensory processing of reward-predicting stimuli contributes to the computation of reward prediction error. To elucidate this issue, we examined the relation between stimulus discriminability of the reward-predicting stimuli and the reward prediction error signal in the brain using functional magnetic resonance imaging (fMRI). Before main experiments, subjects learned an association between the orientation of a perceptually salient (high-contrast) Gabor patch and a juice reward. The subjects were then presented with lower-contrast Gabor patch stimuli to predict a reward. We calculated the correlation between fMRI signals and reward prediction error in two reinforcement learning models: a model including the modulation of reward prediction by stimulus discriminability and a model excluding this modulation. Results showed that fMRI signals in the midbrain are more highly correlated with reward prediction error in the model that includes stimulus discriminability than in the model that excludes stimulus discriminability. No regions showed higher correlation with the model that excludes stimulus discriminability. Moreover, results show that the difference in correlation between the two models was significant from the first session of the experiment, suggesting that the reward computation in the midbrain was modulated based on stimulus discriminability before learning a new contingency between perceptually ambiguous stimuli and a reward. These results suggest that the human reward system can incorporate the level of the stimulus discriminability flexibly into reward computations by modulating previously acquired reward values for a typical stimulus.

Reward Inference by Primate Prefrontal and Striatal Neurons

Reward Prediction in Prefrontal Cortex and Striatum

Dissociable Functions of Reward Inference in the Lateral Prefrontal Cortex and the Striatum

Reward Prediction Based on Stimulus Categorization in Primate Lateral Prefrontal Cortex

[Reward information encoded by power of local field potentials in the primate prefrontal cortex and striatum].

Stimulus and reward information encoded by population neurons in the primate prefrontal cortex and striatum

Differences In Reward Processing Between Putative Cell Types In Primate Prefrontal Cortex

Category Inference and Prefrontal Cortex

A Comparison of Reward Values Encoding Function Between the Prefrontal Cortex and Striatum in Monkey

Injection of Muscimol into Prefrontal Cortex Impairs Monkey's Reward Transitive Inference

Contributions of distinct prefrontal neuron classes in reward processing

Quantitative Analysis of Functional Connectivity Between Prefrontal Cortex and Striatum in Monkey

Reward-Modulated Functional Connectivity Between Prefrontal Cortex and Striatum

Model-based reward prediction in the primate prefrontal cortex

Signal Interaction Between Prefrontal Cortex and Striatum in Reward Prediction

Causal Interaction Between Prefrontal Cortex and Striatum Estimated by Granger Causality

Stimulus-dependent adjustment of reward prediction error in the midbrain

Movement-independent representation of reward-predicting cues in the medial part of the primate premotor cortex

Dopamine-independent effect of rewards on choices through hidden-state inference

Dopamine neurons learn to encode the long-term value of multiple future rewards

Anterior cingulate learns reward distribution