Two Dimensions of Value: Dopamine Neurons Represent Reward But Not Aversiveness

C. Fiorillo
DOI: https://doi.org/10.1126/science.1238699
IF: 56.9
2013-08-02
Science
Abstract:Coding Reward Versus Punishment Reinforcement learning is driven by reward prediction error, and a very influential theory has proposed that dopamine neurons provide this signal to teach value to the brain. Although this is called a reward prediction error, it has been assumed to also represent aversiveness. Thus, it was thought that the dopamine signal could be sufficient for learning total value. Fiorillo (p. 546) found that dopamine alone was not sufficient to encode value, implying that there must be an analogous signal for aversiveness. Experiments in monkeys suggest that positive and negative expectations are represented by different types of neurons. Whereas reward (appetitiveness) and aversiveness (punishment) have been distinguished as two discrete dimensions within psychology and behavior, physiological and computational models of their neural representation have treated them as opposite sides of a single continuous dimension of “value.” Here, I show that although dopamine neurons of the primate ventral midbrain are activated by evidence for reward and suppressed by evidence against reward, they are insensitive to aversiveness. This indicates that reward and aversiveness are represented independently as two dimensions, even by neurons that are closely related to motor function. Because theory and experiment support the existence of opponent neural representations for value, the present results imply four types of value-sensitive neurons corresponding to reward-ON (dopamine), reward-OFF, aversive-ON, and aversive-OFF.
What problem does this paper attempt to address?