Prefrontal cortex as a meta-reinforcement learning system

Jane X. Wang,Zeb Kurth-Nelson,Dharshan Kumaran,Dhruva Tirumala,Hubert Soyer,Joel Z. Leibo,Demis Hassabis,Matthew Botvinick
DOI: https://doi.org/10.1038/s41593-018-0147-8
IF: 25
2018-05-14
Nature Neuroscience
Abstract:Over the past 20 years, neuroscience research on reward-based learning has converged on a canonical model, under which the neurotransmitter dopamine ‘stamps in’ associations between situations, actions and rewards by modulating the strength of synaptic connections between neurons. However, a growing number of recent findings have placed this standard model under strain. We now draw on recent advances in artificial intelligence to introduce a new theory of reward-based learning. Here, the dopamine system trains another part of the brain, the prefrontal cortex, to operate as its own free-standing learning system. This new perspective accommodates the findings that motivated the standard model, but also deals gracefully with a wider range of observations, providing a fresh foundation for future research.
neurosciences
What problem does this paper attempt to address?