Policy optimization emerges from noisy representation learning

Jonah Wolfsdorf Brenner, Chenguang Li, Gabriel Kreiman
DOI: https://doi.org/10.1101/2024.11.01.621621
2024-11-03
Abstract: Nervous systems learn representations of the world and policies to act within it. We present a framework that uses reward-dependent noise to facilitate policy optimization in representation learning networks. These networks balance extracting normative features and task-relevant information to solve tasks. Moreover, their representation changes reproduce several experimentally observed shifts in the neural code during task learning. Our framework presents a biologically plausible mechanism for emergent policy optimization amid evidence that representation learning plays a vital role in governing neural dynamics.
Neuroscience
What problem does this paper attempt to address?