Towards biologically plausible model-based reinforcement learning in recurrent spiking networks by dreaming new experiences

Cristiano Capone,Pier Stanislao Paolucci
DOI: https://doi.org/10.1038/s41598-024-65631-y
IF: 4.6
2024-06-27
Scientific Reports
Abstract:Humans and animals can learn new skills after practicing for a few hours, while current reinforcement learning algorithms require a large amount of data to achieve good performances. Recent model-based approaches show promising results by reducing the number of necessary interactions with the environment to learn a desirable policy. However, these methods require biological implausible ingredients, such as the detailed storage of older experiences, and long periods of offline learning. The optimal way to learn and exploit world-models is still an open question. Taking inspiration from biology, we suggest that dreaming might be an efficient expedient to use an inner model. We propose a two-module (agent and model) spiking neural network in which "dreaming" (living new experiences in a model-based simulated environment) significantly boosts learning. Importantly, our model does not require the detailed storage of experiences, and learns online the world-model and the policy. Moreover, we stress that our network is composed of spiking neurons, further increasing the biological plausibility and implementability in neuromorphic hardware.
multidisciplinary sciences
What problem does this paper attempt to address?