Do Agents Dream of Electric Sheep?: Improving Generalization in Reinforcement Learning through Generative Learning

Giorgio Franceschelli,Mirco Musolesi
2024-03-13
Abstract:The Overfitted Brain hypothesis suggests dreams happen to allow generalization in the human brain. Here, we ask if the same is true for reinforcement learning agents as well. Given limited experience in a real environment, we use imagination-based reinforcement learning to train a policy on dream-like episodes, where non-imaginative, predicted trajectories are modified through generative augmentations. Experiments on four ProcGen environments show that, compared to classic imagination and offline training on collected experience, our method can reach a higher level of generalization when dealing with sparsely rewarded environments.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?