Slow and Steady Wins the Race: Maintaining Plasticity with Hare and Tortoise Networks

Hojoon Lee,Hyeonseo Cho,Hyunseung Kim,Donghu Kim,Dugki Min,Jaegul Choo,Clare Lyle
2024-06-01
Abstract:This study investigates the loss of generalization ability in neural networks, revisiting warm-starting experiments from Ash & Adams. Our empirical analysis reveals that common methods designed to enhance plasticity by maintaining trainability provide limited benefits to generalization. While reinitializing the network can be effective, it also risks losing valuable prior knowledge. To this end, we introduce the Hare & Tortoise, inspired by the brain's complementary learning system. Hare & Tortoise consists of two components: the Hare network, which rapidly adapts to new information analogously to the hippocampus, and the Tortoise network, which gradually integrates knowledge akin to the neocortex. By periodically reinitializing the Hare network to the Tortoise's weights, our method preserves plasticity while retaining general knowledge. Hare & Tortoise can effectively maintain the network's ability to generalize, which improves advanced reinforcement learning algorithms on the Atari-100k benchmark. The code is available at <a class="link-external link-https" href="https://github.com/dojeon-ai/hare-tortoise" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper attempts to address the issue of neural networks gradually losing their generalization ability during training. Specifically, the researchers revisited the warm-up experiments by Ash & Adams (2020) and found that common methods to enhance plasticity (i.e., maintaining training capability) have limited effects on improving generalization. While reinitializing the network can effectively restore training capability, it may also lead to the loss of existing knowledge. To tackle this challenge, the paper introduces a new network architecture called "Hare & Tortoise," inspired by the brain's complementary learning systems. This architecture consists of two components: 1. **Hare Network**: Similar to the hippocampus, it can quickly adapt to new information. 2. **Tortoise Network**: Similar to the neocortex, it gradually integrates knowledge. By periodically resetting the weights of the Hare Network to those of the Tortoise Network, this method can maintain plasticity while preserving general knowledge. Experiments demonstrate that the Hare & Tortoise architecture effectively maintains the network's generalization ability across multiple experimental settings, particularly excelling in the reinforcement learning Atari-100k benchmark. In summary, the main goal of the paper is to address the problem of neural networks losing their generalization ability during prolonged training and to propose a new architecture that balances rapid adaptation and stable generalization.