Neuroevolution of Recurrent Architectures on Control Tasks

Maximilien Le Clei, Pierre Bellec
2023-04-04
Abstract: Modern artificial intelligence works typically train the parameters of fixed-sized deep neural networks using gradient-based optimization techniques. Simple evolutionary algorithms have recently been shown to also be capable of optimizing deep neural network parameters, at times matching the performance of gradient-based techniques, e.g. in reinforcement learning settings. In addition to optimizing network parameters, many evolutionary computation techniques are also capable of progressively constructing network architectures. However, constructing network architectures from elementary evolution rules has not yet been shown to scale to modern reinforcement learning benchmarks. In this paper we therefore propose a new approach in which the architectures of recurrent neural networks dynamically evolve according to a small set of mutation rules. We implement a massively parallel evolutionary algorithm and run experiments on all 19 OpenAI Gym state-based reinforcement learning control tasks. We find that in most cases, dynamic agents match or exceed the performance of gradient-based agents while utilizing orders of magnitude fewer parameters. We believe our work opens avenues for real-life applications where network compactness and autonomous design are of critical importance. We provide our source code, final model checkpoints and full results at http://github.com/MaximilienLC/nra.
Neural and Evolutionary Computing, Machine Learning
What problem does this paper attempt to address?
This paper asks whether neural network architectures constructed dynamically by an evolutionary algorithm can match or exceed, on modern reinforcement learning tasks, the performance of fixed-size networks trained with gradient-based optimization. The authors propose a method in which the architecture of a recurrent neural network (RNN) evolves according to a small set of mutation rules, and they evaluate it with a massively parallel evolutionary algorithm on all 19 OpenAI Gym state-based control tasks. In most cases, the dynamic agents match or exceed the performance of gradient-based agents while using orders of magnitude fewer parameters. The authors see this as opening avenues for real-life applications where network compactness and autonomous design are critical.
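
To make the idea of evolving a recurrent architecture through mutation rules more concrete, here is a minimal Python sketch. It is not the authors' implementation. The abstract does not list the actual mutation rules, so the four used here (add a node, remove a node, add a connection, perturb weights) are assumptions chosen for illustration. The names `DynamicNet`, `fitness` and `evolve` are hypothetical, the task is a toy one-step-delay problem instead of an OpenAI Gym environment, and the loop runs serially instead of in parallel.

```python
import copy
import math
import random


class DynamicNet:
    """Toy recurrent network whose graph grows or shrinks by mutation (illustrative only)."""

    def __init__(self, n_in, n_out):
        self.n_in, self.n_out = n_in, n_out
        self.hidden = []               # ids of evolved hidden nodes
        self.next_id = n_in + n_out    # next unused node id
        self.weights = {}              # (src, dst) -> weight
        self.state = {}                # activations from the previous step

    def reset(self):
        self.state = {}

    def targets(self):
        # Nodes allowed to receive connections: outputs and hidden nodes.
        return list(range(self.n_in, self.n_in + self.n_out)) + self.hidden

    def sources(self):
        return list(range(self.n_in)) + self.targets()

    def step(self, obs):
        # Each node reads the current inputs and the *previous* activations,
        # so any path through a hidden node carries memory across steps.
        prev = dict(self.state)
        prev.update(enumerate(obs))
        sums = {t: 0.0 for t in self.targets()}
        for (src, dst), w in self.weights.items():
            sums[dst] += w * prev.get(src, 0.0)
        self.state = {t: math.tanh(s) for t, s in sums.items()}
        return [self.state[self.n_in + i] for i in range(self.n_out)]

    def n_params(self):
        return len(self.weights)

    def mutate(self, rng):
        # Assumed rule set; the paper's real rules are not given in the abstract.
        rule = rng.choice(["add_node", "remove_node", "add_connection", "perturb"])
        if rule == "add_node":
            src, dst = rng.choice(self.sources()), rng.choice(self.targets())
            new, self.next_id = self.next_id, self.next_id + 1
            self.hidden.append(new)
            self.weights[(src, new)] = rng.gauss(0, 1)
            self.weights[(new, dst)] = rng.gauss(0, 1)
        elif rule == "remove_node" and self.hidden:
            gone = rng.choice(self.hidden)
            self.hidden.remove(gone)
            self.weights = {k: w for k, w in self.weights.items() if gone not in k}
            self.state.pop(gone, None)
        elif rule == "add_connection":
            key = (rng.choice(self.sources()), rng.choice(self.targets()))
            self.weights[key] = rng.gauss(0, 1)
        else:
            # Also reached when "remove_node" is drawn but no hidden node exists.
            for k in self.weights:
                self.weights[k] += rng.gauss(0, 0.1)


def fitness(net, seq):
    """Negative squared error for reproducing the previous input (a toy memory task)."""
    net.reset()
    err, prev_x = 0.0, 0.0
    for x in seq:
        y = net.step([x])[0]
        err += (y - prev_x) ** 2
        prev_x = x
    return -err


def evolve(generations=50, pop_size=32, seed=0):
    rng = random.Random(seed)
    seq = [rng.uniform(-0.5, 0.5) for _ in range(20)]
    pop = [DynamicNet(1, 1) for _ in range(pop_size)]  # start with no connections
    history = []
    for _ in range(generations):
        pop.sort(key=lambda n: fitness(n, seq), reverse=True)
        history.append(fitness(pop[0], seq))
        parents = pop[: pop_size // 2]                 # truncation selection
        children = [copy.deepcopy(p) for p in parents]
        for child in children:
            child.mutate(rng)
        pop = parents + children                       # parents survive unchanged
    best = max(pop, key=lambda n: fitness(n, seq))
    return best, history


if __name__ == "__main__":
    best, history = evolve()
    print("best fitness per generation (first, last):", history[0], history[-1])
    print("hidden nodes:", len(best.hidden), "parameters:", best.n_params())
```

Every network starts with zero connections and only acquires structure through mutation, so the parameter count reported by `n_params()` reflects what selection retained. Because parents are carried over unchanged, the best fitness recorded in `history` never decreases from one generation to the next.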