Neuroevolution of Recurrent Architectures on Control Tasks

Maximilien Le Clei, Pierre Bellec

2023-04-04

Abstract: Modern artificial intelligence works typically train the parameters of fixed-sized deep neural networks using gradient-based optimization techniques. Simple evolutionary algorithms have recently been shown to also be capable of optimizing deep neural network parameters, at times matching the performance of gradient-based techniques, e.g. in reinforcement learning settings. In addition to optimizing network parameters, many evolutionary computation techniques are also capable of progressively constructing network architectures. However, constructing network architectures from elementary evolution rules has not yet been shown to scale to modern reinforcement learning benchmarks. In this paper we therefore propose a new approach in which the architectures of recurrent neural networks dynamically evolve according to a small set of mutation rules. We implement a massively parallel evolutionary algorithm and run experiments on all 19 OpenAI Gym state-based reinforcement learning control tasks. We find that in most cases, dynamic agents match or exceed the performance of gradient-based agents while utilizing orders of magnitude fewer parameters. We believe our work to open avenues for real-life applications where network compactness and autonomous design are of critical importance. We provide our source code, final model checkpoints and full results at http://github.com/MaximilienLC/nra.

Neural and Evolutionary Computing, Machine Learning

What problem does this paper attempt to address?

This paper asks whether neural network architectures constructed dynamically by an evolutionary algorithm can match or exceed, on modern reinforcement learning benchmarks, the performance of fixed-size networks trained with gradient-based optimization. The authors propose a method in which the architectures of recurrent neural networks (RNNs) evolve according to a small set of mutation rules, and they evaluate it with a massively parallel evolutionary algorithm on all 19 OpenAI Gym state-based control tasks. In most cases, the dynamic agents match or exceed the performance of gradient-based agents while using orders of magnitude fewer parameters. The authors see this as relevant to real-life applications where network compactness and autonomous design are critical.
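The general idea of growing a recurrent network through mutation alone can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the `RecurrentNet` class, the three mutation rules (grow a node, grow an edge, prune an edge), the truncation selection scheme, and the toy "recall the previous input" task are all hypothetical and are not taken from the paper, which uses its own mutation rules and the OpenAI Gym tasks.

```python
import copy
import math
import random


class RecurrentNet:
    """Tiny recurrent net: nodes hold a state, weighted edges may form cycles."""

    def __init__(self, n_in, n_out):
        self.n_in, self.n_out = n_in, n_out
        self.n_nodes = n_in + n_out      # hidden nodes are appended by mutation
        self.weights = {}                # (src, dst) -> weight
        self.state = [0.0] * self.n_nodes

    def reset(self):
        self.state = [0.0] * self.n_nodes

    def step(self, obs):
        # Input nodes are clamped to the observation; every other node reads
        # the previous state of its sources, so information persists in time.
        prev = list(self.state)
        prev[: self.n_in] = obs
        sums = [0.0] * self.n_nodes
        for (src, dst), w in self.weights.items():
            sums[dst] += w * prev[src]
        self.state = [math.tanh(s) for s in sums]
        self.state[: self.n_in] = obs
        return self.state[self.n_in : self.n_in + self.n_out]

    def n_params(self):
        return len(self.weights)


def mutate(net, rng):
    """Return a mutated copy of net (hypothetical rules, not the paper's)."""
    child = copy.deepcopy(net)
    rule = rng.choice(["grow_node", "grow_edge", "prune_edge"])
    if rule == "grow_node":
        child.n_nodes += 1
        child.state.append(0.0)
        new = child.n_nodes - 1
        src = rng.randrange(child.n_nodes)
        dst = rng.randrange(child.n_in, child.n_nodes)  # never write to inputs
        child.weights[(src, new)] = rng.gauss(0, 1)
        child.weights[(new, dst)] = rng.gauss(0, 1)
    elif rule == "grow_edge":
        src = rng.randrange(child.n_nodes)
        dst = rng.randrange(child.n_in, child.n_nodes)
        child.weights[(src, dst)] = rng.gauss(0, 1)
    elif rule == "prune_edge" and child.weights:
        del child.weights[rng.choice(sorted(child.weights))]
    for key in child.weights:            # small jitter on every weight
        child.weights[key] += rng.gauss(0, 0.1)
    return child


# Toy task (an assumption): output the input seen at the previous time step.
SEQUENCE = [0.5, -0.5, -0.5, 0.5, 0.5, 0.5, -0.5, 0.5, -0.5, -0.5]


def fitness(net):
    net.reset()
    error, previous = 0.0, 0.0
    for x in SEQUENCE:
        (y,) = net.step([x])
        error += (y - previous) ** 2
        previous = x
    return -error


def evolve(generations=200, pop_size=32, seed=0):
    rng = random.Random(seed)
    population = [RecurrentNet(1, 1) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        population.sort(key=fitness, reverse=True)
        history.append(fitness(population[0]))
        survivors = population[: pop_size // 2]   # truncation selection
        population = survivors + [mutate(p, rng) for p in survivors]
    return max(population, key=fitness), history


if __name__ == "__main__":
    best, history = evolve()
    print("best fitness:", fitness(best), "parameters:", best.n_params())
```

Each network starts with no connections at all, so any structure it ends up with was added by mutation. Because the surviving half of the population is carried over unchanged and the fitness evaluation is deterministic, the best fitness in this sketch never decreases from one generation to the next.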

Generative Adversarial Neuroevolution for Control Behaviour Imitation

Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning

The Ant Swarm Neuro-Evolution Procedure for Optimizing Recurrent Networks

An Efficient Application of Neuroevolution for Competitive Multiagent Learning

Breaching the Bottleneck: Evolutionary Transition from Reward-Driven Learning to Reward-Agnostic Domain-Adapted Learning in Neuromodulated Neural Nets

Multitask Neuroevolution for Reinforcement Learning with Long and Short Episodes

Asynchronous Evolution of Deep Neural Network Architectures

Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks

Neural Architecture Search using Progressive Evolution

Limited Evaluation Evolutionary Optimization of Large Neural Networks

Neuroevolution in Deep Neural Networks: Current Trends and Future Challenges

Towards the Neuroevolution of Low-level artificial general intelligence

Improving the performance of mutation-based evolving artificial neural networks with self-adaptive mutations

Evolving neural networks to follow trajectories of arbitrary complexity

Improving the sample-efficiency of neural architecture search with reinforcement learning

Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization

odNEAT: An Algorithm for Decentralised Online Evolution of Robotic Controllers

Evolutionary Reinforcement Learning via Cooperative Coevolution

Multi-Objective Neural Evolutionary Algorithm for Combinatorial Optimization Problems

Deep Neuroevolution of Recurrent and Discrete World Models