Abstract: Research on robot automation, which aims to make robots capable of interacting directly with the world and with humans, has grown substantially. Learning from human demonstration is central to this effort. However, dependence on demonstrations restricts a robot to a fixed scenario: it cannot explore variant situations to accomplish the same task that was demonstrated. Deep reinforcement learning offers a way for robots to learn beyond human demonstration and to complete tasks in unknown situations. Exploration is the core of such generalization to different environments, but exploration in reinforcement learning can be ineffective and suffers from low sample efficiency. In this paper, we present Evolutionary Policy Gradient (EPG), which lets a robot learn from demonstration and perform goal-oriented exploration efficiently. Through goal-oriented exploration, our method generalizes the skill a robot has learned to environments with different parameters. EPG combines parameter perturbation with the policy gradient method in the framework of Evolutionary Algorithms (EAs) and fuses the benefits of both, achieving effective and efficient exploration. With demonstrations guiding the evolutionary process, the robot can accelerate goal-oriented exploration and generalize its capability to variant scenarios. Experiments on robot control tasks in OpenAI Gym with dense and sparse rewards show that EPG provides competitive performance compared with the original policy gradient methods and EAs. In the manipulator task, our robot learns to open a door using vision in environments that differ from those in which the demonstrations were provided.
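The abstract describes combining parameter perturbation with gradient updates inside an evolutionary loop. The following is only a minimal sketch of that general idea, not the authors' EPG algorithm. It rests on several assumptions: the objective is a toy quadratic fitness standing in for an episode return, the analytic gradient of that toy fitness stands in for a policy gradient estimate, and seeding the population from a single starting vector is a loose stand-in for demonstration guidance. All names (`fitness`, `gradient_step`, `perturb`, `evolve`) and all hyperparameters are hypothetical.

```python
import random


def fitness(theta, goal):
    """Toy stand-in for an episode return: higher is better, maximal at theta == goal."""
    return -sum((t - g) ** 2 for t, g in zip(theta, goal))


def gradient_step(theta, goal, lr):
    """One gradient-ascent step on the toy fitness.

    Assumption: this analytic gradient replaces a policy gradient estimate,
    which a real agent would compute from sampled trajectories.
    """
    return [t + lr * (-2.0 * (t - g)) for t, g in zip(theta, goal)]


def perturb(theta, sigma, rng):
    """Gaussian parameter perturbation (the evolutionary mutation step)."""
    return [t + rng.gauss(0.0, sigma) for t in theta]


def evolve(seed_theta, goal, generations=50, pop_size=10, n_elite=3,
           sigma=0.1, lr=0.05, seed=0):
    """Evolutionary loop whose variation operator is perturbation plus a gradient step."""
    rng = random.Random(seed)
    # Assumption: the population is seeded around one starting parameter vector,
    # loosely playing the role of a policy initialized from demonstrations.
    population = [perturb(seed_theta, sigma, rng) for _ in range(pop_size)]
    for _ in range(generations):
        # Variation: mutate each member, then refine it with a gradient step.
        candidates = [gradient_step(perturb(p, sigma, rng), goal, lr)
                      for p in population]
        # Selection: keep the fittest of parents and candidates (elitism).
        pool = sorted(population + candidates,
                      key=lambda th: fitness(th, goal), reverse=True)
        elites = pool[:n_elite]
        # Refill the population with copies of the elites.
        population = [list(elites[i % n_elite]) for i in range(pop_size)]
    return population[0]  # the fittest member found
```

Calling `evolve([0.0, 0.0], [1.0, -2.0])` returns a parameter vector close to the goal. Because selection ranks parents together with candidates, the best fitness in the population never decreases from one generation to the next, while the perturbation keeps the search from relying on the gradient alone.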

Towards Behavior Control for Evolutionary Robot Based on RL with ENN

RL and ANN Based Modular Path Planning Controller for Resource-Constrained Robots in the Indoor Complex Dynamic Environment

Adaptive Locomotion Control of a Hexapod Robot Via Bio-Inspired Learning

Generalize Robot Learning from Demonstration to Variant Scenarios with Evolutionary Policy Gradient

GA-Aided Elman Neural Network Controller For Behavior-Based Robot

Mobile Robot Behavior Controller Based on Genetic Diagonal Recurrent Neural Network

Evolution of Cooperative Ensemble Neural Network Controller for Autonomous Mobile Robots

An Automatic Control Model for Rat-Robot

Elman Neural Network Controller For Behavior-Based Robot

Application of Artificial Neural Network and Immune-Genetic Algorithm with Elitist to Cooperative Transport of Multi-robots System

Robot Behavior Control Based on Multi-LCS and the Artificial Potential Field

Research On Mobile Robot Behaviors Based On Chaotic Neural Network

Emergent adaptive behaviour of GRN-controlled simulated robots in a changing environment

Approach to controlling robot by artificial brain based on parallel evolutionary neural network

An Evolutionary Computation Approach to Group Behavior Acquisition in Autonomous Robots

A New Scheme Based On Reactive Behavior And Evolutionary Algorithm For Collision Avoidance Of Multi-Robot System

An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning

Adaptive Niche Genetic Algorithm Based Path Planning and Dynamic Obstacle Avoidance of Mobile Robots

Stability control method of a biped robot via neural network based on GA

Exploiting Inherent Regularity in Control of Multilegged Robot Locomotion by Evolving Neural Fields

Model Adaptive Gait Scheme Based On Evolutionary Algorithm