Abstract: Robotic motion planning in dense and dynamic indoor scenarios remains challenging because the motion of obstacles is unpredictable. Recent progress in reinforcement learning enables robots to cope better with dense and unpredictable obstacles by encoding complex features of the robot and obstacles with encoders such as long short-term memory (LSTM). These features are then learned by the robot using reinforcement learning algorithms such as the deep Q network and the asynchronous advantage actor-critic algorithm. However, existing methods depend heavily on expert experiences to speed up network convergence, initializing the networks via imitation learning. Moreover, approaches that use LSTM to encode obstacle features are not always efficient or robust enough, and sometimes cause the network to overfit during training. This paper focuses on the advantage actor-critic algorithm and introduces an attention-based actor-critic algorithm with experience replay that improves on the existing algorithm in two respects. First, the LSTM encoder is replaced by a more robust attention-weight encoder to better interpret the complex features of the robot and obstacles. Second, the robot learns from its own prioritized past experiences to initialize the networks of the advantage actor-critic algorithm. This is achieved by applying the prioritized experience replay method, which makes the most of useful past experiences to improve convergence speed. As a result, the network trained with our algorithm needs only around 15% and 30% of the experiences to get past the early stage of training, without expert experiences, in cases with five and ten obstacles, respectively. It then converges faster to a better reward with fewer experiences (about 45% and 65% of the experiences in cases with ten and five obstacles, respectively) compared with the baseline LSTM-based advantage actor-critic algorithm.
Our source code is freely available on GitHub (https://github.com/CHUENGMINCHOU/AW-PER-A2C).
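
The two ingredients named in the abstract, attention weighting over obstacle features and prioritized experience replay, can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository above for that). It assumes the attention scores are already given (in practice they would come from a learned network) and that replay priorities follow the commonly used proportional scheme, where priority is (|TD error| + eps)^alpha. The names `attention_pool` and `PrioritizedReplay` and the parameter values `alpha`, `beta`, and `eps` are hypothetical choices for illustration only.

```python
import math
import random


def attention_pool(features, scores):
    """Softmax the scores, then return (weights, weighted sum of feature vectors).

    `scores` are assumed to be given; a learned scoring network would
    normally produce one score per obstacle.
    """
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(features[0])
    pooled = [sum(w * f[k] for w, f in zip(weights, features)) for k in range(dim)]
    return weights, pooled


class PrioritizedReplay:
    """Proportional prioritized replay (illustrative, O(n) sampling)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-3):
        self.capacity = capacity
        self.alpha = alpha  # how strongly priorities skew sampling (assumed value)
        self.eps = eps      # keeps zero-error transitions sampleable
        self.items = []
        self.priorities = []

    def add(self, transition, td_error):
        # Drop the oldest transition once the buffer is full.
        if len(self.items) >= self.capacity:
            self.items.pop(0)
            self.priorities.pop(0)
        self.items.append(transition)
        self.priorities.append((abs(td_error) + self.eps) ** self.alpha)

    def probabilities(self):
        total = sum(self.priorities)
        return [p / total for p in self.priorities]

    def sample(self, batch_size, beta=0.4, rng=random):
        """Return (transition, importance weight) pairs, weights scaled to at most 1."""
        probs = self.probabilities()
        n = len(self.items)
        idx = rng.choices(range(n), weights=probs, k=batch_size)
        raw = [(n * probs[i]) ** (-beta) for i in idx]
        max_w = max(raw)  # normalize by the batch maximum (an assumption)
        return [(self.items[i], w / max_w) for i, w in zip(idx, raw)]
```

In this sketch, transitions with larger temporal-difference error are drawn more often, and the importance weights compensate for the resulting sampling bias. A practical version would typically replace the list-based buffer with a sum-tree for faster sampling.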

Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic motion planning