Abstract:Robotic motion planning in dense and dynamic indoor scenarios constantly challenges the researchers because of the motion unpredictability of obstacles. Recent progress in reinforcement learning enables robots to better cope with the dense and unpredictable obstacles by encoding complex features of the robot and obstacles into the encoders like the long-short term memory (LSTM). Then these features are learned by the robot using reinforcement learning algorithms, such as the deep Q network and asynchronous advantage actor critic algorithm. However, existing methods depend heavily on expert experiences to enhance the convergence speed of the networks by initializing them via imitation learning. Moreover, those approaches based on LSTM to encode the obstacle features are not always efficient and robust enough, therefore sometimes causing the network overfitting in training. This paper focuses on the advantage actor critic algorithm and introduces an attention-based actor critic algorithm with experience replay algorithm to improve the performance of existing algorithm from two perspectives. First, LSTM encoder is replaced by a robust encoder attention weight to better interpret the complex features of the robot and obstacles. Second, the robot learns from its past prioritized experiences to initialize the networks of the advantage actor-critic algorithm. This is achieved by applying the prioritized experience replay method, which makes the best of past useful experiences to improve the convergence speed. As results, the network based on our algorithm takes only around 15% and 30% experiences to get rid of the early-stage training without the expert experiences in cases with five and ten obstacles, respectively. Then it converges faster to a better reward with less experiences (near 45% and 65% of experiences in cases with ten and five obstacles respectively) when comparing with the baseline LSTM-based advantage actor critic algorithm. Our source code is freely available at the GitHub (https://github.com/CHUENGMINCHOU/AW-PER-A2C).

Achieving mouse-level strategic evasion performance using real-time computational planning

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning.

Behavioral anatomy of a hunt : Using dynamic real-world paradigm and computer vision to compare human user-generated strategies with prey movement varying in predictability

A Real-Time Immune Planning Algorithm Incorporating a Specific Immune Mechanism for Multi-Robots in Complex Environments.

Spatial planning with long visual range benefits escape from visual predators in complex naturalistic environments

Planning strategy for intruder agent based on game theory and artificial potential field

Optimizing Evasive Strategies for an Evader with Imperfect Vision Capacity

An advantage actor-critic algorithm for robotic motion planning in dense and dynamic scenarios

Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic motion planning

Reactive Neural Path Planning with Dynamic Obstacle Avoidance in a Condensed Configuration Space

Mouse Escape Behaviors and mPFC-BLA Activity Dataset: Understanding Flexible Defensive Strategies Under Threat

Autonomous 3D Exploration in Large-Scale Environments with Dynamic Obstacles

Edge Accelerated Robot Navigation With Collaborative Motion Planning

EV-Planner: Energy-Efficient Robot Navigation via Event-Based Physics-Guided Neuromorphic Planner

On efficient computation in active inference

An efficient multi-robot path planning solution using A* and coevolutionary algorithms

Speeding Up Optimization-based Motion Planning through Deep Learning

Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games

Planning with a Receding Horizon for Manipulation in Clutter using a Learned Value Function

Deceptive Planning for Resource Allocation

Influence-Augmented Online Planning for Complex Environments