Abstract: Building a general and efficient path planning framework for uncertain nonconvex environments is challenging because of safety constraints and complex configurations. Traditional approaches usually convexify obstacles and assume Gaussian distributions, neither of which holds universally, and they do not guarantee fast convergence to high-quality solutions. We therefore develop a novel neural risk-bounded path planner that quickly finds near-optimal solutions with an acceptable collision probability in complex environments. First, we retrieve the nonconvex obstacles with arbitrary probabilistic uncertainties in the form of a deterministic point cloud map. A neural network sampler, trained on sufficient expert demonstrations, encodes this map into a latent embedding and predicts states in a subspace that potentially contains the optimal solution. We construct a neural cost estimator to select the best informed state from those samples. Then, we recursively apply these simple yet effective neural networks to march toward the start and goal bidirectionally. The collision risk of the intermediate connections is verified using sum-of-squares optimization. Simulation results show that, compared with state-of-the-art methods, our approach finds comparable solutions with significantly less time and fewer resources in both seen and unseen challenging environments.

Note to Practitioners: Robots are increasingly deployed in unstructured environments such as forests and subterranean caves. However, uncertainty in situational awareness of the environment often causes accidents. To quickly generate safe paths that are not overly conservative in uncertain complex environments, we propose a neural risk-bounded sampling-based path planner. Conventional methods consume substantial computation time and resources to generate satisfactory results. Our learning-based risk-bounded path planning framework efficiently finds paths that avoid uncertain nonconvex static obstacles within a guaranteed risk tolerance. It imitates the expert to generate informed states in a subspace that potentially contains the optimal solution. In practice, the observed uncertain obstacles on a grid map must be formulated as polynomials containing random variables, and the probability distributions of those variables must be determined.
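
The sample, select, verify, and bidirectional-march loop described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the function names (`sample_states`, `estimate_cost`, `edge_is_safe`, `plan`) are hypothetical, the neural sampler and neural cost estimator are replaced by hand-written heuristics, and the sum-of-squares risk verification is replaced by a closed-form pointwise risk for a single disc obstacle whose radius is a uniformly distributed random variable.

```python
import math
import random

# Hypothetical uncertain obstacle: a disc centred at (CX, CY) whose radius w is
# uniformly distributed on [R_LO, R_HI] (a non-Gaussian assumption). A point is
# in collision when g(x, y, w) = w**2 - (x - CX)**2 - (y - CY)**2 >= 0.
CX, CY, R_LO, R_HI = 0.5, 0.5, 0.15, 0.25

def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])

def point_risk(p):
    """Collision probability P(w >= d) at point p, clamped to [0, 1]."""
    d = dist(p, (CX, CY))
    return min(1.0, max(0.0, (R_HI - d) / (R_HI - R_LO)))

def edge_is_safe(a, b, delta, steps=20):
    """Stand-in for the SOS check: require pointwise risk <= delta at
    evenly spaced points on the segment a-b."""
    for i in range(steps + 1):
        t = i / steps
        p = (a[0] + t * (b[0] - a[0]), a[1] + t * (b[1] - a[1]))
        if point_risk(p) > delta:
            return False
    return True

def sample_states(cur, target, n, rng, step=0.1, spread=1.0):
    """Stand-in for the neural sampler: fixed-length steps whose direction
    is the heading toward the target plus Gaussian noise."""
    heading = math.atan2(target[1] - cur[1], target[0] - cur[0])
    states = []
    for _ in range(n):
        ang = heading + rng.gauss(0.0, spread)
        states.append((cur[0] + step * math.cos(ang),
                       cur[1] + step * math.sin(ang)))
    return states

def estimate_cost(cur, cand, target):
    """Stand-in for the neural cost estimator: step length plus
    straight-line distance from the candidate to the target."""
    return dist(cur, cand) + dist(cand, target)

def plan(start, goal, delta=0.05, n_samples=32, max_iters=200, seed=0):
    """Grow one branch from the start and one from the goal, alternating."""
    rng = random.Random(seed)
    branches = [[start], [goal]]
    active = 0  # index of the branch extended in this iteration
    for _ in range(max_iters):
        cur, target = branches[active][-1], branches[1 - active][-1]
        # Try to join the two branch tips with a risk-bounded connection.
        if edge_is_safe(cur, target, delta):
            return branches[0] + branches[1][::-1]
        # Sample candidates, keep risk-bounded ones, pick the cheapest.
        cands = sample_states(cur, target, n_samples, rng)
        safe = [c for c in cands if edge_is_safe(cur, c, delta)]
        if safe:
            best = min(safe, key=lambda c: estimate_cost(cur, c, target))
            branches[active].append(best)
        active = 1 - active  # switch to the other branch
    return None  # no path found within the iteration budget

if __name__ == "__main__":
    print(plan((0.1, 0.5), (0.9, 0.5)))
```

Here `delta` plays the role of the risk tolerance. The pointwise check on discretized segment points is far weaker than a certified bound over the whole connection, so this sketch shows only the control flow, not the guarantee.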

Learning-Based Risk-Bounded Path Planning Under Environmental Uncertainty
