Abstract:Efficiently finding safe and feasible trajectories for mobile objects is a critical field in robotics and computer science. In this paper, we propose SIL-RRT*, a novel learning-based motion planning algorithm that extends the RRT* algorithm by using a deep neural network to predict a distribution for sampling at each iteration. We evaluate SIL-RRT* on various 2D and 3D environments and establish that it can efficiently solve high-dimensional motion planning problems with fewer samples than traditional sampling-based algorithms. Moreover, SIL-RRT* is able to scale to more complex environments, making it a promising approach for solving challenging robotic motion planning problems.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: in a high - dimensional environment, how to efficiently find safe and feasible trajectories for moving objects. Specifically, the authors proposed the SIL - RRT* algorithm, aiming to improve the traditional RRT* algorithm through deep - learning techniques, in order to reduce the number of required samples and improve the efficiency of path planning in complex environments. ### Problem Background In the fields of robotics and computer science, motion planning is a key research direction, and its goal is to find a safe and feasible trajectory for a robot from an initial state to a target state. As the complexity of robot applications increases, the motion - planning problem requires algorithms that are both computationally feasible and efficient. Although traditional sampling - based algorithms (such as PRM, RRT, and RRT*) are effective, they often require a large number of samples to find a feasible solution in a high - dimensional environment. ### Research Motivation To improve sampling efficiency, some studies have introduced heuristic - biased samplers, such as BIT* and Informed RRT*. However, these methods still have limitations in computational efficiency. Therefore, this paper proposes a new learning - based motion - planning algorithm - SIL - RRT*, which uses deep - learning techniques, especially the Transformer architecture, to predict the sampling distribution in each iteration, thereby more effectively solving the high - dimensional motion - planning problem. ### Main Contributions 1. **Introduction of the SIL - RRT* algorithm**: Combining deep - learning and self - imitation learning (Self Imitation Learning), a new sampling - based motion - planning algorithm is proposed. 2. **Use of the Transformer architecture**: The Transformer model is used to process point - cloud data, capturing the relationship between the state space and the path, enhancing the flexibility and extensibility of the algorithm. 3. **Weighted self - imitation learning (WSIL)**: By dynamically selecting high - quality solutions, the sample utilization rate during the training process is improved, especially suitable for high - dimensional environments. 4. **Experimental verification**: Through experiments in various 2D and 3D environments, the effectiveness of SIL - RRT* in reducing the number of samples and improving the path quality is proven. ### Mathematical Formula Representation - **Definition of the optimal path**: \[ \tau^*=\arg\min_{\tau\in T} c(\tau) \] where \(c(\tau)\) is the cost function of the path, and \(T\) is the set of all collision - free paths. - **Sampler loss function**: \[ L_{\text{sampler}} = -\frac{1}{BN}\sum_{i = 1}^{N}\log\pi_\theta(x_i|p, g, x_{i - 1}) \] - **Estimator loss function**: \[ L_{\text{estimator}}=\frac{1}{2}\|C_{\text{real}}-C_{\text{est}}\|^2 \] - **Weighted self - imitation learning weight**: \[ w=\frac{1}{1+e^{(C_{\text{real}}-C_{\text{est}}-K)}} \] Through these innovations, SIL - RRT* not only reduces the number of required samples but also significantly improves the path - planning performance in complex environments.

SIL-RRT*: Learning Sampling Distribution through Self Imitation Learning

Custom Distribution for Sampling-Based Motion Planning

AM-RRT*: Informed Sampling-based Planning with Assisting Metric

On-Road Trajectory Planning With Spatio-Temporal Rrt* And Always-Feasible Quadratic Program

Asymptotical RRT-Based Path Planning for Mobile Robots in Dynamic Environments

Long-Horizon Motion Planning via Sampling and Segmented Trajectory Optimization

RL-RRT: Kinodynamic Motion Planning via Learning Reachability Estimators from RL Policies

Bi-AM-RRT*: A Fast and Efficient Sampling-Based Motion Planning Algorithm in Dynamic Environments

Efficient Sampling-Based Motion Planning for On-Road Autonomous Driving

Greedy Heuristics for Sampling-based Motion Planning in High-Dimensional State Spaces

MOD-RRT*: A Sampling-Based Algorithm for Robot Path Planning in Dynamic Environment

An Adaptive Multi-Rrt Approach for Robot Motion Planning

Extended RRT-based Path Planning for Flying Robots in Complex 3D Environments with Narrow Passages

Efficient Robot Motion Planning Via Sampling and Optimization

Dynamic Region-biased Rapidly-exploring Random Trees

Robotic Motion Planning using Learned Critical Sources and Local Sampling

A Fast RRT Algorithm for Motion Planning of Autonomous Road Vehicles

Experience Learning From Basic Patterns for Efficient Robot Navigation in Indoor Environments

Informed Sampling for Asymptotically Optimal Path Planning (Consolidated Version)

NAMR-RRT: Neural Adaptive Motion Planning for Mobile Robots in Dynamic Environments