Abstract:Traditional optimization-based planners, while effective, suffer from high computational costs, resulting in slow trajectory generation. A successful strategy to reduce computation time involves using Imitation Learning (IL) to develop fast neural network (NN) policies from those planners, which are treated as expert demonstrators. Although the resulting NN policies are effective at quickly generating trajectories similar to those from the expert, (1) their output does not explicitly account for dynamic feasibility, and (2) the policies do not accommodate changes in the constraints different from those used during training.

What problem does this paper attempt to address?

This paper proposes a solution to the high computational cost and dynamic feasibility issues in traditional optimization-based trajectory planning methods for obstacle avoidance and UAV (Unmanned Aerial Vehicle) trajectory planning. To reduce computation time, the researchers use imitation learning (IL) to train a fast neural network (NN) policy to imitate these high-cost planners. However, the NN policy in this approach may not consider dynamic feasibility and cannot adapt to different constraints during training. To overcome these limitations, the paper introduces the Constrained-Guided Diffusion (CGD) approach, which is an innovative IL-based trajectory planning method. CGD combines diffusion strategies with an efficiently solvable surrogate optimization problem to generate collision-free and dynamically feasible trajectories. It decomposes the originally complex optimization problem into two more manageable sub-problems: finding collision avoidance paths and determining the time parametrization of these paths to obtain trajectories. Compared to traditional neural network architectures, CGD demonstrates significantly improved performance and dynamic feasibility in scenarios with new constraints. The key ideas of CGD include: 1. Using diffusion models to capture multimodal path distributions and prevent mode collapse. 2. Modifying intermediate trajectories through block coordinate descent to encourage constraint satisfaction. 3. Iteratively adjusting time parametrization and solving quadratic programming (QP) to ensure collision avoidance and dynamic feasibility. The paper also discusses the advantages of CGD over other methods, such as its ability to handle time-varying constraints and the trade-off between performance and computation time demonstrated in agile UAV obstacle avoidance tasks simulated. The related work section mentions existing neural network methods that guarantee constraint satisfaction and diffusion model methods with constraint satisfaction. CGD modifies the intermediate trajectories generated by the diffusion model through a block coordinate descent heuristic approach to promote constraint satisfaction while addressing the problem of different constraints during deployment compared to training. In summary, the paper aims to address how to create a fast, dynamically feasible, and adaptable UAV trajectory planning method through imitation learning and innovative optimization strategies.

CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning

Efficient Optimization-Based Trajectory Planning for Unmanned Systems in Confined Environments

PEP: Policy-Embedded Trajectory Planning for Autonomous Driving

Automatic Parameter Adaptation for Quadrotor Trajectory Planning

Perception-Aware Based UAV Trajectory Planner Via Generative Adversarial Self-Imitation Learning from Demonstrations

Learning-Initialized Trajectory Planning in Unknown Environments

Multi-UAV Trajectory Planning Using Gradient-Based Sequence Minimal Optimization.

Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following

Differential Flatness-based Fast Trajectory Planning for Fixed-wing Unmanned Aerial Vehicles

Guided Policy Search using Sequential Convex Programming for Initialization of Trajectory Optimization Algorithms

Deep Reinforcement Learning Based Trajectory Real-Time Planning for Hypersonic Gliding Vehicles

UAV Path Planning Based on Multicritic-Delayed Deep Deterministic Policy Gradient

Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning

Three-Dimension Trajectory Design for Multi-UAV Wireless Network With Deep Reinforcement Learning

Learning to Plan Maneuverable and Agile Flight Trajectory with Optimization Embedded Networks

Control-Aware Trajectory Predictions for Communication-Efficient Drone Swarm Coordination in Cluttered Environments

Deep reinforcement learning-based reactive trajectory planning method for UAVs

Learning Trajectories for Real- Time Optimal Control of Quadrotors.

Real-Time On-the-Fly Motion Planning for Urban Air Mobility via Updating Tree Data of Sampling-Based Algorithms Using Neural Network Inference

Constraint-Aware Diffusion Models for Trajectory Optimization

Trajectory Planning for Unmanned Aerial Vehicles in Complicated Urban Environments: A Control Network Approach