CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning

Kota Kondo,Andrea Tagliabue,Xiaoyi Cai,Claudius Tewari,Olivia Garcia,Marcos Espitia-Alvarez,Jonathan P. How
2024-05-03
Abstract:Traditional optimization-based planners, while effective, suffer from high computational costs, resulting in slow trajectory generation. A successful strategy to reduce computation time involves using Imitation Learning (IL) to develop fast neural network (NN) policies from those planners, which are treated as expert demonstrators. Although the resulting NN policies are effective at quickly generating trajectories similar to those from the expert, (1) their output does not explicitly account for dynamic feasibility, and (2) the policies do not accommodate changes in the constraints different from those used during training.
Robotics,Machine Learning,Systems and Control
What problem does this paper attempt to address?
This paper proposes a solution to the high computational cost and dynamic feasibility issues in traditional optimization-based trajectory planning methods for obstacle avoidance and UAV (Unmanned Aerial Vehicle) trajectory planning. To reduce computation time, the researchers use imitation learning (IL) to train a fast neural network (NN) policy to imitate these high-cost planners. However, the NN policy in this approach may not consider dynamic feasibility and cannot adapt to different constraints during training. To overcome these limitations, the paper introduces the Constrained-Guided Diffusion (CGD) approach, which is an innovative IL-based trajectory planning method. CGD combines diffusion strategies with an efficiently solvable surrogate optimization problem to generate collision-free and dynamically feasible trajectories. It decomposes the originally complex optimization problem into two more manageable sub-problems: finding collision avoidance paths and determining the time parametrization of these paths to obtain trajectories. Compared to traditional neural network architectures, CGD demonstrates significantly improved performance and dynamic feasibility in scenarios with new constraints. The key ideas of CGD include: 1. Using diffusion models to capture multimodal path distributions and prevent mode collapse. 2. Modifying intermediate trajectories through block coordinate descent to encourage constraint satisfaction. 3. Iteratively adjusting time parametrization and solving quadratic programming (QP) to ensure collision avoidance and dynamic feasibility. The paper also discusses the advantages of CGD over other methods, such as its ability to handle time-varying constraints and the trade-off between performance and computation time demonstrated in agile UAV obstacle avoidance tasks simulated. The related work section mentions existing neural network methods that guarantee constraint satisfaction and diffusion model methods with constraint satisfaction. CGD modifies the intermediate trajectories generated by the diffusion model through a block coordinate descent heuristic approach to promote constraint satisfaction while addressing the problem of different constraints during deployment compared to training. In summary, the paper aims to address how to create a fast, dynamically feasible, and adaptable UAV trajectory planning method through imitation learning and innovative optimization strategies.