Abstract:Denoising Diffusion Probabilistic Models (DDPMs) are powerful generative deep learning models that have been very successful at image generation, and, very recently, in path planning and control. In this paper, we investigate how to leverage the generalization and conditional-sampling capabilities of DDPMs to generate complex paths for a robotic end effector. We show that training a DDPM with synthetical and low-quality demonstrations is sufficient for generating nontrivial paths reaching arbitrary targets and avoiding obstacles. Additionally, we investigate different strategies for conditional sampling combining classifier-free and classifier-guided approaches. Eventually, we deploy the DDPM in a receding-horizon control scheme to enhance its planning capabilities. The Denoising Diffusion Planner is experimentally validated through various experiments on a Franka Emika Panda robot.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is: how to use Denoising Diffusion Probabilistic Models (DDPMs) to generate complex and obstacle - avoiding end - effector paths of robots when only using low - quality demonstration data. Specifically, the paper explores the following points: 1. **Path planning problems**: Traditional path - planning methods (such as combinatorial algorithms and sampling - based methods) can ensure finding solutions, but they are less efficient in complex environments. Deep Reinforcement Learning (DRL) methods can handle high - dimensional problems, but their prediction errors will accumulate over time. Therefore, a new method is needed to efficiently generate complex paths and avoid obstacles. 2. **Application of DDPM**: DDPM, as a powerful generative model, has achieved remarkable success in the field of image generation. The paper proposes to apply DDPM to path planning and generate paths that meet specific conditions (such as obstacle avoidance) through conditional sampling. In particular, the paper explores how to train DDPM using low - quality synthetic data and verify its performance in the real world. 3. **Closed - loop planning**: In order to improve the quality of path planning, the paper introduces a closed - loop planning strategy, that is, continuously updating the path planning during each execution, similar to Model Predictive Control (MPC). This helps to deal with the problem of the reduction in the weight of future postures due to reward discounting. ### Specific problem descriptions - **Path generation**: How to use DDPM to generate complex paths from the starting pose to the target pose while avoiding obstacles. - **Low - quality data training**: How to train DDPM using only low - quality synthetic data (such as straight - line paths) so that it can generate complex obstacle - avoiding paths. - **Conditional sampling**: How to generate paths that meet specific conditions through conditional sampling (such as classifier - free guidance and cost - guidance). - **Closed - loop control**: How to improve the quality and robustness of path planning through the closed - loop planning strategy. ### Solutions The paper proposes a method named "Denoising Diffusion Planner (DDP)", and its main contributions include: 1. **No need to rely on the robot dynamics model**: DDP can directly learn from low - quality synthetic data without the need for an accurate robot dynamics model. 2. **Training with only low - quality synthetic data**: DDP can be trained using only simple straight - line paths, thus simplifying the data collection process. 3. **Verification through real - world experiments**: The effectiveness and flexibility of DDP are verified through experiments on the Franka Emika Panda robot. ### Conclusions This paper proposes a novel path - planning method by combining DDPM, conditional sampling, and closed - loop control. The experimental results show that DDP can generate complex and obstacle - avoiding robot paths when only using low - quality synthetic data and exhibits good performance in the real world.

Denoising Diffusion Planner: Learning Complex Paths from Low-Quality Demonstrations

Multi-Step Denoising Scheduled Sampling: Towards Alleviating Exposure Bias for Diffusion Models

Trajectory Generation, Control, and Safety with Denoising Diffusion Probabilistic Models

Denoising Heat-inspired Diffusion with Insulators for Collision Free Motion Planning

Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following

Prediction with Action: Visual Policy Learning via Joint Denoising Process

CoBL-Diffusion: Diffusion-Based Conditional Robot Planning in Dynamic Environments Using Control Barrier and Lyapunov Functions

Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models

One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation

SafeDiffuser: Safe Planning with Diffusion Probabilistic Models

Denoising Diffusion-Based Control of Nonlinear Systems

Multi-Agent Path Planning based on MPC and DDPG

DARE: Diffusion Policy for Autonomous Robot Exploration

Diffusion Meets Options: Hierarchical Generative Skill Composition for Temporally-Extended Tasks

Think While You Generate: Discrete Diffusion with Planned Denoising

DiPPeR: Diffusion-based 2D Path Planner applied on Legged Robots

Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction

DiMSam: Diffusion Models as Samplers for Task and Motion Planning under Partial Observability

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

A decentralized path planning model based on deep reinforcement learning

Adaptive Online Replanning with Diffusion Models