Trajectory Grid Diffusion for Multimodal Trajectory Prediction in Autonomous Vehicles

Jincheng Wang,Jiayu Guo,Mingyue Feng,Chengjun Li,Xiangyang Xue,Jian Pu
DOI: https://doi.org/10.1109/tiv.2024.3495037
IF: 8.2
2024-01-01
IEEE Transactions on Intelligent Vehicles
Abstract:Trajectory prediction is a critical topic for safe autonomous driving. Recently, diffusion models have gained increasing attention as a promising approach for learning multimodal trajectory distribution. However, existing diffusion models for trajectory prediction tasks are mainly based on the modality of trajectory points, which exhibit rapidly increasing uncertainty over the temporal dimension in vehicle prediction. To overcome this limitation, we propose leveraging the bird's-eye-view (BEV) trajectory grids to represent trajectory in different time horizons. The diffusion on such representation will benefit from its robust context representation capabilities and inherently lower variance disparity among spatial pixel variables. To alleviate systematic errors arising from the limited resolution of rasterized images, Trajectory Refiner is employed to take as input rough trajectories extracted from the denoised BEV images and output accurate trajectory. Our method is rigorously tested on benchmark datasets including ETH/UCY and nuScence, demonstrating superior performance in long-term prediction accuracy compared to current diffusion-based trajectory prediction methods. Additionally, our experiments show that diffusion on trajectory grid can accelerate inference process with fewer denoising steps compared with trajectory points
What problem does this paper attempt to address?