CSIR: Cascaded Sliding CVAEs with Iterative Socially-Aware Rethinking for Trajectory Prediction

Hao Zhou,Xu Yang,Dongchun Ren,Hai Huang,Mingyu Fan
DOI: https://doi.org/10.1109/tits.2023.3300730
IF: 8.5
2023-01-01
IEEE Transactions on Intelligent Transportation Systems
Abstract:Pedestrian trajectory prediction is a hot research topic in many applications, such as video surveillance and autonomous driving. Although many efforts have been done on this topic, there are still many challenges, including accumulated prediction errors, insufficient training data usage, and future-past incompatibility. To overcome these challenges, we propose a novel trajectory prediction method, called CSIR, which consists of a cascaded sliding conditional variational autoencoder (CS-CVAE) module and an iterative future-past social compatible rethinking (I-SCR) module. The CS-CVAE module reduces the accumulated prediction errors by using cascaded prediction models for the early future time steps. In this way, the training losses of the early time steps are separately considered and minimized from the later losses. For the following time steps in CS-CVAE, a sliding prediction model with a longer observation time span is used and additional data from the future time span can be collected for training. On the other hand, the I-SCR module generates offsets to improve the predictions iteratively by checking the interaction compatibility between the predicted trajectories and the past trajectories, which resembles with the human rethinking mechanism in motion planning. Experiments results on two widely explored pedestrian trajectory prediction datasets, Stanford Drone Dataset (SDD) and ETH/UCY, show that the proposed method surpasses previous state-of-the-art methods by notable margins.
What problem does this paper attempt to address?