Agents-Separated Prediction-Former: A Novel Multi-Agent Trajectory Prediction Model Based on Transformer with 2D Input

Tianchu Zeng, Ning Gui, Jianming Hu, Yi Zhang
DOI: https://doi.org/10.1061/9780784484326.018
2022-01-01
Abstract: Trajectory prediction has long been a basic task in traffic situation awareness. However, challenges in interaction, continuity, and multi-modal behavior must be handled to achieve more accurate prediction. To obtain better performance, previous works have incorporated recurrent neural networks, graph neural networks, and even transformers into their models. Some models construct temporal and spatial features independently and then combine the processed features to predict trajectories; other models construct these features together and add modules to distinguish them in pursuit of better results. The former, however, may not train their temporal and spatial modules jointly, so they may neglect connections between an agent's temporal and spatial features that are supposed to be related, which separates the features and degrades performance. The latter, which encode temporal and spatial features simultaneously, are compelled to add functional modules to identify which history belongs to which agent. To address these disadvantages, we propose a novel agents-separated prediction-former. We argue that the input can be given in two-dimensional form rather than as a sequence, which makes it easy to distinguish the agents' identities. Since only one scene can be fed in at a time, we incorporate agents' velocities through differencing. We then make the model auto-regressive to ensure the temporal continuity of the trajectories. Thus, our model can jointly incorporate temporal and spatial features while identifying each agent's history. Building on this idea, we construct our final trajectory predictor with a probabilistic model whose probabilistic outputs generate multi-modal trajectories that accord with reality. Experiments show that our model improves performance on several previously proposed data sets.
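The abstract does not give implementation details, so the following is only a minimal sketch of three ideas it names: a two-dimensional (agents by time steps) input layout, velocities obtained by differencing positions, and an auto-regressive rollout. The array shapes, the function names, and the constant-velocity step that stands in for the learned transformer predictor are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def velocities_from_positions(positions, dt=1.0):
    """Finite-difference velocities for a scene laid out as (agents, steps, xy).

    Keeping one row per agent is the assumed "2D input" layout: an agent's
    identity is its row index, so no extra identification module is needed.
    """
    diffs = np.diff(positions, axis=1) / dt           # shape (agents, steps - 1, 2)
    first = np.zeros_like(positions[:, :1, :])        # no velocity known at step 0
    return np.concatenate([first, diffs], axis=1)     # shape (agents, steps, 2)


def constant_velocity_step(positions, velocities, dt=1.0):
    """Hypothetical stand-in for the learned predictor: extrapolate one step."""
    return positions[:, -1, :] + velocities[:, -1, :] * dt


def autoregressive_rollout(positions, horizon, dt=1.0):
    """Predict `horizon` future steps, feeding each prediction back as input."""
    history = positions.copy()
    for _ in range(horizon):
        vel = velocities_from_positions(history, dt)
        nxt = constant_velocity_step(history, vel, dt)
        # Append the new step so the next iteration conditions on it.
        history = np.concatenate([history, nxt[:, None, :]], axis=1)
    return history[:, positions.shape[1]:, :]


# Two agents observed for three steps: one moving along x, one along y.
scene = np.array([
    [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]],
    [[0.0, 0.0], [0.0, 2.0], [0.0, 4.0]],
])
future = autoregressive_rollout(scene, horizon=2)
```

In a real model the stand-in step would be replaced by a network that outputs a probability distribution over the next position, from which multi-modal trajectories could be sampled; the feedback loop itself would stay the same.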