Multimodal Forward Generation Transformer Network for Inconspicuous Pedestrian Trajectory Prediction

Ang Feng, Ruiqi Qiu, Jinglong Wang, Jun Gong, Yang Yi, Mingtao Dong
DOI: https://doi.org/10.1109/lra.2024.3351002
IF: 5.2
2024-03-01
IEEE Robotics and Automation Letters
Abstract: Predicting pedestrians' future trajectories is a key challenge in the ego-centric view of autonomous driving systems. Most current methods struggle to capture subtle changes in features while keeping the model lightweight. To solve this problem, we propose a multimodal forward generation transformer network based on an encoder-decoder structure. Unlike the traditional transformer, we improve on layer normalization by proposing frame normalization, which captures minute time-variant properties more effectively. In addition, we believe that considering pedestrians' short-term future goals can help the ego-vehicle predict more accurate and reasonable long-term pedestrian trajectories. Therefore, based on the idea of forward generation, the decoder considers the short-term future targets and uses a trajectory-time correlation module to capture the relationship between the estimated short-term future goals and the global spatial-temporal context cues of the historical trajectory. Our model is evaluated on the JAAD and PIE datasets and achieves state-of-the-art performance while maintaining a lightweight model size.
robotics