Modeling social interaction and intention for pedestrian trajectory prediction

Kai Chen,Xiao Song,Xiaoxiang Ren
DOI: https://doi.org/10.1016/j.physa.2021.125790
2021-05-01
Abstract:<p>Future pedestrian trajectory prediction offers great prospects for many practical applications. Most existing methods focus on social interaction among pedestrians but ignore the fact that in addition to pedestrians there are other kinds of objects (cars, dogs, bicycles, motorcycles, etc.) with a great influence on the subject pedestrian's future trajectory. Most existing methods neglect the intentions of the pedestrian, which can be obtained by the key points of the subject pedestrian's face. Therefore, rich category information about the subject pedestrian's surroundings and face key points plays a great role in promoting the modeling of pedestrian movement. Motivated by this idea, this paper tries to predict a pedestrian's future trajectory by jointly using various categories and the relative positions of the subject pedestrian's surroundings and the key points in his face. We propose a data modeling method to effectively unify rich visual features about categories, interaction and face key points into a multi-channel tensor and build an end-to-end fully convolutional encoder–decoder attention model based on convolutional long-short-term memory utilizing this tensor. We evaluate and compare our method with several existing methods on 5 crowded video sequences from the public dataset multi-object tracking (MOT) -16. Experimental results show that our method outperforms state-of-the-art approaches, with less prediction error.</p>
What problem does this paper attempt to address?