GoalNet: Goal Areas Oriented Pedestrian Trajectory Prediction

Ching-Lin Lee, Zhi-Xuan Wang, Kuan-Ting Lai, Amar Fadillah
2024-02-29
Abstract: Predicting the future trajectories of pedestrians on the road is an important task for autonomous driving. Pedestrian trajectory prediction is a multi-modal problem, because future motion depends on scene paths and on the pedestrian's intentions and decision-making. Most recent studies predict a variety of potential future trajectory distributions from past trajectories alone, without accounting for the scene context and pedestrian targets. Instead of predicting the future trajectory directly, we propose to use scene context and the observed trajectory to predict the goal points first, and then reuse the goal points to predict the future trajectories. By leveraging the information from scene context and the observed trajectory, the uncertainty can be limited to a few target areas, which represent the "goals" of the pedestrians. In this paper, we propose GoalNet, a new trajectory prediction neural network based on the goal areas of a pedestrian. Our network can predict both pedestrians' trajectories and bounding boxes. The overall model is efficient and modular, and its outputs can be changed according to the usage scenario. Experimental results show that GoalNet significantly improves on the previous state-of-the-art performance, by 48.7% on the JAAD dataset and 40.8% on the PIE dataset.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the problem of accurately predicting pedestrian trajectories in autonomous driving scenarios. It proposes a new multimodal trajectory prediction model, GoalNet, which uses scene context and observed trajectories to predict pedestrians' goal points. Unlike most previous studies, which predict future trajectory distributions directly from past trajectories, GoalNet first predicts goal points and then uses these goal points to predict future trajectories. This confines the uncertainty to a few target areas that represent the pedestrians' "goals," which in turn improves prediction accuracy.

The main contributions of the paper are:
1. A new multimodal trajectory prediction model, GoalNet, which improves on previous multimodal trajectory prediction performance by 48.7% on the JAAD dataset and 40.8% on the PIE dataset.
2. An efficient design based on Convolutional Neural Networks (CNNs) that integrates several advanced convolutional structures and methods, in contrast to traditional Conditional Variational Autoencoder (CVAE) or RNN-based models, together with a study of the impact of each sub-network in GoalNet.
3. A method to predict the corresponding bounding box trajectories from center point trajectories.

These results are most relevant to autonomous driving systems, where anticipating pedestrian movement early can help a vehicle avoid potential collisions.
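
The goal-first pipeline summarized above (goal points, then trajectories, then bounding boxes from center points) can be illustrated with a minimal Python sketch. This is not the authors' implementation: GoalNet uses learned CNN sub-networks, whereas here every learned component is replaced by a simple stand-in so that only the data flow is visible. The goal heatmap is assumed to be given, the trajectory decoder is replaced by linear interpolation toward each goal, and the bounding boxes are assumed to keep a constant size. All function names (`top_k_goals`, `trajectory_to_goal`, `boxes_from_centers`, `predict`) are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2)


def top_k_goals(heatmap: List[List[float]], k: int) -> List[Point]:
    """Return the (x, y) cells with the k highest scores in a goal heatmap.

    Assumption: the heatmap stands in for the output of a goal-area network.
    """
    cells = [
        (score, x, y)
        for y, row in enumerate(heatmap)
        for x, score in enumerate(row)
    ]
    cells.sort(key=lambda c: c[0], reverse=True)
    return [(float(x), float(y)) for _, x, y in cells[:k]]


def trajectory_to_goal(last_obs: Point, goal: Point, horizon: int) -> List[Point]:
    """Linearly interpolate `horizon` future points that end at the goal.

    Assumption: a straight line replaces the learned, goal-conditioned decoder.
    """
    (x0, y0), (gx, gy) = last_obs, goal
    return [
        (x0 + (gx - x0) * t / horizon, y0 + (gy - y0) * t / horizon)
        for t in range(1, horizon + 1)
    ]


def boxes_from_centers(centers: List[Point], width: float, height: float) -> List[Box]:
    """Build one box per center point.

    Assumption: the box size stays constant over the prediction horizon.
    """
    half_w, half_h = width / 2.0, height / 2.0
    return [(cx - half_w, cy - half_h, cx + half_w, cy + half_h) for cx, cy in centers]


def predict(heatmap, last_obs, box_size, horizon=4, k=2):
    """Return one (goal, trajectory, boxes) hypothesis per candidate goal."""
    width, height = box_size
    hypotheses = []
    for goal in top_k_goals(heatmap, k):
        trajectory = trajectory_to_goal(last_obs, goal, horizon)
        boxes = boxes_from_centers(trajectory, width, height)
        hypotheses.append((goal, trajectory, boxes))
    return hypotheses


if __name__ == "__main__":
    # Toy 4x3 heatmap with two plausible goal areas.
    toy_heatmap = [
        [0.0, 0.1, 0.0, 0.0],
        [0.0, 0.0, 0.0, 0.7],
        [0.2, 0.0, 0.0, 0.0],
    ]
    for goal, trajectory, boxes in predict(toy_heatmap, (1.0, 1.0), (1.0, 2.0)):
        print(goal, trajectory[-1], boxes[-1])
```

Each candidate goal yields one trajectory hypothesis, which is how restricting the uncertainty to a few goal areas produces a small set of multimodal predictions instead of an unconstrained distribution.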