MapsTP: HD Map Images Based Multimodal Trajectory Prediction for Automated Vehicles

Sushil Sharma, Arindam Das, Ganesh Sistu, Mark Halton, Ciarán Eising
2024-10-01
Abstract: Predicting ego vehicle trajectories remains a critical challenge, especially in urban and dense areas due to the unpredictable behaviours of other vehicles and pedestrians. Multimodal trajectory prediction enhances decision-making by considering multiple possible future trajectories based on diverse sources of environmental data. In this approach, we leverage ResNet-50 to extract image features from high-definition map data and use IMU sensor data to calculate speed, acceleration, and yaw rate. A temporal probabilistic network is employed to compute potential trajectories, selecting the most accurate and highly probable trajectory paths. This method integrates HD map data to improve the robustness and reliability of trajectory predictions for autonomous vehicles.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to address the challenge of predicting the trajectory of an autonomous vehicle (i.e., the ego vehicle) in complex urban environments. Specifically, the paper focuses on how to utilize high-definition maps (HD maps) and sensor data to improve the accuracy and reliability of trajectory prediction, especially in cases where the behavior of pedestrians and other vehicles is unpredictable.

### Main Problems

1. **Trajectory Prediction in Complex Environments**:
   - In urban and dense areas, due to the unpredictable behavior of other vehicles and pedestrians, the trajectory prediction of the ego vehicle becomes particularly difficult.
   - Traditional physics-based methods perform well in short-term prediction but not in long-term prediction because they cannot handle the influence of external factors (such as weather, traffic conditions, and driver intentions) well.
2. **Multi-modal Trajectory Prediction**:
   - A single trajectory prediction method may not be sufficient to deal with complex traffic scenarios, so multiple possible future trajectories need to be considered.
   - Multi-modal prediction can consider different behavior patterns (such as changes in speed and direction), thereby improving the robustness of prediction.
3. **Fusion of Multiple Data Sources**:
   - High-definition maps provide detailed environmental information, such as lane markings, traffic signs, and road shapes.
   - Sensor data (such as the speed, acceleration, and yaw rate provided by IMU sensors) provide real-time state information of the vehicle.
   - How to effectively fuse these data from different sources is a key issue.

### Solutions

The paper proposes a model named MapsTP, which solves the above problems in the following ways:

- **Feature Extraction**: Use ResNet-50 to extract features from high-definition map images and combine IMU sensor data to calculate the vehicle's speed, acceleration, and yaw rate.
- **Spatio-Temporal Probability Network (STPN)**: Calculate multiple potential trajectories through a spatio-temporal probability network, assign probabilities to each trajectory, and select the most likely path.
- **Multi-modal Prediction**: Consider multiple possible future trajectories to deal with uncertainty and complex behavior patterns.

### Summary

The main objective of the paper is to develop a model that can accurately predict the trajectory of the ego vehicle by fusing high-definition map and sensor data, thereby improving the safety and efficiency of autonomous vehicles in complex urban environments.
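
The step of assigning probabilities to candidate trajectories and selecting the most likely path can be illustrated with a minimal sketch. This is not the paper's implementation: the function names (`softmax`, `select_top_k`), the raw per-mode scores, and the use of a softmax to turn scores into probabilities are all assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple

# A trajectory is a list of (x, y) waypoints; the units are assumed to be metres.
Trajectory = List[Tuple[float, float]]


def softmax(scores: Sequence[float]) -> List[float]:
    """Convert raw per-mode scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_top_k(
    trajectories: Sequence[Trajectory],
    scores: Sequence[float],
    k: int = 1,
) -> List[Tuple[float, Trajectory]]:
    """Return the k most probable (probability, trajectory) pairs.

    Hypothetical helper: `scores` stands in for whatever per-mode
    output a prediction network produces.
    """
    if len(trajectories) != len(scores):
        raise ValueError("need exactly one score per trajectory")
    probs = softmax(scores)
    ranked = sorted(zip(probs, trajectories), key=lambda p: p[0], reverse=True)
    return ranked[:k]


# Three made-up candidate modes: go straight, drift left, drift right.
candidates: List[Trajectory] = [
    [(0.0, 0.0), (5.0, 0.0), (10.0, 0.0)],
    [(0.0, 0.0), (5.0, 1.0), (9.0, 3.0)],
    [(0.0, 0.0), (5.0, -1.0), (9.0, -3.0)],
]
mode_scores = [2.0, 0.5, -1.0]  # illustrative values, not real model output

best_prob, best_trajectory = select_top_k(candidates, mode_scores, k=1)[0]
```

Setting `k` above 1 keeps several ranked modes instead of a single path, which matches the multi-modal framing where a downstream planner may want more than one hypothesis.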