MacFormer: Map-Agent Coupled Transformer for Real-time and Robust Trajectory Prediction

Chen Feng,Hangning Zhou,Huadong Lin,Zhigang Zhang,Ziyao Xu,Chi Zhang,Boyu Zhou,Shaojie Shen
DOI: https://doi.org/10.1109/LRA.2023.3311351
2023-08-31
Abstract:Predicting the future behavior of agents is a fundamental task in autonomous vehicle domains. Accurate prediction relies on comprehending the surrounding map, which significantly regularizes agent behaviors. However, existing methods have limitations in exploiting the map and exhibit a strong dependence on historical trajectories, which yield unsatisfactory prediction performance and robustness. Additionally, their heavy network architectures impede real-time applications. To tackle these problems, we propose Map-Agent Coupled Transformer (MacFormer) for real-time and robust trajectory prediction. Our framework explicitly incorporates map constraints into the network via two carefully designed modules named coupled map and reference extractor. A novel multi-task optimization strategy (MTOS) is presented to enhance learning of topology and rule constraints. We also devise bilateral query scheme in context fusion for a more efficient and lightweight network. We evaluated our approach on Argoverse 1, Argoverse 2, and nuScenes real-world benchmarks, where it all achieved state-of-the-art performance with the lowest inference latency and smallest model size. Experiments also demonstrate that our framework is resilient to imperfect tracklet inputs. Furthermore, we show that by combining with our proposed strategies, classical models outperform their baselines, further validating the versatility of our framework.
Computer Vision and Pattern Recognition,Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to achieve real - time and robust trajectory prediction in the field of autonomous vehicles. Specifically, the paper points out that existing methods have limitations in using map information and mainly rely on historical trajectories, resulting in poor prediction performance and robustness. In addition, the network architectures of these methods are heavy and it is difficult to achieve real - time applications. To overcome these problems, the paper proposes a new framework named Map - Agent Coupled Transformer (MacFormer), aiming to improve the accuracy and robustness of trajectory prediction by directly coupling map constraints with agent motion while maintaining lightweight and efficient characteristics. ### Main Contributions: 1. **Map - Agent Coupled Framework**: A new map - agent coupled framework is proposed, which can effectively integrate map constraints into the system, achieving this goal through coupling the map and reference extractors and the multi - task optimization strategy (MTOS). 2. **Efficient Context Fusion Scheme**: A bilateral query scheme is designed, allowing parallel context fusion between the map and the agent, thus significantly reducing the time and space complexity. 3. **State - of - the - Art Performance**: Evaluations are carried out on multiple large - scale real - world benchmark datasets (such as Argoverse 1&2 and nuScenes), and the results show that MacFormer not only achieves state - of - the - art performance but also has lower inference latency and fewer parameters. ### Method Overview: - **Coupled Layer**: Extract historical motion features and construct a coupled map, and calculate the historical relative motion of the map and the agent at each timestamp. - **Reference Extractor**: Learn map - related references from the coupled map features to guide the spatial distribution of the predicted trajectory. - **Multi - Task Optimization Strategy (MTOS)**: Ensure that the prediction results are affected by map constraints through the multi - task optimization strategy, including the coupled motion task, the motion capture task, and the main prediction task. - **Bilateral Query**: Promote effective context fusion between the map and the agent through the bilateral query strategy to improve computational efficiency. ### Experimental Results: - **Performance Improvement**: On multiple benchmark datasets, MacFormer achieves the lowest inference latency and the smallest model size while reaching state - of - the - art performance. - **Robustness**: The experiments also show that this framework has strong robustness against imperfect trajectory inputs. - **Applicability**: Combined with the proposed strategies, the performance of the classical model is also improved, further verifying the universality of the framework. In conclusion, through proposing the MacFormer framework, this paper solves the deficiencies of existing trajectory prediction methods in using map information and real - time performance, providing a new solution for trajectory prediction in the field of autonomous driving.