MemFusionMap: Working Memory Fusion for Online Vectorized HD Map Construction

Jingyu Song, Xudong Chen, Liupei Lu, Jie Li, Katherine A. Skinner
2024-09-26
Abstract: High-definition (HD) maps provide environmental information for autonomous driving systems and are essential for safe planning. While existing methods with single-frame input achieve impressive performance for online vectorized HD map construction, they still struggle with complex scenarios and occlusions. We propose MemFusionMap, a novel temporal fusion model with enhanced temporal reasoning capabilities for online HD map construction. Specifically, we contribute a working memory fusion module that improves the model's memory capacity to reason across history frames. We also design a novel temporal overlap heatmap to explicitly inform the model about the temporal overlap information and vehicle trajectory in the Bird's Eye View space. By integrating these two designs, MemFusionMap significantly outperforms existing methods while also maintaining a versatile design for scalability. We conduct extensive evaluation on open-source benchmarks and demonstrate a maximum improvement of 5.4% in mAP over state-of-the-art methods. The code for MemFusionMap will be made open-source upon publication of this paper.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning, Robotics
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

This paper addresses several key challenges in online high-definition (HD) map construction:

1. **Insufficient performance in complex scenarios**: Existing single-frame input methods perform poorly in complex scenarios, especially in the presence of occlusions.
2. **Underutilization of temporal information**: Many existing methods fail to fully exploit temporal information, even though it helps considerably with occlusions and related issues.
3. **Management of historical frame information**: Accumulating all temporal features directly into a single memory feature map makes it hard for the network to handle complex road environments and leaves it prone to erroneous updates under occlusion.

To address these issues, the paper proposes **MemFusionMap**, a novel temporal fusion model with enhanced temporal reasoning capabilities for online HD map construction. The specific contributions include:

- **Working memory fusion module**: By maintaining a fixed-lag working memory buffer, the module increases the model's memory capacity and enables reasoning across history frames.
- **Temporal overlap heatmap**: A new heatmap explicitly provides the model with temporal overlap information and the vehicle trajectory in Bird's Eye View (BEV) space, thereby strengthening its temporal reasoning.

With these two designs, MemFusionMap significantly outperforms existing methods on multiple benchmarks, by up to 5.4% in mAP. The gains are largest in long perception range settings, which are better suited to practical autonomous driving deployment.
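
The two contributions described above can be illustrated with a small sketch. It is not the authors' implementation: the class `WorkingMemory`, the function `temporal_overlap_heatmap`, the `lag` parameter, and the translation-only ego motion expressed in whole grid cells are all assumptions made for illustration. The learned fusion of the buffered features is omitted entirely. The sketch shows only the bookkeeping: a fixed-lag buffer that discards the oldest BEV feature map, and a heatmap that counts how many buffered frames observed each cell of the current BEV grid.

```python
from collections import deque

import numpy as np


class WorkingMemory:
    """Hypothetical fixed-lag buffer keeping the most recent `lag` BEV feature maps."""

    def __init__(self, lag: int):
        # A deque with maxlen drops its oldest entry once the buffer is full.
        self.frames = deque(maxlen=lag)

    def push(self, bev_feature: np.ndarray) -> None:
        self.frames.append(bev_feature)

    def stacked(self) -> np.ndarray:
        """Return the buffered frames as one array of shape (T, C, H, W)."""
        return np.stack(list(self.frames), axis=0)


def temporal_overlap_heatmap(offsets, height: int, width: int) -> np.ndarray:
    """Count, per cell of the current BEV grid, how many frames covered it.

    `offsets` holds one (dy, dx) pair per buffered frame: the shift, in whole
    grid cells, of that frame's BEV footprint relative to the current grid.
    Assumption: ego motion is a pure translation; rotation is ignored.
    """
    heat = np.zeros((height, width), dtype=np.int32)
    for dy, dx in offsets:
        # Intersect the shifted footprint with the current grid.
        y0, y1 = max(0, dy), min(height, height + dy)
        x0, x1 = max(0, dx), min(width, width + dx)
        if y0 < y1 and x0 < x1:
            heat[y0:y1, x0:x1] += 1
    return heat


# Usage: push three frames into a buffer of lag 2, so the first frame is dropped.
memory = WorkingMemory(lag=2)
for step in range(3):
    memory.push(np.full((1, 4, 3), step, dtype=np.float32))

# Three frames taken while the footprint shifts one row per step along the y axis.
heatmap = temporal_overlap_heatmap([(0, 0), (1, 0), (2, 0)], height=4, width=3)
```

In this toy setup, rows covered by more of the shifted footprints receive higher counts, so the heatmap encodes both how often a region was seen and, through the gradient of the counts, the direction of travel. A real system would warp features with the full ego pose, including rotation, before any fusion.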