LGmap: Local-to-Global Mapping Network for Online Long-Range Vectorized HD Map Construction

Kuang Wu,Sulei Nian,Can Shen,Chuan Yang,Zhanbin Li
2024-06-20
Abstract:This report introduces the first-place winning solution for the Autonomous Grand Challenge 2024 - Mapless Driving. In this report, we introduce a novel online mapping pipeline LGmap, which adept at long-range temporal model. Firstly, we propose symmetric view transformation(SVT), a hybrid view transformation module. Our approach overcomes the limitations of forward sparse feature representation and utilizing depth perception and SD prior information. Secondly, we propose hierarchical temporal fusion(HTF) module. It employs temporal information from local to global, which empowers the construction of long-range HD map with high stability. Lastly, we propose a novel ped-crossing resampling. The simplified ped crossing representation accelerates the instance attention based decoder convergence performance. Our method achieves 0.66 UniScore in the Mapless Driving OpenLaneV2 test set.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the problem of dynamically constructing high-precision local HD maps (HD Map) from onboard camera images and SD (Semantic Dense) maps in autonomous driving. Specifically, the paper proposes a novel online mapping pipeline named LGmap, which aims to overcome the limitations of existing methods in long-distance mapping and improve the stability and accuracy of mapping. ### Main Issues: 1. **Long-distance Mapping**: Existing methods have limitations in handling long-distance mapping, especially in long-term sequence mapping. 2. **View Transformation**: Forward projection and backward projection methods each have their own flaws, requiring a method that combines the advantages of both. 3. **Temporal Fusion**: How to effectively fuse local and global temporal information to improve the stability and accuracy of mapping. 4. **Pedestrian Crossing Area Representation**: Simplifying the representation of pedestrian crossing areas to accelerate model convergence. ### Solutions: 1. **Symmetric View Transformation (SVT)**: Combines the advantages of forward projection and backward projection, utilizing depth perception and SD prior information to generate more accurate BEV (Bird's-Eye-View) representations. 2. **Hierarchical Temporal Fusion (HTF)**: Improves the stability of long-distance mapping by fusing temporal information from local to global. 3. **Pedestrian Crossing Area Resampling**: Simplifies pedestrian crossing areas to four corner points and uniformly samples six points on each edge, simplifying the representation and accelerating model convergence. ### Experimental Results: - On the Mapless Driving OpenLaneV2 test set, the LGmap method achieved a UniScore of 0.66, demonstrating its superior performance in dynamically constructing local HD maps. Through these innovations, LGmap excels in the construction of HD maps in the field of autonomous driving, particularly in long-distance and long-term sequence mapping.