Enhancing 3D Lane Detection and Topology Reasoning with 2D Lane Priors

Han Li, Zehao Huang, Zitian Wang, Wenge Rong, Naiyan Wang, Si Liu
2024-06-05
Abstract: 3D lane detection and topology reasoning are essential tasks in autonomous driving scenarios, requiring not only detecting the accurate 3D coordinates on lane lines, but also reasoning about the relationships between lanes and traffic elements. Current vision-based methods, whether explicitly constructing BEV features or not, all establish the lane anchors/queries in 3D space while ignoring the 2D lane priors. In this study, we propose Topo2D, a novel Transformer-based framework that leverages 2D lane instances to initialize 3D queries and 3D positional embeddings. Furthermore, we explicitly incorporate 2D lane features into the recognition of topology relationships among lane centerlines and between lane centerlines and traffic elements. Topo2D achieves 44.5% OLS on the multi-view topology reasoning benchmark OpenLane-V2 and 62.6% F-Score on the single-view 3D lane detection benchmark OpenLane, exceeding the performance of existing state-of-the-art methods.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses two related tasks in autonomous driving: 3D lane detection and topological relationship reasoning. Specifically:

1. **3D Lane Detection**: Autonomous driving systems need the precise 3D coordinates of lane lines. Existing vision-based methods typically initialize lane anchors or queries in 3D space and ignore 2D lane prior information, which limits 3D lane detection performance.
2. **Topological Relationship Reasoning**: Besides detecting the 3D coordinates of lane lines, a system must also infer the relationships among lanes and between lanes and traffic elements (such as traffic lights and road signs). Existing methods usually reason from 3D lane features alone, neglecting the additional information that 2D lane features provide.

To address these issues, the paper proposes a new framework called Topo2D, which uses 2D lane instances to initialize 3D queries and 3D positional embeddings, and explicitly incorporates 2D lane features into topology reasoning. With these changes, Topo2D outperforms existing state-of-the-art methods on the multi-view topology reasoning benchmark OpenLane-V2 and the single-view 3D lane detection benchmark OpenLane.

### Main Contributions:
- **Utilizing 2D Lane Priors**: Improving 3D lane perception by initializing 3D queries and positional embeddings with 2D lane instances obtained from a 2D lane decoder.
- **Explicit Use of 2D Lane Information**: Combining 2D lane features with 3D lane features to better identify the topological relationships among lane centerlines and between lane centerlines and traffic elements.
- **Performance Validation**: Validating Topo2D on the multi-view topology reasoning benchmark OpenLane-V2 and the single-view 3D lane detection benchmark OpenLane, where it achieves state-of-the-art performance.

### Experimental Results:
- On the OpenLane-V2 dataset, Topo2D achieves 44.5% OLS in the multi-view topology reasoning task, exceeding existing state-of-the-art methods.
- On the OpenLane dataset, Topo2D achieves a 62.6% F-Score in the single-view 3D lane detection task, also exceeding existing state-of-the-art methods.

Together, these results indicate that 2D lane priors improve both 3D lane detection and topological relationship reasoning.
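
The idea of combining 2D and 3D lane features for topology reasoning can be illustrated with a small sketch. This is not the paper's implementation: the summary does not specify how the features are fused or scored, so the code below assumes concatenation as the fusion step and a single linear scorer followed by a sigmoid as the pairwise classifier. All function names (`fuse_features`, `pair_score`, `topology_matrix`), the weights, and the 0.5 threshold are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence


def fuse_features(feat_2d: Sequence[float], feat_3d: Sequence[float]) -> List[float]:
    """Fuse one lane's 2D and 3D features by concatenation (an assumed choice)."""
    return list(feat_2d) + list(feat_3d)


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def pair_score(src: Sequence[float], dst: Sequence[float],
               weights: Sequence[float], bias: float) -> float:
    """Score the directed relation src -> dst with one linear layer (assumed)."""
    pair = list(src) + list(dst)
    if len(pair) != len(weights):
        raise ValueError("weights must match the concatenated pair length")
    logit = sum(w * x for w, x in zip(weights, pair)) + bias
    return sigmoid(logit)


def topology_matrix(feats_2d: Sequence[Sequence[float]],
                    feats_3d: Sequence[Sequence[float]],
                    weights: Sequence[float], bias: float,
                    threshold: float = 0.5) -> List[List[int]]:
    """Build a directed adjacency matrix over lanes; self-links are excluded."""
    fused = [fuse_features(a, b) for a, b in zip(feats_2d, feats_3d)]
    n = len(fused)
    return [
        [
            1 if i != j and pair_score(fused[i], fused[j], weights, bias) > threshold else 0
            for j in range(n)
        ]
        for i in range(n)
    ]


# Toy example: two lanes, each with a 1-D "2D feature" and a 1-D "3D feature".
feats_2d = [[1.0], [0.0]]
feats_3d = [[0.0], [1.0]]
weights = [2.0, 0.0, 0.0, 2.0]  # hand-picked, not learned
adjacency = topology_matrix(feats_2d, feats_3d, weights, bias=-3.0)
print(adjacency)
```

Because the pair is built by concatenating source then destination, the score for lane i to lane j generally differs from the score for j to i, which matches the directed nature of lane connectivity. In a trained model the weights would be learned rather than hand-picked, and the scorer would typically be a deeper network.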