L2T-BEV: Local Lane Topology Prediction from Onboard Surround-View Cameras in Bird's Eye View Perspective.

Shanding Ye,Tao Li,Ruihang Li,Zhijie Pan
DOI: https://doi.org/10.1007/978-981-99-8435-0_29
2024-01-01
Abstract:High definition maps (HDMaps) serve as the foundation for autonomous vehicles, encompassing various driving scenario elements, among which lane topology is critically important for vehicle perception and planning. Existing work on lane topology extraction predominantly relies on manual processing, while automated methods are limited to road topology extraction. Recently, road representation learning based on surround-view with bird’s-eye view (BEV) has emerged, which directly predicts localized vectorized maps around the vehicle. However, these maps cannot represent the topological relationships between lanes. As a solution, we propose a novel method, L2T-BEV, which learns local lane topology maps in BEV. This method utilizes the EfficientNet to extract features from surround-view images, followed by transforming these features into the BEV space through the Inverse Perspective Mapping (IPM). Nonetheless, the IPM transformation often suffers from distortion issues. To alleviate this, we add a learnable residual mapping function to the features after the IPM transformation. Finally, we employ a transformer network with learnable positional embedding to process the fused images, generating higher-precision lane topology. We validated our method on the NuScenes dataset, and the experimental results demonstrate the feasibility and excellent performance.
What problem does this paper attempt to address?