Delving into the Secrets of BEV 3D Object Detection in Autonomous Driving: A Comprehensive Survey

Haoyu Li,Yueran Zhao,Jiaru Zhong,Bo Wang,Chao Sun,Fengchun Sun
DOI: https://doi.org/10.36227/techrxiv.173221675.59410416/v1
2024-01-01
Abstract:3D object detection plays a crucial role in autonomous driving, with Bird’s Eye View (BEV) becoming increasingly popular for its rich contextual information, ease of multimodal fusion, and scalability. Despite its advantages, current BEV-based 3D detection methods still face significant challenges, including multi-modal fusion, communication bottlenecks, robustness under varying conditions, and safety concerns. This survey provides a systematic review of recent advancements in BEV perception, organizing the research based on these focal areas. It spans a broad range of perspectives, offering valuable insights for future perception research. Additionally, this survey explores the influence of emerging technologies, such as large language models and end-to-end frameworks on enhancing BEV perception capabilities, focusing on improving performance and robustness. Key future directions include: (1) advancing from isolated vehicle perception to V2X cooperative perception; (2) evolving from single-modal to integrated multi-modal fusion; (3) shifting from simulated environments to real-world applications; and (4) transitioning from hierarchical perception frameworks to interpretable, end-to-end large-scale models.
What problem does this paper attempt to address?