V2X Cooperative Perception for Autonomous Driving: Recent Advances and Challenges

Tao Huang, Jianan Liu, Xi Zhou, Dinh C. Nguyen, Mostafa Rahimi Azghadi, Yuxuan Xia, Qing-Long Han, Sumei Sun
2024-05-09
Abstract: Accurate perception is essential for advancing autonomous driving and addressing safety challenges in modern transportation systems. Despite significant advancements in computer vision for object recognition, current perception methods still face difficulties in complex real-world traffic environments. Challenges such as physical occlusion and limited sensor field of view persist for individual vehicle systems. Cooperative Perception (CP) with Vehicle-to-Everything (V2X) technologies has emerged as a solution to overcome these obstacles and enhance driving automation systems. While some research has explored CP's fundamental architecture and critical components, there remains a lack of comprehensive summaries of the latest innovations, particularly in the context of V2X communication technologies. To address this gap, this paper provides a comprehensive overview of the evolution of CP technologies, spanning from early explorations to recent developments, including advancements in V2X communication technologies. Additionally, a contemporary generic framework is proposed to illustrate the V2X-based CP workflow, aiding in the structured understanding of CP system components. Furthermore, this paper categorizes prevailing V2X-based CP methodologies based on the critical issues they address. An extensive literature review is conducted within this taxonomy, evaluating existing datasets and simulators. Finally, open challenges and future directions in CP for autonomous driving are discussed by considering both perception and V2X communication advancements.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses the limitations of single-vehicle perception in autonomous driving: physical occlusion in complex traffic environments, limited sensor field of view (FoV), and insufficient sensor resolution. Despite significant progress in computer vision, current perception methods still struggle with real-world conditions such as high traffic volume, frequent pedestrian activity, and blocked lines of sight.

To overcome these limitations, the paper examines cooperative perception (CP) based on vehicle-to-everything (V2X) communication. V2X communication lets connected autonomous vehicles (CAVs) extend their perception through wireless information exchange, thereby improving road safety. CP fuses data from multiple agents to build a more accurate representation of the environment, which helps address the shortcomings of single-vehicle systems.

Specifically, the paper aims to:

1. **Provide a comprehensive overview**: Trace the evolution of CP technology from early explorations to recent developments, including advances in V2X communication technology.
2. **Propose a general framework**: Introduce a contemporary generic framework that presents the V2X-based CP workflow and its system components in a structured manner.
3. **Classify existing methods**: Categorize existing V2X-based CP methods according to the critical issues they address.
4. **Review the literature**: Conduct an extensive literature review within the proposed taxonomy, and evaluate existing datasets and simulators.
5. **Discuss challenges and future directions**: Discuss the open challenges and future research directions of CP in autonomous driving, considering advances in both perception and V2X communication.

Through these efforts, the paper aims to fill the gaps in existing reviews, give researchers and engineers a systematic framework for understanding CP, and promote the further development of CP in autonomous driving.