Abstract: Vehicle-to-Vehicle (V2V) cooperative perception has become increasingly popular in autonomous driving because it overcomes inherent limitations of single-vehicle perception systems, such as limited range and susceptibility to occlusions. In a V2V system, vehicles in close proximity can share perception data. Because each vehicle collects its data from a different viewpoint, fusing the data requires accurate pose information (position and heading direction) to transform the received data into the receiving vehicle's viewpoint. However, pose errors, often caused by measurement noise or sensor failures, can lead to severe misalignment during data fusion, resulting in incorrect object detections and potentially hazardous decisions in autonomous driving systems. To address this challenge, we present BB-Align, a lightweight pose recovery framework that utilizes Lidar Bird's-eye View (BV) images and object bounding Boxes for relative pose estimation. Designed as a plug-and-play solution, the proposed method requires no additional model training, enabling effortless integration into existing V2V systems. Our approach uses Lidar-derived BV images with a Log-Gabor filter-based feature map for effective image matching despite image sparsity. To reduce errors from self-motion distortion, we also integrate object bounding boxes for finer alignment. The proposed method is evaluated on the V2V4Real dataset, currently the only real-world V2V dataset. Our approach demonstrates high pose estimation accuracy, outperforming an existing graph-matching method. It achieves translation and rotation errors of less than 1 m and 1°, respectively, in 80% of cases within a 70 m range between vehicles. Furthermore, when the proposed framework is integrated into cooperative object detection models under severe pose error, the results show up to a 2× increase in Average Precision (AP) compared to the same models without pose recovery, with more pronounced improvements at short range.
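To illustrate why pose accuracy matters for fusion, the following is a minimal sketch of the viewpoint transformation the abstract refers to, assuming planar poses given as (x, y, yaw) in a shared world frame. It is not the BB-Align method itself; the function names (`pose_to_matrix`, `relative_pose`, and so on) and the example poses are hypothetical and chosen only for illustration.

```python
import math

def pose_to_matrix(x, y, yaw):
    """3x3 homogeneous matrix mapping vehicle-frame points to the world frame."""
    c, s = math.cos(yaw), math.sin(yaw)
    return [[c, -s, x], [s, c, y], [0.0, 0.0, 1.0]]

def invert(T):
    """Invert a 2D rigid transform: (R, t) -> (R^T, -R^T t)."""
    c, s, x, y = T[0][0], T[1][0], T[0][2], T[1][2]
    return [[c, s, -(c * x + s * y)],
            [-s, c, s * x - c * y],
            [0.0, 0.0, 1.0]]

def matmul(A, B):
    """Multiply two 3x3 matrices."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def relative_pose(ego_pose, other_pose):
    """Transform taking points from the other vehicle's frame to the ego frame."""
    return matmul(invert(pose_to_matrix(*ego_pose)), pose_to_matrix(*other_pose))

def apply(T, point):
    """Apply a homogeneous 2D transform to an (x, y) point."""
    px, py = point
    return (T[0][0] * px + T[0][1] * py + T[0][2],
            T[1][0] * px + T[1][1] * py + T[1][2])

# Hypothetical example: the ego vehicle sits at the origin, the other vehicle
# is 40 m ahead, and it reports an object 30 m in front of itself.
ego = (0.0, 0.0, 0.0)
other_true = (40.0, 0.0, 0.0)
# Assumed pose error: 1 m lateral offset and 1 degree of heading error.
other_noisy = (40.0, 1.0, math.radians(1.0))

obj_in_other = (30.0, 0.0)
p_true = apply(relative_pose(ego, other_true), obj_in_other)
p_noisy = apply(relative_pose(ego, other_noisy), obj_in_other)

# Distance between where the object really is and where the noisy pose puts it.
misalignment = math.hypot(p_noisy[0] - p_true[0], p_noisy[1] - p_true[1])
print(f"misalignment: {misalignment:.2f} m")
```

In this sketch the heading error contributes a displacement that grows with the object's distance from the sending vehicle, which is one reason small rotation errors can still cause visible misalignment of fused detections.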

V2VFormer++: Multi-Modal Vehicle-to-Vehicle Cooperative Perception Via Global-Local Transformer

BB-Align: A Lightweight Pose Recovery Framework for Vehicle-to-Vehicle Cooperative Perception

Slim-FCP: Lightweight-Feature-Based Cooperative Perception for Connected Automated Vehicles

V2X-ViT: Vehicle-to-Everything Cooperative Perception with Vision Transformer

V2X-AHD: Vehicle-to-Everything Cooperation Perception via Asymmetric Heterogenous Distillation Network

CoFormerNet: A Transformer-Based Fusion Approach for Enhanced Vehicle-Infrastructure Cooperative Perception

R. M. Bucke: a Victorian asylum superintendent.

HM-ViT: Hetero-modal Vehicle-to-Vehicle Cooperative Perception with Vision Transformer

CoBEVFusion: Cooperative Perception with LiDAR-Camera Bird's-Eye View Fusion

Cooperative Perception with Learning-Based V2V Communications

HP3D-V2V: High-Precision 3D Object Detection Vehicle-to-Vehicle Cooperative Perception Algorithm

A Novel Probabilistic V2X Data Fusion Framework for Cooperative Perception

CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers

PACP: Priority-Aware Collaborative Perception for Connected and Autonomous Vehicles

Consensus-Based Distributed Cooperative Perception for Connected and Automated Vehicles

V2V4Real: A Real-world Large-scale Dataset for Vehicle-to-Vehicle Cooperative Perception

ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments

OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication

NLOS Dies Twice: Challenges and Solutions of V2X for Cooperative Perception