CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration

Shuhao Kang,Youqi Liao,Jianping Li,Fuxun Liang,Yuhao Li,Xianghong Zou,Fangning Li,Xieyuanli Chen,Zhen Dong,Bisheng Yang

2024-09-12

Abstract:Image-to-point cloud (I2P) registration is a fundamental task for robots and autonomous vehicles to achieve cross-modality data fusion and localization. Current I2P registration methods primarily focus on estimating correspondences at the point or pixel level, often neglecting global alignment. As a result, I2P matching can easily converge to a local optimum if it lacks high-level guidance from global constraints. To improve the success rate and general robustness, this paper introduces CoFiI2P, a novel I2P registration network that extracts correspondences in a coarse-to-fine manner. First, the image and point cloud data are processed through a two-stream encoder-decoder network for hierarchical feature extraction. Second, a coarse-to-fine matching module is designed to leverage these features and establish robust feature correspondences. Specifically, In the coarse matching phase, a novel I2P transformer module is employed to capture both homogeneous and heterogeneous global information from the image and point cloud data. This enables the estimation of coarse super-point/super-pixel matching pairs with discriminative descriptors. In the fine matching module, point/pixel pairs are established with the guidance of super-point/super-pixel correspondences. Finally, based on matching pairs, the transform matrix is estimated with the EPnP-RANSAC algorithm. Experiments conducted on the KITTI Odometry dataset demonstrate that CoFiI2P achieves impressive results, with a relative rotation error (RRE) of 1.14 degrees and a relative translation error (RTE) of 0.29 meters, while maintaining real-time speed.Additional experiments on the Nuscenes datasets confirm our method's generalizability. The project page is available at \url{<a class="link-external link-https" href="https://whu-usi3dv.github.io/CoFiI2P" rel="external noopener nofollow">this https URL</a>}.

Computer Vision and Pattern Recognition,Artificial Intelligence,Robotics

What problem does this paper attempt to address?

The paper attempts to address the challenges in achieving image-to-point cloud (I2P) registration in robots and autonomous vehicles. Specifically: 1. **Limitations of existing methods**: Current I2P registration methods mainly focus on point or pixel-level correspondence estimation, often neglecting global alignment, which leads to matches easily converging to local optima. 2. **Need for cross-modal data fusion**: To achieve cross-modal data fusion and localization, a method is needed that can establish accurate pose transformation from the image coordinate system to the pre-built point cloud coordinate system. To address these issues, the paper proposes the CoFiI2P network, a novel coarse-to-fine I2P registration method that incrementally improves registration accuracy through a staged approach, avoiding local optima and significantly enhancing the success rate and robustness of registration. Experimental results on the KITTI Odometry and Nuscenes datasets show that CoFiI2P achieves significant improvements in both relative rotation error (RRE) and relative translation error (RTE), with improvements of 84% and 89% respectively compared to the current best methods.

CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration

CFI2P: Coarse-to-Fine Cross-Modal Correspondence Learning for Image-to-Point Cloud Registration

CoFi: Coarse-to-Fine ICP for LiDAR Localization in an Efficient Long-lasting Point Cloud Map

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

TFCT-I2P: Three stream fusion network with color aware transformer for image-to-point cloud registration

Coarse-to-fine heterogeneous point clouds registration in vehicle-to-infrastructure collaboration

Quantity-Aware Coarse-to-Fine Correspondence for Image-to-Point Cloud Registration

MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval

OL-Reg: Registration of Image and Sparse LiDAR Point Cloud with Object-Level Dense Correspondences

End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization

Point Cloud registration based on iterative closest point

HybridFusion: LiDAR and Vision Cross-Source Point Cloud Fusion

Cy-CNN: cylinder convolution based rotation-invariant neural network for point cloud registration

GFOICP: Geometric Feature Optimized Iterative Closest Point for 3-D Point Cloud Registration

Incremental registration towards large-scale heterogeneous point clouds by hierarchical graph matching

DCPCR: Deep Compressed Point Cloud Registration in Large-Scale Outdoor Environments

A Coarse-to-Fine Algorithm for Matching and Registration in 3D Cross-Source Point Clouds

LiPO: LiDAR Inertial Odometry for ICP Comparison

3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration

Robust real-world point cloud registration by inlier detection

Feature Interactive Representation for Point Cloud Registration