Abstract: Point cloud registration is an essential technology in computer vision and robotics. Recently, transformer-based methods have achieved advanced performance in point cloud registration by exploiting the transformer's order invariance and its ability to model dependencies when aggregating information. However, they still suffer from indistinct feature extraction and sensitivity to noise and outliers, owing to three major limitations: 1) CNNs fail to model global relations because of their local receptive fields, so the extracted features are susceptible to noise; 2) the shallow-wide architecture of the transformers and the lack of positional information make information interaction inefficient, which leads to indistinct feature extraction; and 3) insufficient consideration of geometric compatibility makes it hard to identify incorrect correspondences unambiguously. To address these limitations, a novel full transformer network for point cloud registration is proposed, named the deep interaction transformer (DIT), which incorporates: 1) a point cloud structure extractor (PSE) that retrieves structural information and models global relations with the local feature integrator (LFI) and transformer encoders; 2) a deep-narrow point feature transformer (PFT) that facilitates deep information interaction across a pair of point clouds with positional information, such that the transformers establish comprehensive associations and directly learn the relative positions between points; and 3) a geometric matching-based correspondence confidence evaluation (GMCCE) method that measures spatial consistency and estimates correspondence confidence using the designed triangulated descriptor. Extensive experiments on the ModelNet40, ScanObjectNN, and 3DMatch datasets demonstrate that our method aligns point clouds precisely and consequently achieves superior performance compared with state-of-the-art methods.
The code is publicly available at https://github.com/CGuangyan-BIT/DIT.
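
As a rough illustration of the spatial consistency idea mentioned in the abstract, the sketch below scores each putative correspondence by the fraction of other correspondences with which it preserves pairwise distance, a property that any rigid transformation must satisfy. This is a generic length-based check and not the triangulated descriptor used by GMCCE; the function name `spatial_consistency_scores` and the tolerance `tol` are assumptions made for this example only.

```python
import math
from itertools import combinations


def spatial_consistency_scores(src, tgt, tol=0.05):
    """Score putative correspondences (src[i] <-> tgt[i]).

    Hypothetical helper, not taken from the DIT code base. Two
    correspondences i and j are counted as compatible when the distance
    between src[i] and src[j] matches the distance between tgt[i] and
    tgt[j] to within `tol`. The score of a correspondence is the fraction
    of the other correspondences it is compatible with.
    """
    n = len(src)
    if n != len(tgt):
        raise ValueError("src and tgt must have the same length")
    votes = [0] * n
    for i, j in combinations(range(n), 2):
        d_src = math.dist(src[i], src[j])
        d_tgt = math.dist(tgt[i], tgt[j])
        if abs(d_src - d_tgt) <= tol:
            votes[i] += 1
            votes[j] += 1
    if n < 2:
        return [0.0] * n
    return [v / (n - 1) for v in votes]


# Toy data: the first four target points are the source points rotated by
# 90 degrees about the z-axis and shifted by (2, 3, 4); the fifth target
# point is a deliberate mismatch.
src = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)]
tgt = [(2, 3, 4), (2, 4, 4), (1, 3, 4), (2, 3, 5), (10, 10, 10)]

scores = spatial_consistency_scores(src, tgt)
print(scores)
```

In this toy case the four correct correspondences each agree with the three other correct ones and score 0.75, while the mismatched fifth correspondence scores 0.0, so a simple threshold would separate them. A learned confidence estimate such as the one the abstract describes would replace this hand-set tolerance and voting rule.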

Deep Interactive Full Transformer Framework for Point Cloud Registration.

Full Transformer Framework for Robust Point Cloud Registration With Deep Information Interaction

End-to-end point cloud registration with transformer

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

Fast and Robust Point Cloud Registration with Tree-based Transformer

PointTr: Low-Overlap Point Cloud Registration with Transformer

PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer

Point Tree Transformer for Point Cloud Registration

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

Low-Overlap Point Cloud Registration With Transformer

A Registration Method of Overlap Aware Point Clouds Based on Transformer-to-Transformer Regression

Geometric Transformer for Fast and Robust Point Cloud Registration

2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds

GeoTransformer: Fast and Robust Point Cloud Registration with Geometric Transformer

3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification

Robust Point Cloud Registration Framework Based on Deep Graph Matching

Learning compact and overlap-biased interactions for point cloud registration

EGST: Enhanced Geometric Structure Transformer for Point Cloud Registration

Spatial deformable transformer for 3D point cloud registration

A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration