Abstract: Point cloud registration is an essential technology in computer vision and robotics. Recently, transformer-based methods have achieved advanced performance in point cloud registration by exploiting the transformer's order invariance and its ability to model dependencies when aggregating information. However, they still suffer from indistinct feature extraction and sensitivity to noise and outliers, owing to three major limitations: 1) the adoption of CNNs fails to model global relations because of their local receptive fields, leaving the extracted features susceptible to noise; 2) the shallow-wide architecture of transformers and the lack of positional information cause inefficient information interaction and therefore indistinct feature extraction; and 3) insufficient consideration of geometric compatibility makes incorrect correspondences hard to identify unambiguously. To address these limitations, a novel full transformer network for point cloud registration is proposed, named the deep interaction transformer (DIT), which incorporates: 1) a point cloud structure extractor (PSE) that retrieves structural information and models global relations with the local feature integrator (LFI) and transformer encoders; 2) a deep-narrow point feature transformer (PFT) that facilitates deep information interaction across a pair of point clouds with positional information, such that the transformers establish comprehensive associations and directly learn the relative position between points; and 3) a geometric matching-based correspondence confidence evaluation (GMCCE) method that measures spatial consistency and estimates correspondence confidence using a purpose-designed triangulated descriptor. Extensive experiments on the ModelNet40, ScanObjectNN, and 3DMatch datasets demonstrate that our method aligns point clouds precisely and consequently achieves superior performance compared with state-of-the-art methods.
The code is publicly available at https://github.com/CGuangyan-BIT/DIT.
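
The idea of geometric compatibility mentioned in the abstract can be illustrated with a minimal sketch. It is not the GMCCE method or its triangulated descriptor, which the abstract does not specify. It is a simpler, commonly used pairwise check, assuming that a rigid transform preserves distances, so two correct correspondences should span the same length in both clouds. The function name `spatial_consistency_scores` and the bandwidth parameter `sigma` are hypothetical and chosen only for this illustration.

```python
import numpy as np


def spatial_consistency_scores(src, tgt, sigma=0.1):
    """Score each putative correspondence (src[i] <-> tgt[i]) in [0, 1].

    Illustrative assumption: a correspondence is trustworthy if the
    distances from it to the other correspondences agree in both clouds.
    """
    src = np.asarray(src, dtype=float)
    tgt = np.asarray(tgt, dtype=float)
    # Pairwise Euclidean distances inside each cloud, shape (N, N).
    d_src = np.linalg.norm(src[:, None, :] - src[None, :, :], axis=-1)
    d_tgt = np.linalg.norm(tgt[:, None, :] - tgt[None, :, :], axis=-1)
    # Rigid motion preserves lengths, so correct pairs give a difference
    # near zero and a compatibility near one.
    compat = np.exp(-((d_src - d_tgt) ** 2) / (2.0 * sigma**2))
    # A correspondence should not vote for itself.
    np.fill_diagonal(compat, 0.0)
    # Average support received from the other N - 1 correspondences.
    return compat.sum(axis=1) / (len(src) - 1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    src = rng.uniform(-1.0, 1.0, size=(50, 3))
    angle = np.deg2rad(30.0)
    rot = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                    [np.sin(angle), np.cos(angle), 0.0],
                    [0.0, 0.0, 1.0]])
    tgt = src @ rot.T + np.array([0.5, -0.2, 0.3])
    tgt[:5] = rng.uniform(5.0, 10.0, size=(5, 3))  # inject five wrong matches
    print(spatial_consistency_scores(src, tgt).round(2))
```

In this sketch the score of a correspondence is roughly the fraction of other correspondences it is length-compatible with, so wrong matches tend to score low when most matches are correct. Such scores could serve as confidence weights in a later pose-estimation step, although how DIT actually uses its confidences is not stated in the abstract.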

Dynamic Cues-Assisted Transformer for Robust Point Cloud Registration

Fast and Robust Point Cloud Registration with Tree-based Transformer

Full Transformer Framework for Robust Point Cloud Registration With Deep Information Interaction

End-to-end point cloud registration with transformer

Deep Interactive Full Transformer Framework for Point Cloud Registration

PointTr: Low-Overlap Point Cloud Registration with Transformer

Point Tree Transformer for Point Cloud Registration

Low-Overlap Point Cloud Registration With Transformer

2D3D-MATR: 2D-3D Matching Transformer for Detection-free Registration between Images and Point Clouds

Geometric Transformer for Fast and Robust Point Cloud Registration

A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration

IGReg: Image-Geometry-Assisted Point Cloud Registration via Selective Correlation Fusion

PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer

OAAFormer: Robust and Efficient Point Cloud Registration Through Overlapping-Aware Attention in Transformer

Learning compact and overlap-biased interactions for point cloud registration

Spatial deformable transformer for 3D point cloud registration

RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration

A Registration Method of Overlap Aware Point Clouds Based on Transformer-to-Transformer Regression

GCMTN: Low-Overlap Point Cloud Registration Network Combining Dense Graph Convolution and Multilevel Interactive Transformer

GTPCR: Graph-Enhanced Transformer for Point Cloud Registration