Abstract:<p>Efficient registration for large-scale scene point clouds is a basic capability that is essential for many real-world intelligence applications, such as autonomous driving or simultaneous localization and mapping. Because a large-scale scene point cloud has various characteristics such as large volumes of data, irregular local density, and limited overlapping, most existing approaches can only be trained and operated using small-scale point clouds. In this work, we propose a deep direct registration network, named DDRNet, to efficiently align the point clouds of large-scale scenes. The DDRNet consists of three parts: a local-spatially aware encoder that can efficiently aggregate posture information containing both local and spatial features; an attentional weighting module that allows our network to self-adaptively focus on overlapping areas; and a pyramid transformation decoder used to estimate transformation based on features having different resolutions. Also, we propose a partially-subsampled strategy, which is compatible with any learning-based registration method, to enable the network to be trained and tested in a self-supervised manner. We comprehensively validated our network's efficiency and robustness using four datasets: the ModelNet40, 3Dmatch, S3DIS, and the KITTI odometry datasets. The results demonstrate that our approach is more efficient and performs better than state-of-the-art methods, including both classical and learning-based methods, on large-scale scene data and object point cloud data, but with higher robustness to the variations in point density and overlapping. The efficiency and low registration error will make DDRNet attractive for substantial applications relying on a point cloud registration task.</p>

What problem does this paper attempt to address?

The paper is primarily dedicated to addressing the efficient registration problem of large-scale scene point clouds. Specifically, the study proposes a deep direct registration network named DDRNet (Deep Direct Registration Network) to tackle the characteristics of large-scale scene point clouds, such as massive data volume, irregular local density, and limited overlap. The main contributions of DDRNet include: 1. **Proposing DDRNet**: The network consists of three parts—a local spatial-aware encoder, an attention-weighted module, and a pyramid transformation decoder. These components work together to achieve efficient point cloud registration. - **Local Spatial-Aware Encoder**: Effectively aggregates pose information containing local and spatial features. - **Attention-Weighted Module**: Enables the network to adaptively focus on overlapping regions. - **Pyramid Transformation Decoder**: Estimates transformations based on features at different resolutions, addressing the issue of discarding local structures in existing direct learning methods. 2. **Proposing a Partial Sampling Strategy**: This strategy is compatible with any learning-based registration method, allowing the network to be trained and tested in a self-supervised manner. 3. **Validating the Network's Effectiveness and Robustness**: Comprehensive validation was conducted on multiple datasets, including ModelNet40, 3DMatch, S3DIS, and KITTI range data. The results demonstrate that DDRNet is more efficient and performs better than existing classical and learning-based methods in handling large-scale scene data and object point cloud data, especially in terms of robustness to point density variations and overlap ratio changes. In summary, this research aims to develop a registration algorithm capable of efficiently handling large-scale scene point cloud data to meet the needs of practical applications such as autonomous driving, LiDAR localization, and mapping.

DDRNet: Fast point cloud registration network for large-scale scenes

DeepICP: An End-to-End Deep Neural Network for 3D Point Cloud Registration

DOPNet: Achieving Accurate and Efficient Point Cloud Registration Based on Deep Learning and Multi-Level Features

Efficient low-overlapping point cloud registration based on two-stage network learning

RSKDD-Net: Random Sample-based Keypoint Detector and Descriptor

Sparse and Low-Overlapping Point Cloud Registration Network for Indoor Building Environments

Robust Point Cloud Registration Network for Complex Conditions

HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration

Sparse-to-Dense Matching Network for Large-scale LiDAR Point Cloud Registration

HRegNet: A Hierarchical Network for Large-scale Outdoor LiDAR Point Cloud Registration

PointMBF: A Multi-scale Bidirectional Fusion Network for Unsupervised RGB-D Point Cloud Registration

HRegNet: A Hierarchical Network for Efficient and Accurate Outdoor LiDAR Point Cloud Registration

OKR-Net: Overlapping Keypoints Registration Network for Large-Scale LiDAR Point Clouds

GenReg: Deep Generative Method for Fast Point Cloud Registration

Rethinking of learning-based 3D keypoints detection for large-scale point clouds registration

A Speedy Point Cloud Registration Method Based on Region Feature Extraction in Intelligent Driving Scene

An Efficient and Stable Registration Framework for Large Point Clouds at Two Different Moments

RoCNet: 3D Robust Registration of Point-Clouds using Deep Learning

GaussReg: Fast 3D Registration with Gaussian Splatting

RegGeoNet: Learning Regular Representations for Large-Scale 3D Point Clouds

AMCNet: Adaptive Matching Constraint for Unsupervised Point Cloud Registration.