DDRNet: Fast point cloud registration network for large-scale scenes

Zhenghua Zhang,Guoliang Chen,Xuan Wang,Mingcong Shu
DOI: https://doi.org/10.1016/j.isprsjprs.2021.03.003
IF: 12.7
2021-05-01
ISPRS Journal of Photogrammetry and Remote Sensing
Abstract:<p>Efficient registration for large-scale scene point clouds is a basic capability that is essential for many real-world intelligence applications, such as autonomous driving or simultaneous localization and mapping. Because a large-scale scene point cloud has various characteristics such as large volumes of data, irregular local density, and limited overlapping, most existing approaches can only be trained and operated using small-scale point clouds. In this work, we propose a deep direct registration network, named DDRNet, to efficiently align the point clouds of large-scale scenes. The DDRNet consists of three parts: a local-spatially aware encoder that can efficiently aggregate posture information containing both local and spatial features; an attentional weighting module that allows our network to self-adaptively focus on overlapping areas; and a pyramid transformation decoder used to estimate transformation based on features having different resolutions. Also, we propose a partially-subsampled strategy, which is compatible with any learning-based registration method, to enable the network to be trained and tested in a self-supervised manner. We comprehensively validated our network's efficiency and robustness using four datasets: the ModelNet40, 3Dmatch, S3DIS, and the KITTI odometry datasets. The results demonstrate that our approach is more efficient and performs better than state-of-the-art methods, including both classical and learning-based methods, on large-scale scene data and object point cloud data, but with higher robustness to the variations in point density and overlapping. The efficiency and low registration error will make DDRNet attractive for substantial applications relying on a point cloud registration task.</p>
imaging science & photographic technology,remote sensing,geography, physical,geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the efficient registration problem of large-scale scene point clouds. Specifically, the study proposes a deep direct registration network named DDRNet (Deep Direct Registration Network) to tackle the characteristics of large-scale scene point clouds, such as massive data volume, irregular local density, and limited overlap. The main contributions of DDRNet include: 1. **Proposing DDRNet**: The network consists of three parts—a local spatial-aware encoder, an attention-weighted module, and a pyramid transformation decoder. These components work together to achieve efficient point cloud registration. - **Local Spatial-Aware Encoder**: Effectively aggregates pose information containing local and spatial features. - **Attention-Weighted Module**: Enables the network to adaptively focus on overlapping regions. - **Pyramid Transformation Decoder**: Estimates transformations based on features at different resolutions, addressing the issue of discarding local structures in existing direct learning methods. 2. **Proposing a Partial Sampling Strategy**: This strategy is compatible with any learning-based registration method, allowing the network to be trained and tested in a self-supervised manner. 3. **Validating the Network's Effectiveness and Robustness**: Comprehensive validation was conducted on multiple datasets, including ModelNet40, 3DMatch, S3DIS, and KITTI range data. The results demonstrate that DDRNet is more efficient and performs better than existing classical and learning-based methods in handling large-scale scene data and object point cloud data, especially in terms of robustness to point density variations and overlap ratio changes. In summary, this research aims to develop a registration algorithm capable of efficiently handling large-scale scene point cloud data to meet the needs of practical applications such as autonomous driving, LiDAR localization, and mapping.