Global Point Cloud Registration Network for Large Transformations

Hanz Cuevas-Velasquez, Alejandro Galán-Cuenca, Antonio Javier Gallego, Marcelo Saval-Calvo, Robert B. Fisher
2024-03-27
Abstract: Three-dimensional data registration is an established yet challenging problem that is key in many different applications, such as mapping the environment for autonomous vehicles, and modeling objects and people for avatar creation, among many others. Registration refers to the process of mapping multiple data into the same coordinate system by means of matching correspondences and transformation estimation. Novel proposals exploit the benefits of deep learning architectures for this purpose, as they learn the best features for the data, providing better matches and hence results. However, the state of the art is usually focused on cases of relatively small transformations, although in certain applications and in a real and practical environment, large transformations are very common. In this paper, we present ReLaTo (Registration for Large Transformations), an architecture that faces the cases where large transformations happen while maintaining good performance for local transformations. This proposal uses a novel Softmax pooling layer to find correspondences in a bilateral consensus manner between two point sets, sampling the most confident matches. These matches are used to estimate a coarse and global registration using weighted Singular Value Decomposition (SVD). A target-guided denoising step is then applied to both the obtained matches and latent features, estimating the final fine registration considering the local geometry. All these steps are carried out following an end-to-end approach, which has been shown to improve 10 state-of-the-art registration methods in two datasets commonly used for this task (ModelNet40 and KITTI), especially in the case of large transformations.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses global 3D Point Cloud Registration (PCR) under large transformations: aligning two point clouds accurately and efficiently when one has undergone a significant rotation and/or translation relative to the other. To this end, the authors propose an architecture called ReLaTo (Registration for Large Transformations), which works as follows:

1. **Bilateral Consensus Matching**: ReLaTo introduces a bilateral consensus criterion to determine correspondences between the source and target point clouds. A pair is retained only when it is the best match in both directions, source to target and target to source.
2. **Softmax Pooling Layer**: The paper proposes a Softmax pooling layer that learns confidence scores for point pairs in an unsupervised manner and samples the most confident pairs.
3. **Coarse-to-Fine Registration Strategy**: A coarse, global registration is first estimated from the matches selected by the Softmax pooling layer, using weighted Singular Value Decomposition (SVD). A step called "target-guided denoising" is then applied to the matches and latent features to produce the final fine registration, which takes local geometry into account.
4. **End-to-End Network Architecture**: The whole pipeline runs inside a single end-to-end neural network, so feature extraction, matching, and transformation estimation are trained jointly rather than as separate hand-designed stages.

According to the paper, ReLaTo improves on 10 state-of-the-art registration methods on the ModelNet40 and KITTI datasets, with the largest gains under large transformations. The method is also described as robust to noisy and incomplete data, which matters in many practical applications.
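To make the coarse stage more concrete, the following is a minimal NumPy sketch of two-way softmax matching followed by a weighted SVD (Kabsch) fit. It is an illustration under stated assumptions, not the authors' implementation: the function names `bilateral_matches` and `weighted_svd` are hypothetical, the product of the two softmax directions is a generic stand-in for the paper's learned Softmax pooling layer, the descriptors are synthetic rather than learned, and the target-guided denoising step is not shown.

```python
import numpy as np

def softmax(x, axis):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def bilateral_matches(feat_src, feat_tgt, top_k):
    """Hypothetical helper: keep the top_k pairs that score well in BOTH directions."""
    sim = feat_src @ feat_tgt.T              # (N, M) descriptor similarity
    p_st = softmax(sim, axis=1)              # source -> target probabilities
    p_ts = softmax(sim, axis=0)              # target -> source probabilities
    conf = p_st * p_ts                       # high only when both directions agree
    tgt_idx = conf.argmax(axis=1)            # best target for each source point
    best = conf[np.arange(len(feat_src)), tgt_idx]
    src_idx = np.argsort(-best)[:top_k]      # most confident source points
    return src_idx, tgt_idx[src_idx], best[src_idx]

def weighted_svd(src, tgt, w):
    """Weighted Kabsch: find R, t minimising sum_i w_i * ||R @ src_i + t - tgt_i||^2."""
    w = w / w.sum()
    mu_s = (w[:, None] * src).sum(axis=0)    # weighted centroids
    mu_t = (w[:, None] * tgt).sum(axis=0)
    H = (src - mu_s).T @ (w[:, None] * (tgt - mu_t))   # 3x3 weighted covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))   # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = mu_t - R @ mu_s
    return R, t

# Synthetic demo: a 170-degree rotation about z plus a translation.
rng = np.random.default_rng(0)
n = 40
src = rng.normal(size=(n, 3))
angle = np.deg2rad(170.0)
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle),  np.cos(angle), 0.0],
                   [0.0,            0.0,           1.0]])
t_true = np.array([2.0, -1.0, 0.5])
perm = rng.permutation(n)                    # target points arrive in shuffled order
tgt = (src @ R_true.T + t_true)[perm]

# Assumption: near-orthogonal descriptors stand in for learned, pose-invariant features.
feat_src = 5.0 * np.eye(n) + 0.05 * rng.normal(size=(n, n))
feat_tgt = feat_src[perm]

src_idx, tgt_idx, conf = bilateral_matches(feat_src, feat_tgt, top_k=20)
R_est, t_est = weighted_svd(src[src_idx], tgt[tgt_idx], conf)
```

Multiplying the two softmax directions is one simple way to express a bilateral consensus: a pair scores highly only if each point prefers the other. Because the closed-form SVD fit does not depend on an initial pose, the size of the rotation does not affect it, provided the correspondences are reliable.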