Abstract:This article introduces BEVPlace++, a novel, fast, and robust LiDAR global localization method for unmanned ground vehicles. It uses lightweight convolutional neural networks (CNNs) on Bird's Eye View (BEV) image-like representations of LiDAR data to achieve accurate global localization through place recognition followed by 3-DoF pose estimation. Our detailed analyses reveal an interesting fact that CNNs are inherently effective at extracting distinctive features from LiDAR BEV images. Remarkably, keypoints of two BEV images with large translations can be effectively matched using CNN-extracted features. Building on this insight, we design a rotation equivariant module (REM) to obtain distinctive features while enhancing robustness to rotational changes. A Rotation Equivariant and Invariant Network (REIN) is then developed by cascading REM and a descriptor generator, NetVLAD, to sequentially generate rotation equivariant local features and rotation invariant global descriptors. The global descriptors are used first to achieve robust place recognition, and the local features are used for accurate pose estimation. Experimental results on multiple public datasets demonstrate that BEVPlace++, even when trained on a small dataset (3000 frames of KITTI) only with place labels, generalizes well to unseen environments, performs consistently across different days and years, and adapts to various types of LiDAR scanners. BEVPlace++ achieves state-of-the-art performance in subtasks of global localization including place recognition, loop closure detection, and global localization. Additionally, BEVPlace++ is lightweight, runs in real-time, and does not require accurate pose supervision, making it highly convenient for deployment. The source codes are publicly available at <a class="link-external link-https" href="https://github.com/zjuluolun/BEVPlace" rel="external noopener nofollow">this https URL</a>.

SphereVLAD++: Attention-Based and Signal-Enhanced Viewpoint Invariant Descriptor

Context for LiDAR-based Place Recognition

A Panoramic Localizer Based on Coarse-to-Fine Descriptors for Navigation Assistance

CFVL: A Coarse-to-Fine Vehicle Localizer with Omnidirectional Perception Across Severe Appearance Variations

Persistent Stereo Visual Localization on Cross-Modal Invariant Map

FreSCo: Frequency-Domain Scan Context for LiDAR-based Place Recognition with Translation and Rotation Invariance

2-Entity RANSAC for Robust Visual Localization in Changing Environment

2-Entity Random Sample Consensus for Robust Visual Localization: Framework, Methods, and Verifications

Spherical Transformer for LiDAR-based 3D Recognition

Local Descriptor for Robust Place Recognition using LiDAR Intensity

Efficient LiDAR Odometry for Autonomous Driving.

Attention-Enhanced Cross-modal Localization Between Spherical Images and Point Clouds

BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles

Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching

VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition

Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition

SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition

Object Scan Context: Object-centric Spatial Descriptor for Place Recognition within 3D Point Cloud Map

BEVPlace: Learning LiDAR-based Place Recognition using Bird's Eye View Images

A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation