Abstract: Visual localization plays a critical role in low-cost autonomous mobile robots. Contemporary leading methods for precise visual localization are predominantly 3D scene-specific, incurring extra computational and memory overhead to construct a 3D scene model in each novel environment. An alternative approach that directly uses a database of 2D images for visual localization offers more flexibility, but such methods currently suffer from limited localization accuracy. In this paper, we propose an accurate and robust multiple-checking-based 3D model-free visual localization system to address these issues. To ensure high accuracy, we focus on estimating the pose of a query image relative to the retrieved database images using 2D-2D feature matches. Theoretically, by incorporating the local planar motion constraint into both the essential matrix estimation and the triangulation stages, we reduce the minimum number of feature matches required for absolute pose estimation, thereby enhancing the robustness of outlier rejection. Additionally, we introduce a multiple-checking mechanism to ensure the correctness of the solution throughout the solving process. The efficacy of our approach is substantiated through both qualitative and quantitative assessments on simulated data and two real-world datasets, which show significant improvements in the accuracy and robustness of our 3D model-free visual localization system.

Note to Practitioners: The motivation of this article stems from the need for an accurate visual localization system that offers simple and flexible map construction and easy adaptation to new environments. Such a system holds great practical value for a range of applications, including warehouse robots and service robots. Existing visual localization systems that achieve high accuracy depend on a pre-built, accurate 3D scene map, which poses challenges in terms of map construction and consumes significant onboard storage, particularly for large scenes. Moreover, this effort must be repeated whenever the robot moves to a new scene. In this article, an accurate and robust 3D model-free visual localization system is proposed to handle this problem. Map construction is simplified to building a set of database images with associated camera poses, which is trivial as it amounts to adding posed images to a database. The core idea for achieving high accuracy and robustness is to model the local planar motion characteristic of general ground-moving robots in both the essential matrix estimation and the triangulation stages to obtain two minimal solutions. The proposed localization system simplifies the task of switching the robot between different application scenarios, reducing additional workload and lowering the difficulty of use.
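
To illustrate why a planar motion constraint reduces the number of feature matches needed, the following is a minimal sketch and not the solver proposed in the article. It assumes an idealized, strictly planar motion in which the camera y-axis is aligned with the ground-plane normal, so the relative rotation is a yaw angle `theta` and the translation direction is a single angle `phi`. The function names, the parametrization, and the synthetic points are all assumptions made for illustration. Under these assumptions the essential matrix E = [t]x R has only two degrees of freedom and a fixed pattern of zero entries.

```python
import numpy as np

def skew(t):
    """Cross-product matrix [t]x, so that skew(t) @ v == np.cross(t, v)."""
    tx, ty, tz = t
    return np.array([[0.0, -tz, ty], [tz, 0.0, -tx], [-ty, tx, 0.0]])

def planar_essential(theta, phi):
    """Essential matrix for an assumed strictly planar motion.

    theta: yaw angle about the camera y-axis (assumed plane normal).
    phi:   direction of the unit translation inside the x-z plane.
    """
    c, s = np.cos(theta), np.sin(theta)
    R = np.array([[c, 0.0, s], [0.0, 1.0, 0.0], [-s, 0.0, c]])
    t = np.array([np.sin(phi), 0.0, np.cos(phi)])
    return skew(t) @ R, R, t

def epipolar_residuals(E, x1, x2):
    """Row-wise x2^T E x1 for matched normalized image points."""
    return np.einsum("ni,ij,nj->n", x2, E, x1)

# Synthetic check with hypothetical values: points in front of camera 1,
# moved into camera 2 by X2 = R X1 + t, then projected to normalized coordinates.
rng = np.random.default_rng(0)
E, R, t = planar_essential(theta=0.3, phi=1.1)
X1 = rng.uniform([-1.0, -1.0, 4.0], [1.0, 1.0, 8.0], size=(20, 3))
X2 = X1 @ R.T + t
x1 = X1 / X1[:, 2:3]
x2 = X2 / X2[:, 2:3]
residuals = epipolar_residuals(E, x1, x2)
```

In this simplified setting every exact match satisfies the epipolar constraint, and E is determined by two angles rather than the five degrees of freedom of a general essential matrix, which is the intuition behind needing fewer matches per minimal sample during outlier rejection.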

Learn to Triangulate Scene Coordinates for Visual Localization

3D Model-free Visual Localization System from Essential Matrix under Local Planar Motion

2-Entity RANSAC for Robust Visual Localization in Changing Environment

Long-Term Map-Based Visual Localization: Analysis of Individual Components of a Hierarchical Pipeline

Leveraging Local Planar Motion Property for Robust Visual Matching and Localization

3D LiDAR-Based Global Localization Using Siamese Neural Network

LocNet: Global Localization in 3D Point Clouds for Mobile Vehicles

2-Entity Random Sample Consensus for Robust Visual Localization: Framework, Methods, and Verifications

Visual Localization in a Prior 3D LiDAR Map Combining Points and Lines

An Efficient Scene Coordinate Encoding and Relocalization Method

R-SCoRe: Revisiting Scene Coordinate Regression for Robust Large-Scale Visual Localization

CurriculumLoc: Enhancing Cross-Domain Geolocalization Through Multistage Refinement

Scene Coordinate Regression with Angle-Based Reprojection Loss for Camera Relocalization

SGLoc: Scene Geometry Encoding for Outdoor LiDAR Localization

Large Scale Joint Semantic Re-Localisation and Scene Understanding via Globally Unique Instance Coordinate Regression

Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras

Global Localization Based on Road-centric 3D Point Cloud Descriptor in Urban Environments

Decoupling Features and Coordinates for Few-shot RGB Relocalization

Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer

Learning Scene Adaptive Covariance Error Model of LiDAR Scan Matching for Fusion Based Localization