A Real-Time Fusion Framework for Long-term Visual Localization

Yuchen Yang,Xudong Zhang,Shuang Gao,Jixiang Wan,Yishan Ping,Yuyue Liu,Jijunnan Li,Yandong Guo
DOI: https://doi.org/10.48550/arXiv.2210.09757
2022-10-18
Abstract:Visual localization is a fundamental task that regresses the 6 Degree Of Freedom (6DoF) poses with image features in order to serve the high precision localization requests in many robotics applications. Degenerate conditions like motion blur, illumination changes and environment variations place great challenges in this task. Fusion with additional information, such as sequential information and Inertial Measurement Unit (IMU) inputs, would greatly assist such problems. In this paper, we present an efficient client-server visual localization architecture that fuses global and local pose estimations to realize promising precision and efficiency. We include additional geometry hints in mapping and global pose regressing modules to improve the measurement quality. A loosely coupled fusion policy is adopted to leverage the computation complexity and accuracy. We conduct the evaluations on two typical open-source benchmarks, 4Seasons and OpenLORIS. Quantitative results prove that our framework has competitive performance with respect to other state-of-the-art visual localization solutions.
Computer Vision and Pattern Recognition,Robotics
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to achieve efficient and high - precision visual localization in long - time - changing scenes. Specifically, the paper focuses on how to improve the precision and efficiency of visual localization by fusing global and local pose estimations under degradation conditions such as motion blur, illumination change and environmental change. The paper proposes a real - time fusion framework based on the client - server architecture, uses additional geometric hints and IMU inputs to improve the measurement quality, and adopts a loosely - coupled fusion strategy to balance the computational complexity and accuracy. This framework aims to overcome the long - tail problems that cannot be solved by single - visual information and provides a solution that can maintain high performance under various challenging conditions.