Abstract:Multi-camera-based simultaneous localization and mapping (SLAM) has been widely applied in various mobile robots under uncertain or unknown environments to accomplish tasks autonomously. However, the conventional purely data-driven feature extraction methods cannot utilize the rich semantic information in the environment, which leads to the performance of the SLAM system being susceptible to various interferences. In this work, we present a semantic-aware multi-level information fusion scheme for robust global orientation estimation. Specifically, a visual semantic perception system based on the synthesized surround view image is proposed for the multi-eye surround vision system widely used in mobile robots, which is used to obtain the visual semantic information required for SLAM tasks. The original multi-eye image was first transformed to the synthesized surround view image, and the passable space was extracted with the help of the semantic segmentation network model as a mask for feature extraction; moreover, the hybrid edge information was extracted to effectively eliminate the distorted edges by further using the distortion characteristics of the reverse perspective projection process. Then, the hybrid semantic information was used for robust global orientation estimation; thus, better localization performance was obtained. The experiments on an intelligent vehicle, which was used for automated valet parking both in indoor and outdoor scenes, showed that the proposed hybrid multi-level information fusion method achieved at least a 10-percent improvement in comparison with other edge segmentation methods, the average orientation estimation error being between 1 and 2 degrees, much smaller than other methods, and the trajectory drift value of the proposed method was much smaller than that of other methods.

Vision Global Localization with Semantic Segmentation and Interest Feature Points

3D LiDAR-Based Global Localization Using Siamese Neural Network

Persistent Stereo Visual Localization on Cross-Modal Invariant Map

Unifying Terrain Awareness Through Real-Time Semantic Segmentation

Multimodal Localization: Stereo over LiDAR Map

Communication Constrained Cloud-Based Long-Term Visual Localization in Real Time.

LocNet: Global Localization in 3D Point Clouds for Mobile Robots.

Monocular Localization with Semantics Map for Autonomous Vehicles

Leveraging Local Planar Motion Property for Robust Visual Matching and Localization.

From Satellite to Ground: Satellite Assisted Visual Localization with Cross-view Semantic Matching

Visual Semantic Localization based on HD Map for Autonomous Vehicles in Urban Scenarios

Map-assisted Visual Localization Using Line Features in Urban Area

Semantic Image Alignment for Vehicle Localization

Visual-Marker-Based Localization for Flat-Variation Scene

Semantic-Structure-Aware Multi-Level Information Fusion for Robust Global Orientation Optimization of Autonomous Mobile Robots

RoadMap: A Light-Weight Semantic Map for Visual Localization towards Autonomous Driving

A Method of Vision Aided GNSS Positioning Using Semantic Information in Complex Urban Environment

Monocular Vehicle Self-localization Method Based on Compact Semantic Map

Learning Visual Semantic Map-Matching for Loosely Multi-Sensor Fusion Localization of Autonomous Vehicles

Robust and Precise Vehicle Localization based on Multi-sensor Fusion in Diverse City Scenes

Beyond Cross-view Image Retrieval: Highly Accurate Vehicle Localization Using Satellite Image