Abstract:Due to the development of the computer vision, machine learning, and deep learning technologies, the research community focuses not only on the traditional SLAM problems, such as geometric mapping and localization, but also on semantic SLAM. In this paper, we propose a Semantic SLAM system which builds the semantic maps with object-level entities, and it is integrated into the RGB-D SLAM framework. The system combines object detection module that is realized by the deep-learning method, and localization module with RGB-D SLAM seamlessly. In the proposed system, object detection module is used to perform object detection and recognition, and localization module is utilized to get the exact location of the camera. The two modules are integrated together to obtain the semantic maps of the environment. Furthermore, to improve the computational efficiency of the framework, an improved Octomap based on the Fast Line Rasterization Algorithm is constructed. Meanwhile, for the sake of accuracy and robustness of the semantic map, conditional random field is employed to do the optimization. Finally, we evaluate our Semantic SLAM through three different tasks, i.e., localization, object detection, and mapping. Specifically, the accuracy of localization and the mapping speed is evaluated on TUM data set. Compared with ORB-SLAM2 and original RGB-D SLAM, our system, respectively, got 72.9% and 91.2% improvements in dynamic environments localization evaluated by root-mean-square error. With the improved Octomap, the proposed Semantic SLAM is 66.5% faster than the original RGB-D SLAM. We also demonstrate the efficiency of object detection through quantitative evaluation in an automated inventory management task on a real-world data sets recorded over a realistic office.

Monocular Semantic Mapping Based on 3D Cuboids Tracking.

Object-aware Semantic Mapping of Indoor Scenes Using Octomap

Monocular Semantic SLAM using Object-pose-graph Constraints

Semi-Dense 3D Semantic Mapping from Monocular SLAM

From Satellite to Ground: Satellite Assisted Visual Localization with Cross-view Semantic Matching

Monocular SLAM for Large Scale Scenes

Hybrid Semi-Dense 3D Semantic-Topological Mapping From Stereo Visual-Inertial Odometry SLAM With Loop Closure Detection

CubeSLAM: Monocular 3D Object SLAM

Large-Scale 3D Semantic Mapping Using Monocular Vision

Object-Oriented 3D Semantic Mapping Based on Instance Segmentation

Multi-Objective Location and Mapping Based on Deep Learning and Visual Slam

SQ-SLAM: Monocular Semantic SLAM Based on Superquadric Object Representation

Object SLAM Based on Spatial Layout and Semantic Consistency

Multimodal sensor-based semantic 3D mapping for a large-scale environment

Semantic SLAM Based on Object Detection and Improved Octomap

Utilization of Semantic Planes: Improved Localization and Dense Semantic Map for Monocular SLAM in Urban Environment

Semi-Direct Multimap SLAM System for Real-Time Sparse 3-D Map Reconstruction

An Approach for Construct Semantic Map with Scene Classification and Object Semantic Segmentation

Compact 3D Map-Based Monocular Localization Using Semantic Edge Alignment

Building and optimization of 3D semantic map based on Lidar and camera fusion

RS-SLAM: Real time semantic slam with driverless car using LiDAR-Camera-IMU sensing