Indoor Instance-Aware Semantic Mapping Using Instance Segmentation

Yinpeng Jiang, Xudong Ma, Fang, Xuewen Kang
DOI: https://doi.org/10.1109/ccdc52312.2021.9602282
2021-01-01
Abstract: To give robots the scene understanding they need to carry out complex tasks in home environments, we adopt a novel instance segmentation method to build an instance-level 3D semantic map that captures the categories, positions, and interrelationships of object instances in the environment. Unlike previous methods, which focus on a single kind of feature, either geometric or visual, we learn geometric and visual features jointly, distinguish object instances from background regions, and create a feature voxel grid of the environment. The proposed 3D-RPN takes this grid as input and uses cuboid bounding boxes to predict each instance and its category. A mask prediction branch then binarizes the voxels within each bounding box to determine exactly which voxels the instance occupies. Our method borrows the idea of Mask R-CNN, and its main body is built from 3D and 2D convolutional networks, making full use of both 2D and 3D features. We tested the method on ScanNet and S3DIS, two large-scale indoor scene datasets, and the experiments show that it finds and identifies instance information more accurately than previous methods.
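The mask step described in the abstract, binarizing the voxels inside each cuboid bounding box, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `binarize_instance_mask`, the half-open integer box layout `(x0, y0, z0, x1, y1, z1)`, and the 0.5 threshold are all assumptions made for illustration, and the per-voxel probabilities are assumed to come from some mask prediction branch.

```python
import numpy as np


def binarize_instance_mask(mask_probs, box, grid_shape, threshold=0.5):
    """Threshold per-voxel scores inside one box and place them in the scene grid.

    mask_probs: float array of shape (x1 - x0, y1 - y0, z1 - z0) holding the
        predicted foreground probability of every voxel inside the box.
    box: (x0, y0, z0, x1, y1, z1) integer voxel indices, half-open
        (hypothetical layout, chosen for this sketch).
    grid_shape: shape of the full scene voxel grid.
    threshold: assumed cut-off; voxels scoring at or above it count as instance.
    """
    x0, y0, z0, x1, y1, z1 = box
    expected_shape = (x1 - x0, y1 - y0, z1 - z0)
    if mask_probs.shape != expected_shape:
        raise ValueError(
            f"mask shape {mask_probs.shape} does not match box extent {expected_shape}"
        )

    # Start from an all-background grid, then mark only voxels inside the box.
    scene_mask = np.zeros(grid_shape, dtype=bool)
    scene_mask[x0:x1, y0:y1, z0:z1] = mask_probs >= threshold
    return scene_mask


# Toy example: a 1 x 2 x 2 box inside a 3 x 2 x 2 scene grid.
probs = np.array([[[0.9, 0.2], [0.6, 0.4]]])
instance_mask = binarize_instance_mask(probs, (1, 0, 0, 2, 2, 2), (3, 2, 2))
```

Each detected instance would get its own boolean grid of this kind, so instances can be told apart even where their bounding boxes overlap.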