An Object Perception and Positioning Method Via Deep Perception Learning Object Detection

Limei Xiao, Yachao Zhang, Weizhe Gao, Dayou Xu, Ce Li
DOI: https://doi.org/10.1002/cpe.6203
2021-01-01
Concurrency and Computation: Practice and Experience
Abstract: One of the fundamental problems in building perception systems for robots is providing semantic information together with positioning in three-dimensional (3D) space. Two-dimensional (2D) object detectors, however, provide only semantic information and pixel coordinates in 2D space, while a depth image reflects relative distance but gives a poor semantic description of the object. In this article, a novel object perception and positioning method via deep perception learning object detection is proposed. First, the RGB image and the depth image are collected with a Kinect, and the depth image is processed to ensure the robustness of the model. Then, the semantic label and pixel location of each object are obtained from the RGB image by a deep-learning-based object detector. Finally, object size measurement and 3D positioning are realized by combining the pixel location with the depth information. As a result, the model effectively combines the advantages of an accurate 2D detector with those of accurate depth information. Experimental results demonstrate that the method achieves high accuracy in size measurement and spatial positioning.
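The final step of the abstract, combining a detected pixel location with depth to obtain a 3D position and a size, can be illustrated with a minimal sketch. The abstract does not state the projection model, so this sketch assumes a standard pinhole camera; the names `Intrinsics`, `robust_depth`, `backproject`, and `box_to_position_and_size`, the median-based depth filtering, and the intrinsic values in the usage note are all hypothetical and not taken from the paper.

```python
from dataclasses import dataclass
from statistics import median
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Intrinsics:
    """Pinhole camera intrinsics in pixels (assumed model, not from the paper)."""
    fx: float
    fy: float
    cx: float
    cy: float


def robust_depth(samples_m: Sequence[float]) -> float:
    """Median of the valid (non-zero) depth samples inside a box, in metres.

    Assumption: zero marks a missing depth reading, as is common for
    structured-light sensors; the paper's own depth processing may differ.
    """
    valid = [d for d in samples_m if d > 0]
    if not valid:
        raise ValueError("no valid depth samples")
    return median(valid)


def backproject(u: float, v: float, depth_m: float,
                k: Intrinsics) -> Tuple[float, float, float]:
    """Map pixel (u, v) at the given depth to camera-frame (X, Y, Z) in metres."""
    x = (u - k.cx) * depth_m / k.fx
    y = (v - k.cy) * depth_m / k.fy
    return (x, y, depth_m)


def box_to_position_and_size(
    box: Tuple[float, float, float, float], depth_m: float, k: Intrinsics
) -> Tuple[Tuple[float, float, float], Tuple[float, float]]:
    """Turn a 2D box (u_min, v_min, u_max, v_max) into a 3D centre and metric size.

    Assumes the object is roughly fronto-parallel, so one depth value
    scales both the width and the height of the box.
    """
    u_min, v_min, u_max, v_max = box
    centre = backproject((u_min + u_max) / 2, (v_min + v_max) / 2, depth_m, k)
    width_m = (u_max - u_min) * depth_m / k.fx
    height_m = (v_max - v_min) * depth_m / k.fy
    return centre, (width_m, height_m)
```

As a usage example under these assumptions, a 100 x 100 pixel box centred on the principal point, with `Intrinsics(500.0, 500.0, 320.0, 240.0)` and a depth of 2.0 m, maps to a position on the optical axis and a size of about 0.4 m per side. In practice the intrinsics would come from calibrating the actual sensor.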