6D pose estimation of 3D objects in scenes with mutual similarities and occlusions

Tiandi Chen, Lifeng Sun
DOI: https://doi.org/10.1007/978-3-030-00665-5_93
2019-01-01
Abstract: Estimating the six-degrees-of-freedom (6DoF) pose of rigid bodies is an essential problem in fields such as robotics and virtual reality. This paper proposes a method that accurately estimates the 6DoF pose of known rigid bodies from RGB and RGB-D input images. The method also works well in scenes where objects exhibit mutual similarities and occlusions. The first contribution is a modular pipeline that integrates deep learning with multi-view geometry. First, an instance segmentation network identifies the approximate locations of known objects in the images and outputs their bounding boxes and masks. Next, a coarse 6DoF pose is estimated from local features of the RGB-D images and templates. Finally, a purely geometric method refines the estimate. The second contribution is the correction of misclassifications using information retained from the network training process. The proposed method achieves superior performance on a challenging public dataset.
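The abstract does not say which geometric method is used for the final refinement step. As an illustration only, the sketch below shows one common building block of geometric pose refinement: a least-squares rigid alignment (the Kabsch algorithm) between model points and their matched scene points. The function name `estimate_rigid_transform` and the assumption that point correspondences are already known are hypothetical and are not taken from the paper.

```python
import numpy as np


def estimate_rigid_transform(src, dst):
    """Least-squares rotation R and translation t such that dst ≈ src @ R.T + t.

    Illustrative Kabsch alignment; assumes src[i] corresponds to dst[i].
    Both inputs are (N, 3) arrays with N >= 3 non-collinear points.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)

    # Center both point sets on their centroids.
    src_centroid = src.mean(axis=0)
    dst_centroid = dst.mean(axis=0)

    # 3x3 cross-covariance between the centered sets.
    H = (src - src_centroid).T @ (dst - dst_centroid)
    U, _, Vt = np.linalg.svd(H)

    # Flip the last axis if needed so R is a proper rotation (det = +1),
    # not a reflection.
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_centroid - R @ src_centroid
    return R, t
```

The returned rotation (three degrees of freedom) and translation (three degrees of freedom) together form a 6DoF pose. In an iterative scheme such as ICP, a step like this would be repeated after re-matching points, but whether the paper does so is not stated in the abstract.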