LHFF-Net: Local heterogeneous feature fusion network for 6DoF pose estimation

Fei Wang,Zhenquan He,Xing Zhang,Yong Jiang
DOI: https://doi.org/10.1007/s13042-021-01364-y
2021-07-13
International Journal of Machine Learning and Cybernetics
Abstract:<p class="a-plus-plus">Pose estimation based on RGB-D images is a hot issue that has attracted much attention in recent years. A key technical challenge is to extract features from depth information and image information separately and fully leverage the two complementary data sources. The previous methods ignored the internal connection of local features and the feature fusion of heterogeneous data, limiting the robustness and real-time performance in cluttered scenes. In this article, we propose LHFF-Net, a generic framework based on dynamic graph convolution to strengthen the information aggregation among all point clouds in a local region. After extracting heterogeneous features, we fuse information from two data sources in different receptive fields, to estimate the pose of the object while fully extracting local features. We show in experiments that the proposed approach outperforms state-of-the-art approaches on two challenging data sets, YCB-Video and LineMOD. We also have deployed our proposed method on the UR5 robot for grasping experiments and achieved good grasping performance.</p>
computer science, artificial intelligence
What problem does this paper attempt to address?