HFE-Net: Hierarchical Feature Extraction and Coordinate Conversion of Point Cloud for Object 6D Pose Estimation

Ze Shen,Hao Chu,Fei Wang,Yi Guo,Shangdong Liu,Shuai Han
DOI: https://doi.org/10.1007/s00521-023-09241-1
2024-01-01
Abstract:The current challenging problems of learning a robust 6D pose lie in noise in RGB/RGBD images, sparsity of point cloud and severe occlusion. To tackle the problems, object geometric information is critical. In this work, we present a novel pipeline for 6DoF object pose estimation. Unlike previous methods that directly regressing pose parameters and predicting keypoints, we tackle this challenging task with a point-pair based approach and leverage geometric information as much as possible. Specifically, at the representation learning stage, we build a point cloud network locally modeling CNN to encode point cloud, which is able to extract effective geometric features while the point cloud is projected into a high-dimensional space. Moreover, we design a coordinate conversion network to regress point cloud in the object coordinate system in a decoded way. Then, the pose could be calculated through point pairs matching algorithm. Experimental results show that our method achieves state-of-the-art performance on several datasets.
What problem does this paper attempt to address?