Depth-Based Lightweight Feature Fusion Network for Category-Level 6D Pose Estimation

Hui Yang,Wei Sun,Jian Liu,Chongpei Liu,Xing Zhang,Song Li
DOI: https://doi.org/10.1109/cac59555.2023.10451707
2023-01-01
Abstract:Estimating the 6D object pose has attracted a quantity of research attention because it is significant for intelligence manufacturing, autonomous driving, and virtual reality. Previous works have focused on using complex network structures to improve the accuracy of pose estimation, ignoring real-time performance. In this paper, a lightweight and effective network is proposed for high-precision category-level 6D pose estimation based on depth image. Specifically, our network has three steps. Firstly, we use PointNet++ to extract the geometric features of the instance and the shape prior, respectively. Then, the extracted geometric features are input to the proposed lightweight feature fusion encoder. By doing so, the shape prior geometric features are introduced into the instance to make its shape information more reliable, and the geometric information of the instance is fused into the shape prior so that it can have a better generalization effect on different instances within a category. Finally, the normalized object coordinate space (NOCS) model is reconstructed according to the output of the feature fusion encoder, and the 6D object pose is solved by point cloud registration of the reconstructed NOCS model. The experimental results on the benchmark REAL275 and CAMERA25 datasets demonstrate that the proposed network achieves excellent performance using only depth image.
What problem does this paper attempt to address?