Distance-Aware Vector-Field and Vector Screening Strategy for 6D Object Pose Estimation.

Lichun Wang ,Chao Yang,Jianjia Xin,Baocai Yin
DOI: https://doi.org/10.1007/978-3-031-46308-2_31
2023-01-01
Abstract:6D object pose estimation calculates the rotation and translation matrices from the object coordinate system to the camera coordinate system and plays an important role in tasks such as robotic grasping. The voting-based 6D pose estimation method PVNet votes on a set of hypotheses to determine one as the estimation for real keypoint, and uses Perspective-n-Point (PnP) algorithm to calculate 6D pose based on the estimated keypoints. For improving the accuracy of estimated keypoints, the accuracy of hypotheses should be improved firstly. Since each hypothesis is an intersection computed with extended lines of two predicted unit vectors, three factors should be considered for improving its accuracy. The deviation of angle between predicted vector and real vector should be as small as possible. The angular deviation for predicted vectors of pixels farther away from keypoints should be smaller than that of those nearer. Any two approximately parallel or coincident predicted vectors should be prohibited to compute intersection. In light of the three points, this paper predicts vector-field instead of unit vector-field to take into account the distance from pixel to real keypoint, and proposes a distance-aware vector-field prediction loss which requires that the farther pixels from keypoints, the smaller the angular deviation for predicted vectors, and suggests a strategy for preventing approximately parallel or coincident predicted vectors from computing hypothesis. Experiments on LINEMOD and OCC-LINEMOD datasets show that our method achieves 5.9% and 8.4% improvement for the average accuracy of pose estimation in terms of ADD(-S) respectively compared with PVNet.
What problem does this paper attempt to address?