C2FNet: Coarse-to-Fine Keypoint Localization Network for Monocular 6D Object Pose Estimation

Jiahao Sun,Xin Ma,Yibin Li
DOI: https://doi.org/10.1109/cac59555.2023.10451826
2023-01-01
Abstract:Estimating the 6D object pose from a singular RGB image is a fundamental task in the field of computer vision. Recent studies have demonstrated that keypoint-based methods exhibit remarkable efficacy. Such methodologies initially identify keypoints and subsequently deduce the object”s pose by addressing a Perspective-n-Point (PnP) problem. Nonetheless., the prevalent approach of directly regressing the 2D coordinates of keypoints renders these methods sensitive to occlusions. Instead., we inaugurate a coarse-to-fine keypoint localization network (C2FNet)., where we utilize the predicted keypoint offsets from the first stage as input to deformable convolutional networks (DCN) in the second stage., to further capture geometric features and enhance spatial relationships between keypoints., thus outputting accurate and robust keypoint coordinates. Experiments indicate that the proposed approach surpasses existing advanced methods in terms of performance on the LINEMOD and Occlusion LINEMOD datasets.
What problem does this paper attempt to address?