Abstract: Category-level pose estimation is a challenging problem due to intra-class shape variations. Recent methods deform pre-computed shape priors to map the observed point cloud into the normalized object coordinate space and then retrieve the pose via post-processing, i.e., Umeyama's algorithm. This two-stage strategy has two shortcomings: 1) the surrogate supervision on the intermediate results cannot directly guide the learning of pose, resulting in large pose errors after post-processing; 2) the inference speed is limited by the post-processing step. To address these shortcomings, we propose SSP-Pose, an end-to-end trainable network for category-level pose estimation that integrates shape priors into a direct pose regression network. SSP-Pose stacks four individual branches on a shared feature extractor: two branches deform the prior model and match it with the observed instance, one branch directly regresses the full 9 degrees-of-freedom pose, and one branch performs symmetry reconstruction and point-wise inlier mask prediction. Consistency loss terms are then used to align the outputs of the different branches and improve performance. During inference, only the direct pose regression branch is needed. In this manner, SSP-Pose not only learns category-level pose-sensitive characteristics to boost performance but also maintains real-time inference speed. Moreover, we utilize the symmetry information of each category to guide the shape prior deformation, and propose a novel symmetry-aware loss to mitigate the matching ambiguity. Extensive experiments on public datasets demonstrate that SSP-Pose outperforms competing methods while running in real time at about 25 Hz. The code will be released soon.
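
For readers unfamiliar with the post-processing step that the abstract attributes to prior two-stage methods, the following is a minimal NumPy sketch of Umeyama's least-squares similarity alignment. It is not code from SSP-Pose or from any of the cited methods; the function name `umeyama_alignment` and the convention that `src` holds predicted normalized object coordinates while `dst` holds the corresponding observed points are assumptions made purely for illustration.

```python
import numpy as np


def umeyama_alignment(src, dst):
    """Estimate scale s, rotation R, translation t with dst ~= s * R @ src + t.

    src, dst: (N, 3) arrays of corresponding points (hypothetical convention:
    src = predicted normalized coordinates, dst = observed point cloud).
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    n, dim = src.shape

    # Center both point sets.
    mu_src = src.mean(axis=0)
    mu_dst = dst.mean(axis=0)
    src_c = src - mu_src
    dst_c = dst - mu_dst

    # Cross-covariance between target and source points.
    cov = dst_c.T @ src_c / n
    U, D, Vt = np.linalg.svd(cov)

    # Flip the last axis if needed so that R is a proper rotation (det = +1).
    S = np.eye(dim)
    if np.linalg.det(U) * np.linalg.det(Vt) < 0:
        S[-1, -1] = -1.0

    R = U @ S @ Vt
    var_src = (src_c ** 2).sum() / n          # variance of the source points
    scale = np.trace(np.diag(D) @ S) / var_src
    t = mu_dst - scale * R @ mu_src
    return scale, R, t
```

Because this step runs a singular value decomposition on predicted correspondences after the network's forward pass (and is often wrapped in an outlier-rejection loop in practice), it sits outside the training objective, which is the limitation the abstract points to when motivating direct pose regression.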

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

Zero-Shot 3D Pose Estimation of Unseen Object by Two-Step RGB-D Fusion

DualPoseNet: Category-level 6D Object Pose and Size Estimation Using Dual Pose Network with Refined Learning of Pose Consistency

GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

Leveraging SE(3) Equivariance for Self-Supervised Category-Level Object Pose Estimation

Synthetic Depth Image-based Category-Level Object Pose Estimation with Effective Pose Decoupling and Shape Optimization

Fine segmentation and difference-aware shape adjustment for category-level 6DoF object pose estimation

TG-Pose: Delving Into Topology and Geometry for Category-Level Object Pose Estimation

DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation

Semi-Decoupled 6D Pose Estimation Via Multi-Modal Feature Fusion

Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation

SD-Pose: Semantic Decomposition for Cross-Domain 6D Object Pose Estimation

HS-Pose: Hybrid Scope Feature Extraction for Category-level Object Pose Estimation

Learning Geometric Consistency and Discrepancy for Category-Level 6D Object Pose Estimation from Point Clouds

MH6D: Multi-Hypothesis Consistency Learning for Category-Level 6-D Object Pose Estimation

SSP-Pose: Symmetry-Aware Shape Prior Deformation for Direct Category-Level Object Pose Estimation

CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge

GPV-Pose: Category-level Object Pose Estimation Via Geometry-guided Point-wise Voting

DONet: Learning Category-Level 6D Object Pose and Size Estimation from Depth Observation