Abstract: 6-D pose estimation is essential for reliable robotic grasping. The prevalent approach relies on keypoint correspondences: it must define keypoints on the object, detect and localize them in real scenes, and then solve the pose with a random sample consensus (RANSAC)-based perspective-n-point (PnP) algorithm. This solver is nondifferentiable, so the loss cannot be backpropagated through it during training. Direct regression, by contrast, is fast and differentiable, but its pose estimation performance falls short and needs improvement. To address these gaps, we propose PPM6D, a new method for 6-D object pose estimation based on regression and point pair matching. First, a cross-fusion module fuses RGB features and point cloud features so that they complement each other. Next, an attention module adjusts the features of the object's 3-D model. Finally, a point pair matching module matches points and their features, yielding an integrated matching and fusion. PPM6D is trained and tested on the LINEMOD, Occlusion LINEMOD (LINEMOD-occ), YCB-Video, and T-LESS benchmark datasets. Experimental results show that PPM6D can outperform many keypoint-based pose estimation methods while remaining relatively fast, offering new ideas for regression-based pose estimation. In real-world object pose estimation and grasping tasks on a Baxter robot, PPM6D performs better than most alternatives.

Pose-guided Auto-Encoder and Feature-Based Refinement for 6-Dof Object Pose Regression

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

FEIF: Feature Excitation and Interactive Fusion for 6D Object Pose Estimation

Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation

RFFCE: Residual Feature Fusion and Confidence Evaluation Network for 6dof Pose Estimation

Efficient Encoding and Aligning Viewpoints for 6D Pose Estimation of Unseen Industrial Parts

PA-Pose: Partial Point Cloud Fusion Based on Reliable Alignment for 6D Pose Tracking

PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation

A Geometry-Enhanced 6D Pose Estimation Network with Incomplete Shape Recovery for Industrial Parts

REG-Net: Improving 6DoF Object Pose Estimation with 2D Keypoint Long-Short-Range-Aware Registration

Iterative Pose Refinement for Object Pose Estimation Based on RGBD Data

CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-Dof Object Pose Estimation

Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images

SO-Pose: Exploiting Self-Occlusion for Direct 6D Pose Estimation

NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation

DeepRM: Deep Recurrent Matching for 6D Pose Refinement

6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images