Abstract:Six-degree-of-freedom (6DoF) object pose estimation is a critical task for robot manipulation, autonomous vehicles, and augmented reality. Category-level 6DoF object pose estimation is trending because it can generalize to same-category unknown objects. However, existing mean shape based methods do not consider that predicting adjustment must model shape differences, which makes these methods still suffer from shape variations among same-category objects, limiting their accuracy. Also, existing methods overlook the importance of object segmentation to 6DoF pose estimation and use an RGB-based object segmentation method with low accuracy. To address these problems, we propose difference-aware shape adjustment and RGB-D feature fusion-based object segmentation for category-level 6DoF object pose estimation. The proposed method encodes shape differences, improving mean shape adjustment and alleviating same-category shape variations. Specifically, a difference-aware shape adjustment network (DASAN) is proposed to model shape differences between the object instance and mean shape by feature subtraction with an attention mechanism. We also propose an RGB-D feature fusion-based object segmentation method that uses a coarse-to-fine framework: a 2D detector and a novel RGB-D feature fusion-based binary classification network for coarse and fine segmentation. Experiments on two well-known datasets demonstrate the proposed method’s state-of-the-art (SOTA) pose estimation accuracy. In addition, we construct comparative experiments on the latest dataset (Wild6D) and a self-collected dataset (OBJECTS) and achieve high accuracies, demonstrating the strong generalizability of the proposed method. Also, we apply the proposed method to unknown object grasping, thus demonstrating the practicability of the proposed method.

Semantic Segmentation and 6DoF Pose Estimation using RGB-D Images and Deep Neural Networks

6D Pose Estimation Method of Metal Parts for Robotic Grasping Based on Semantic-Level Line Matching

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

Deep instance segmentation and 6D object pose estimation in cluttered scenes for robotic autonomous grasping

6D pose estimation of 3D objects in scenes with mutual similarities and occlusions

6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping

RGB-D-Based Pose Estimation of Workpieces with Semantic Segmentation and Point Cloud Registration

A Robotic Semantic Grasping Method for Pick-and-place Tasks

Six-dimensional Target Pose Estimation for Robot Autonomous Manipulation: Methodology and Verification

6D Pose Estimation of Industrial Parts Based on Point Cloud Geometric Information Prediction for Robotic Grasping

6-DoF grasp estimation method that fuses RGB-D data based on external attention

A Manufacturing-Oriented Intelligent Vision System Based on Deep Neural Network for Object Recognition and 6D Pose Estimation

A Method for Unseen Object Six Degrees of Freedom Pose Estimation Based on Segment Anything Model and Hybrid Distance Optimization

Fine segmentation and difference-aware shape adjustment for category-level 6DoF object pose estimation

6IMPOSE: bridging the reality gap in 6D pose estimation for robotic grasping

RFFCE: Residual Feature Fusion and Confidence Evaluation Network for 6dof Pose Estimation.

6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation

6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection

Instance segmentation based 6D pose estimation of industrial objects using point clouds for robotic bin-picking

Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

A Practical Robotic Grasping Method by Using 6-D Pose Estimation With Protective Correction