End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation

Thomas Pöllabauer,Jiayin Li,Volker Knauthe,Sarah Berkei,Arjan Kuijper

2024-09-18

Abstract:6D object pose estimation is the problem of identifying the position and orientation of an object relative to a chosen coordinate system, which is a core technology for modern XR applications. State-of-the-art 6D object pose estimators directly predict an object pose given an object observation. Due to the ill-posed nature of the pose estimation problem, where multiple different poses can correspond to a single observation, generating additional plausible estimates per observation can be valuable. To address this, we reformulate the state-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End Probabilistic Geometry-Guided Regression). Instead of predicting a single pose per detection, we estimate a probability density distribution of the pose. Using the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose Estimation) Challenge, we test our approach on four of its core datasets and demonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and ITODD. Our probabilistic solution shows that predicting a pose distribution instead of a single pose can improve state-of-the-art single-view pose estimation while providing the additional benefit of being able to sample multiple meaningful pose candidates.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper attempts to address the issues of pose ambiguity and inaccuracy in 6 degrees of freedom (6DoF) object pose estimation. Specifically: 1. **Pose Ambiguity**: In certain situations, the same observation may correspond to multiple different poses. For example, when an object is partially occluded or has symmetry, a single pose prediction may not accurately describe the true pose of the object. Therefore, generating multiple plausible pose estimates is valuable. 2. **Pose Estimation Inaccuracy**: Existing pose estimation methods typically predict a single pose from a single observation, which can lead to inaccuracies due to scene characteristics such as occlusion. To tackle these issues, the authors improved the existing state-of-the-art algorithm GDRNPP and introduced EPRO-GDR (End-to-End Probabilistic Geometry-Guided Regression). The main improvements of EPRO-GDR are: - **Probability Density Distribution Prediction**: Instead of directly predicting a single pose, EPRO-GDR estimates a probability density distribution of poses, allowing for the generation of multiple meaningful pose candidates. - **Uncertainty Representation**: Through the probability density distribution, EPRO-GDR not only provides pose estimates but also offers a measure of uncertainty for any sampled pose. These improvements enable EPRO-GDR to outperform the baseline method GDRNPP on multiple datasets, particularly excelling in handling heavily occluded and low-texture objects.

End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting

Prior Geometry Guided Direct Regression Network for Monocular 6D Object Pose Estimation

RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images

A Geometry-Enhanced 6D Pose Estimation Network with Incomplete Shape Recovery for Industrial Parts

BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation with Geometric Consistency

Real-Time and Efficient 6-D Pose Estimation from a Single RGB Image

DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation

REDE: End-to-End Object 6D Pose Robust Estimation Using Differentiable Outliers Elimination

Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images

GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting

Geo6D: Geometric Constraints Learning for 6D Pose Estimation

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting

DGECN++: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism

End-to-End Differentiable 6DoF Object Pose Estimation with Local and Global Constraints

Exploring Multiple Geometric Representations for 6DoF Object Pose Estimation

End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation

6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting

Prior Geometry Guided Direct Regression Network for Monocular 6D Object Pose Estimation

RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images

A Geometry-Enhanced 6D Pose Estimation Network with Incomplete Shape Recovery for Industrial Parts

BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

GeoPose: Dense Reconstruction Guided 6D Object Pose Estimation with Geometric Consistency

Real-Time and Efficient 6-D Pose Estimation from a Single RGB Image

DGECN: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation

REDE: End-to-End Object 6D Pose Robust Estimation Using Differentiable Outliers Elimination

Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images

GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting

Geo6D: Geometric Constraints Learning for 6D Pose Estimation

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting

DGECN&#x002B;&#x002B;: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism

End-to-End Differentiable 6DoF Object Pose Estimation with Local and Global Constraints

Exploring Multiple Geometric Representations for 6DoF Object Pose Estimation

DGECN++: A Depth-Guided Edge Convolutional Network for End-to-End 6D Pose Estimation via Attention Mechanism