End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation

Thomas Pöllabauer, Jiayin Li, Volker Knauthe, Sarah Berkei, Arjan Kuijper
2024-09-18
Abstract: 6D object pose estimation is the problem of identifying the position and orientation of an object relative to a chosen coordinate system, which is a core technology for modern XR applications. State-of-the-art 6D object pose estimators directly predict an object pose given an object observation. Due to the ill-posed nature of the pose estimation problem, where multiple different poses can correspond to a single observation, generating additional plausible estimates per observation can be valuable. To address this, we reformulate the state-of-the-art algorithm GDRNPP and introduce EPRO-GDR (End-to-End Probabilistic Geometry-Guided Regression). Instead of predicting a single pose per detection, we estimate a probability density distribution of the pose. Using the evaluation procedure defined by the BOP (Benchmark for 6D Object Pose Estimation) Challenge, we test our approach on four of its core datasets and demonstrate superior quantitative results for EPRO-GDR on LM-O, YCB-V, and ITODD. Our probabilistic solution shows that predicting a pose distribution instead of a single pose can improve state-of-the-art single-view pose estimation while providing the additional benefit of being able to sample multiple meaningful pose candidates.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issues of pose ambiguity and inaccuracy in 6 degrees of freedom (6DoF) object pose estimation. Specifically:

1. **Pose Ambiguity**: In certain situations, the same observation may correspond to multiple different poses. For example, when an object is partially occluded or has symmetry, a single pose prediction may not accurately describe the true pose of the object. Therefore, generating multiple plausible pose estimates is valuable.
2. **Pose Estimation Inaccuracy**: Existing pose estimation methods typically predict a single pose from a single observation, which can lead to inaccuracies due to scene characteristics such as occlusion.

To tackle these issues, the authors improved the existing state-of-the-art algorithm GDRNPP and introduced EPRO-GDR (End-to-End Probabilistic Geometry-Guided Regression). The main improvements of EPRO-GDR are:

- **Probability Density Distribution Prediction**: Instead of directly predicting a single pose, EPRO-GDR estimates a probability density distribution of poses, allowing for the generation of multiple meaningful pose candidates.
- **Uncertainty Representation**: Through the probability density distribution, EPRO-GDR not only provides pose estimates but also offers a measure of uncertainty for any sampled pose.

These improvements enable EPRO-GDR to outperform the baseline method GDRNPP on multiple datasets, particularly excelling in handling heavily occluded and low-texture objects.
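
To make the idea of sampling several candidates from a pose distribution concrete, the following is a minimal sketch and not the EPRO-GDR implementation. It assumes the distribution is approximated by a small discrete set of pose hypotheses with unnormalized log-densities. The helper names (`normalize_log_weights`, `sample_pose_candidates`), the simplified pose encoding (yaw angle plus translation), and all numbers are hypothetical and chosen only for illustration.

```python
import math
import random


def normalize_log_weights(log_weights):
    """Convert unnormalized log-densities into probabilities (softmax)."""
    m = max(log_weights)  # subtract the max for numerical stability
    exps = [math.exp(w - m) for w in log_weights]
    total = sum(exps)
    return [e / total for e in exps]


def sample_pose_candidates(poses, log_weights, k, seed=0):
    """Draw k pose hypotheses, each paired with its probability."""
    probs = normalize_log_weights(log_weights)
    rng = random.Random(seed)
    indices = rng.choices(range(len(poses)), weights=probs, k=k)
    return [(poses[i], probs[i]) for i in indices]


# Hypothetical hypotheses for an object whose distinguishing feature is
# occluded: three yaw angles explain the observation equally well, while a
# fourth pose is far less plausible. Each pose is (yaw in degrees,
# translation in metres). The log-densities are made up for this example.
poses = [
    (0.0, (0.10, 0.00, 0.50)),
    (90.0, (0.10, 0.00, 0.50)),
    (180.0, (0.10, 0.00, 0.50)),
    (45.0, (0.30, 0.20, 0.90)),
]
log_weights = [-1.0, -1.0, -1.0, -6.0]

probs = normalize_log_weights(log_weights)
candidates = sample_pose_candidates(poses, log_weights, k=5)

# A single-pose estimator would report only the most probable hypothesis.
best_pose = poses[max(range(len(poses)), key=probs.__getitem__)]

for pose, p in candidates:
    print(f"yaw={pose[0]:6.1f} deg  t={pose[1]}  p={p:.3f}")
```

In this toy setting the three equally likely yaw angles share almost all of the probability mass, so a single reported pose hides the ambiguity. The sampled candidates expose it, and each comes with its probability, which can serve as a simple uncertainty measure.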