Abstract:Abstract Estimating the 6D poses of industrial parts is a fundamental task in automated industries. However, the scarcity of industrial part datasets and the effort involved to retrain networks present challenges when estimating poses for unseen parts. Although a few pre-trained networks demonstrate effectiveness on unseen objects, they often struggle to encode correct viewpoint for unseen industrial parts, which have significant geometrical differences compared to the pre-trained objects. Additionally, they overlook the viewpoint non-uniformity that frequently occurs in industrial settings, resulting in significant 3D rotation errors. To address these issues, a novel 6D pose estimator for unseen industrial parts is proposed. First, a Self-to-Inter (S2I) viewpoint encoder is introduced to efficiently generate discriminative descriptors that capture the viewpoint information of the observed image. The S2I viewpoint encoder utilizes an Inter-viewpoint attention module to facilitate prior viewpoint communication and leverages a saliency descriptor selection strategy to boost inference speed. Second, a Viewpoint Alignment module (VAM) is established and integrated with the ICP refiner. The VAM aligns non-uniform viewpoints in an analytical paradigm, leading to enhanced efficiency of the refinement process and more accurate final predictions. Experimental results on the LINEMOD dataset demonstrate competitive performance compared to state-of-the-art methods. Furthermore, the experiments conducted on eight unseen industrial parts validate the exceptional generalizability of our method, highlighting its potential in industrial applications.

PAV-Net: Point-wise Attention Keypoints Voting Network for Real-time 6D Object Pose Estimation

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

Recurrent Volume-based 3D Feature Fusion for Real-time Multi-view Object Pose Estimation

Recurrent Volume-Based 3-D Feature Fusion for Real-Time Multiview Object Pose Estimation.

Context-Guided Adaptive Network for Efficient Human Pose Estimation.

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

Attention Guided 6D Object Pose Estimation with Multi-constraints Voting Network

PVNet: Pixel-Wise Voting Network for 6dof Object Pose Estimation.

PA-Pose: Partial Point Cloud Fusion Based on Reliable Alignment for 6D Pose Tracking

A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene

PANet: A Pixel-Level Attention Network for 6D Pose Estimation With Embedding Vector Features

A Dynamic Keypoints Selection Network for 6DoF Pose Estimation

PAM:Point-wise Attention Module for 6D Object Pose Estimation

Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction

EP-Net: More Efficient Pose Estimation Network with the Classification-based Key-points Detection

Efficient Encoding and Aligning Viewpoints for 6D Pose Estimation of Unseen Industrial Parts

PPR-Net: Point-wise Pose Regression Network for Instance Segmentation and 6D Pose Estimation in Bin-picking Scenarios.

KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation

P$^2$GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation

Prior-information-guided corresponding point regression network for 6D pose estimation

Adaptive Pose Estimation Algorithm Based on Point Pair Feature