Abstract:The 6-D pose estimation is a crucial task in vision-based measurement for robotic manipulation. It becomes a challenging task because of the variety of lighting conditions, cluttered background, occlusion, and texture-less objects. The various lighting conditions and texture-less objects lead to dramatic changes for imaging. In this article, we propose an edge-attention 6-D pose estimation network (EANet) for texture-less objects that achieve the autonomous perception of the edge, which is invariant when the lighting conditions change and objects are texture-less. To achieve this, EANet adopts a multitask learning strategy that introduces edge cues into the pixelwise dense fusion framework. We design a shared-weight edge extractor serving for edge reconstruction and pose estimation simultaneously. The purpose of edge reconstruction is to guide the network to pay more attention to edges, so as to enhance the performance of pose estimation implicitly. The edge cues can further improve the performance of pose refinement. Furthermore, we apply a skip-connection scheme to the edge extractor, merging feature maps of the deep and shallow layers. Due to the lightweight design, EANet obtains an inference time of 20 frames per second (FPS) on a desktop PC with an RTX 2070 GPU, which satisfies the requirement of most real-time applications. Experiments show that our network has a significant effect on texture-less objects and approaches the state-of-the-art performance on both the LINEMOD and the YCB-Video datasets. Finally, we present a PARTS dataset to test real-world performance and apply EANet to bin-picking vision measurement, which demonstrates that our method can provide a complete and robust solution for vision-based pose measurement.

EP-Net: More Efficient Pose Estimation Network with the Classification-based Key-points Detection

3D Point-to-Keypoint Voting Network for 6D Pose Estimation

Context-Guided Adaptive Network for Efficient Human Pose Estimation.

PAV-Net: Point-wise Attention Keypoints Voting Network for Real-time 6D Object Pose Estimation

EANet: Edge-Attention 6D Pose Estimation Network for Texture-Less Objects

PVNet: Pixel-Wise Voting Network for 6dof Object Pose Estimation.

Cascaded Pyramid Network for Multi-Person Pose Estimation

Attention Guided 6D Object Pose Estimation with Multi-constraints Voting Network

PVN3D: A Deep Point-wise 3D Keypoints Voting Network for 6DoF Pose Estimation

Robust 6-DoF Pose Estimation under Hybrid Constraints

A Dynamic Keypoints Selection Network for 6DoF Pose Estimation

A 3D Keypoints Voting Network for 6DoF Pose Estimation in Indoor Scene

6-D Object Pose Estimation Based on Point Pair Matching for Robotic Grasp Detection

REG-Net: Improving 6DoF Object Pose Estimation with 2D Keypoint Long-Short-Range-Aware Registration

Multi-View Keypoints for Reliable 6D Object Pose Estimation

Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation.

KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation

SEMPose: A Single End-to-end Network for Multi-object Pose Estimation

EP-Net: Improving Point Cloud Learning Efficiency Through Feature Decoupling

Instance-level 6D pose estimation based on multi-task parameter sharing for robotic grasping