Abstract: Hand pose and shape estimation plays an important role in numerous applications. A cost-effective and practical approach is to estimate the hand accurately from a single RGB image, but this task is challenging due to ubiquitous hand self-occlusion and occlusions caused by hand-object interaction. In this paper, we propose a novel SPMHand network to alleviate the effect of occlusions, inspired by the way humans infer the whole hand when part of it is occluded. The proposed SPMHand consists of two main modules, which generate hand segmentations as guidance and conduct hand regressions in a progressive multi-path manner. The segmentation-guided deocclusion module enables the network to see the occluded hand by inferring the whole-hand segmentation. Specifically, the visible hand segmentation is obtained first, and then a hand morphology attention block is introduced to infer the whole-hand segmentation by fusing the visible information with the learned hand features. The progressive multi-path regression module is designed to gradually regress the fine hand with intermediate supervision. Features from deep to shallow are utilized for hand regressions that progress from coarse to decent. Subsequently, the structure feature, joint heatmaps, and segmentations that provide guidance for deocclusion are embedded and fused for the final fine hand regression. Experiments on four challenging datasets show that the proposed SPMHand outperforms state-of-the-art methods in both real-world and synthetic scenes, especially in the presence of severe hand-object occlusions.

SegPoseNet: Segmentation-Guided 3D Hand Pose Estimation

SPMHand: Segmentation-guided Progressive Multi-path 3D Hand Pose and Shape Estimation

QMGR-Net: quaternion multi-graph reasoning network for 3D hand pose estimation

HandFormer: Hand Pose Reconstructing from a Single RGB Image

3D Hand Shape and Pose Estimation from a Single RGB Image

RetinaHand: Towards Accurate Single-Stage Hand Pose Estimation

Multistage 3D Hand Pose Estimation Algorithm Based on Skeleton Points

3D Hand Pose Estimation and Shape Reconstruction Based on Multi-task Learning

Hand3D: Hand Pose Estimation using 3D Neural Network

Interacting two-hand instance segmentation and pose estimation based on attention-induced separation

Simultaneous 3D Hand Detection and Pose Estimation Using Single Depth Images

A Multi-task Interaction Mechanism for 3D Hand Pose Estimation from RGB Image

Learning Hand Latent Features for Unsupervised 3D Hand Pose Estimation

3D Hand Pose and Shape Estimation from Monocular RGB Via Efficient 2D Cues

Towards Good Practices for Deep 3D Hand Pose Estimation

Weakly Supervised Segmentation Guided Hand Pose Estimation During Interaction with Unknown Objects

Attention-Based Pose Sequence Machine for 3D Hand Pose Estimation

HMTNet: 3D Hand Pose Estimation from Single Depth Image Based on Hand Morphological Topology

DeepHPS: End-to-end Estimation of 3D Hand Pose and Shape by Learning from Synthetic Depth

A hybrid network for estimating 3D interacting hand pose from a single RGB image

Silhouette-Net: 3D Hand Pose Estimation from Silhouettes