Abstract:In this paper we present a CNN based approach for a real time 3D-hand pose estimation from the depth sequence. Prior discriminative approaches have achieved remarkable success but are facing two main challenges: Firstly, the methods are fully supervised hence require large numbers of annotated training data to extract the dynamic information from a hand representation. Secondly, unreliable hand detectors based on strong assumptions or a weak detector which often fail in several situations like complex environment and multiple hands. In contrast to these methods, this paper presents an approach that can be considered as semi-supervised by performing predictive coding of image sequences of hand poses in order to capture latent features underlying a given image without supervision. The hand is modelled using a novel latent tree dependency model (LDTM) which transforms internal joint location to an explicit representation. Then the modeled hand topology is integrated with the pose estimator using data dependent method to jointly learn latent variables of the posterior pose appearance and the pose configuration respectively. Finally, an unsupervised error term which is a part of the recurrent architecture ensures smooth estimations of the final pose. Experiments on three challenging public datasets, ICVL, MSRA, and NYU demonstrate the significant performance of the proposed method which is comparable or better than state-of-the-art approaches.

Deep Conditional Variational Estimation for Depth-Based Hand Poses

Cross-Modal Deep Variational Hand Pose Estimation

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images

Improve Regression Network on Depth Hand Pose Estimation with Auxiliary Variable.

Learning a Deep Predictive Coding Network for a Semi-Supervised 3D-Hand Pose Estimation

Learning Hand Latent Features for Unsupervised 3D Hand Pose Estimation

Deep Predictive Neural Network: Unsupervised Learning for Hand Pose Estimation

Hand3D: Hand Pose Estimation using 3D Neural Network

3D Hand Pose Estimation Using Synthetic Data and Weakly Labeled RGB Images.

Pixel-wise Regression: 3D Hand Pose Estimation via Spatial-form Representation and Differentiable Decoder

Hand Pose Estimation Using Convolutional Neural Networks and Support Vector Regression.

Model-based Deep Hand Pose Estimation

Hand Pose Estimation via Latent 2.5D Heatmap Regression

Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation.

3D Hand Pose Estimation with Disentangled Cross-Modal Latent Space

3D Hand Shape and Pose from Images in the Wild

Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks

Residual Attention Regression For 3d Hand Pose Estimation

Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network

NETWORKS EFFECTIVELY UTILIZING 2D SPATIAL INFORMATION FOR ACCURATE 3D HAND POSE ESTIMATION

3D Hand Pose Estimation via Regularized Graph Representation Learning