Abstract:3D human pose estimation using monocular images is an important yet challenging task. Existing 3D pose detection methods exhibit excellent performance under normal conditions however their performance may degrade due to occlusion. Recently some occlusion aware methods have also been proposed, however, the occlusion handling capability of these networks has not yet been thoroughly investigated. In the current work, we propose an occlusion-guided 3D human pose estimation framework and quantify its occlusion handling capability by using different protocols. The proposed method estimates more accurate 3D human poses using 2D skeletons with missing joints as input. Missing joints are handled by introducing occlusion guidance that provides extra information about the absence or presence of a joint. Temporal information has also been exploited to better estimate the missing joints. A large number of experiments are performed for the quantification of occlusion handling capability of the proposed method on three publicly available datasets in various settings including random missing joints, fixed body parts missing, and complete frames missing, using mean per joint position error criterion. In addition to that, the quality of the predicted 3D poses is also evaluated using action classification performance as a criterion. 3D poses estimated by the proposed method achieved significantly improved action recognition performance in the presence of missing joints. Our experiments demonstrate the effectiveness of the proposed framework for handling the missing joints as well as quantification of the occlusion handling capability of the deep neural networks.

Occlusion-aware Hand Pose Estimation Using Hierarchical Mixture Density Network

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution.

Unsupervised Universal Hierarchical Multi-Person 3D Pose Estimation for Natural Scenes

3D Human Pose Estimation using Spatio-Temporal Networks with Explicit Occlusion Training

Explicit Occlusion Reasoning for Multi-person 3D Human Pose Estimation

Spatial Attention Deep Net with Partial PSO for Hierarchical Hybrid Hand Pose Estimation.

Cascaded hierarchical CNN for 2D hand pose estimation from a single color image

Occlusion-Aware Human Pose Estimation with Mixtures of Sub-Trees

Fast and Accurate 3D Hand Pose Estimation via Recurrent Neural Network for Capturing Hand Articulations

Monocular 3D Human Pose Estimation by Predicting Depth on Joints

Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images

A hybrid network for estimating 3D interacting hand pose from a single RGB image

HDPose: Post-Hierarchical Diffusion with Conditioning for 3D Human Pose Estimation

Hand Pose Estimation with Attention-and-Sequence Network.

Hand3D: Hand Pose Estimation using 3D Neural Network

A comprehensive framework for occluded human pose estimation

Recurrent 3D Hand Pose Estimation Using Cascaded Pose-Guided 3D Alignments

Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework

Occluded Human Pose Estimation based on Limb Joint Augmentation

3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal

Attention Guided 6D Object Pose Estimation with Multi-constraints Voting Network