Abstract:Depth sensor based 3D human motion estimation hardware such as Kinect has made interactive applications more popular recently. However, it is still challenging to accurately recognize postures from a single depth camera due to the inherently noisy data derived from depth images and self-occluding action performed by the user. In this paper, we propose a new real-time probabilistic framework to enhance the accuracy of live captured postures that belong to one of the action classes in the database. We adopt the Gaussian Process model as a prior to leverage the position data obtained from Kinect and marker-based motion capture system. We also incorporate a temporal consistency term into the optimization framework to constrain the velocity variations between successive frames. To ensure that the reconstructed posture resembles the accurate parts of the observed posture, we embed a set of joint reliability measurements into the optimization framework. A major drawback of Gaussian Process is its cubic learning complexity when dealing with a large database due to the inverse of a covariance matrix. To solve the problem, we propose a new method based on a local mixture of Gaussian Processes, in which Gaussian Processes are defined in local regions of the state space. Due to the significantly decreased sample size in each local Gaussian Process, the learning time is greatly reduced. At the same time, the prediction speed is enhanced as the weighted mean prediction for a given sample is determined by the nearby local models only. Our system also allows incrementally updating a specific local Gaussian Process in real time, which enhances the likelihood of adapting to run-time postures that are different from those in the database. Experimental results demonstrate that our system can generate high quality postures even under severe self-occlusion situations, which is beneficial for real-time applications such as motion-based gaming and sport training.

Temporal-Spatial local gaussian process experts for human pose estimation

Human motion tracking by temporal-spatial local gaussian process experts

Temporal Constrained Feasible Subspace Learning for Human Pose Forecasting

Exploring Severe Occlusion: Multi-Person 3D Pose Estimation with Gated Convolution.

Discriminative Estimation of 3D Human Pose Using Gaussian Processes.

Latent Gaussian Mixture Regression for Human Pose Estimation

Monocular Tracking 3D People by Gaussian Process Spatio-Temporal Variable Model

GLPose: Global-Local Representation Learning for Human Pose Estimation

STN-enhanced Message Passing Guided by Adversarial Learning for Human Pose Estimation

Gaussian process for human motion modeling: A comparative study

An Improved 3D Human Pose Estimation Model Based on Temporal Convolution with Gaussian Error Linear Units

Multi-Channel Spatio-Temporal GCN for Human Pose Forecasting

SPGNet: Spatial Projection Guided 3D Human Pose Estimation in Low Dimensional Space

Kinect Posture Reconstruction Based on a Local Mixture of Gaussian Process Models

Generative Estimation Of 3d Human Pose Using Shape Contexts Matching

GLA-GCN: Global-local Adaptive Graph Convolutional Network for 3D Human Pose Estimation from Monocular Video

Discriminative Learning of Visual Words for 3D Human Pose Estimation

Towards Locality Similarity Preserving to 3D Human Pose Estimation.

Exploring Temporal Consistency for Human Pose Estimation in Videos

Learning Temporal-Spatial Contextual Adaptation for Three-Dimensional Human Pose Estimation

Human Pose Estimation with Regression by Fusing Multi-View Visual Information