Monocular Human Pose and Shape Reconstruction Using Part Differentiable Rendering

Min Wang,Feng Qiu,Wentao Liu,Chen Qian,Xiaowei Zhou,Lizhuang Ma
DOI: https://doi.org/10.1111/cgf.14150
IF: 2.5
2020-01-01
Computer Graphics Forum
Abstract:Superior human pose and shape reconstruction from monocular images depends onremoving the ambiguities caused by occlusions and shape variance. Recent workssucceed in regression-based methods which estimate parametric models directlythrough a deep neural network supervised by 3D ground truth. However, 3D groundtruth is neither in abundance nor can efficiently be obtained. In this paper,we introduce body part segmentation as critical supervision. Part segmentationnot only indicates the shape of each body part but helps to infer theocclusions among parts as well. To improve the reconstruction with partsegmentation, we propose a part-level differentiable renderer that enablespart-based models to be supervised by part segmentation in neural networks oroptimization loops. We also introduce a general parametric model engaged in therendering pipeline as an intermediate representation between skeletons anddetailed shapes, which consists of primitive geometries for betterinterpretability. The proposed approach combines parameter regression, bodymodel optimization, and detailed model registration altogether. Experimentalresults demonstrate that the proposed method achieves balanced evaluation onpose and shape, and outperforms the state-of-the-art approaches on Human3.6M,UP-3D and LSP datasets.
What problem does this paper attempt to address?