Attention Residual Network with 3D convolutional neural network for 3D Human Pose Estimation.

Jianyu Yan, Kuizhi Mei
DOI: https://doi.org/10.1109/RCAR52367.2021.9517546
2021-01-01
Abstract: 3D human pose estimation is an important task in human-computer interaction. Most state-of-the-art methods are based either on 3D convolutional neural networks (CNNs) that take a voxelized grid as input or on 2D CNNs that take depth images as input. Perspective distortion can badly affect the performance of depth image-based methods, and the degradation problem can badly affect the performance of 3D CNN-based methods. To overcome the perspective distortion problem, we use a voxelized grid as the input of the proposed network. The network outputs a heatmap for each joint, and our method then obtains the 3D coordinate of each joint from its heatmap. To overcome the degradation problem and the vanishing gradient problem, our network uses residual blocks, which make deeper networks easier to train. An attention mechanism helps our method focus on the more important information. We evaluated our method on a public human pose dataset and on a human pose dataset that we collected ourselves. The experimental results show the efficiency of our method.
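The abstract describes a voxelized grid as input and a per-joint heatmap as output, without giving the details of either step. The following is a minimal NumPy sketch of what those two steps could look like, not the authors' implementation. The function names `voxelize` and `heatmaps_to_joints`, the parameters `grid_size`, `cube_len`, and `center`, the binary occupancy encoding, and the use of a plain argmax to read a coordinate from a heatmap are all assumptions made for illustration.

```python
import numpy as np


def voxelize(points, grid_size=32, cube_len=2.0, center=(0.0, 0.0, 0.0)):
    """Turn an (N, 3) point cloud into a binary occupancy grid.

    Assumption: the grid is a cube of side `cube_len` centered at `center`;
    points that fall outside the cube are dropped.
    """
    points = np.asarray(points, dtype=np.float64)
    center = np.asarray(center, dtype=np.float64)
    voxel_len = cube_len / grid_size
    # Shift so the cube's minimum corner is the origin, then scale to indices.
    idx = np.floor((points - center + cube_len / 2.0) / voxel_len).astype(np.int64)
    inside = np.all((idx >= 0) & (idx < grid_size), axis=1)
    grid = np.zeros((grid_size,) * 3, dtype=np.float32)
    grid[tuple(idx[inside].T)] = 1.0
    return grid


def heatmaps_to_joints(heatmaps, cube_len=2.0, center=(0.0, 0.0, 0.0)):
    """Read one 3D coordinate per joint from (J, D, D, D) heatmaps.

    Assumption: the coordinate is the center of the highest-scoring voxel
    (argmax); the paper may use a different read-out.
    """
    heatmaps = np.asarray(heatmaps)
    num_joints, depth = heatmaps.shape[0], heatmaps.shape[1]
    flat = heatmaps.reshape(num_joints, -1).argmax(axis=1)
    idx = np.stack(np.unravel_index(flat, heatmaps.shape[1:]), axis=1)
    voxel_len = cube_len / depth
    # Map voxel indices back to metric coordinates at the voxel centers.
    return (idx + 0.5) * voxel_len - cube_len / 2.0 + np.asarray(center)
```

In this sketch the same cube definition is used for both directions, so a joint predicted in voxel space maps back into the coordinate frame of the original point cloud. An argmax read-out limits precision to one voxel; a soft-argmax over the heatmap is a common alternative when sub-voxel accuracy is needed.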