Dilated Hourglass Networks for Human Pose Estimation

Yudong Zhang,Jing Liu,Kaiyu Huang
DOI: https://doi.org/10.1109/cac.2018.8623582
2018-01-01
Abstract:Human pose estimation, a research hotspot of computer vision, has been a key and hard task. Recently, significant progress has been achieved using the newly emergent deep convolutional neural network technique. Many methods first apply the bottom-up and top-down structure, which uses first downsampling and then upsampling. This structure can capture multi-scale feature but lead to loss of information due to downsampling. In this work, we propose a Dilated Convolutional Module (DCM) based on dilated convolution and skip connections. The proposed module, which aims to reduce the loss of information, can expand receptive field and reduce the times of downsampling. Using DCMs as building blocks, this work proposes a novel convolutional network named Dilated Hourglass Network, based on Stacked Hourglass Network. The proposed network is compared with the Stacked Hourglass Network on the standard benchmarks MPII dataset for human pose estimation, which shows that the proposed network achieves a notable progress.
What problem does this paper attempt to address?