Multistage Attention Network for Human Pose Estimation

Jingyang Zhou, Guangzhao Wen, Yu Zhang, Xin Geng
DOI: https://doi.org/10.1117/1.jei.31.6.063001
IF: 0.829
2022-01-01
Journal of Electronic Imaging
Abstract: Human pose estimation is a fundamental yet challenging task in computer vision. Although many methods have achieved significant improvements, some, such as the stacked hourglass network (SHNet), still do not adequately fuse feature maps from different stages. The SHNet is a classic human pose estimation network that extracts multiscale features through stacked multistage downsampling and upsampling operations. We propose a multistage attention mechanism to fuse the multistage feature maps and apply it to the SHNet to obtain a multistage attention network (MANet). Experiments on the Common Objects in Context (COCO) dataset and the MPII Human Pose dataset demonstrate the effectiveness of MANet for human pose estimation. © 2022 SPIE and IS&T
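The abstract does not spell out how the multistage attention mechanism is computed, so the following is only a minimal sketch of one generic way to fuse feature maps from several stages with attention weights. It is not the authors' MANet. The function name `fuse_stages`, the scoring matrix `w`, and the choice of global average pooling followed by a softmax across stages are all assumptions made for illustration.

```python
import numpy as np


def softmax(x, axis=0):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def fuse_stages(stage_maps, w):
    """Fuse S same-shaped feature maps with per-channel attention over stages.

    stage_maps: array-like of shape (S, C, H, W), one feature map per stage.
    w: hypothetical (C, C) scoring matrix; in a real network it would be learned.
    Returns the fused map of shape (C, H, W) and the weights of shape (S, C).
    """
    feats = np.asarray(stage_maps, dtype=np.float64)
    # Summarize each stage by global average pooling over the spatial axes.
    desc = feats.mean(axis=(2, 3))                 # (S, C)
    # Score every channel of every stage (assumed linear scoring).
    scores = desc @ w                              # (S, C)
    # Normalize across stages so each channel's weights sum to 1.
    attn = softmax(scores, axis=0)                 # (S, C)
    # Weighted sum of the stage maps, broadcasting weights over H and W.
    fused = (attn[:, :, None, None] * feats).sum(axis=0)
    return fused, attn


rng = np.random.default_rng(0)
maps = rng.standard_normal((3, 4, 8, 8))           # 3 stages, 4 channels, 8x8 maps
w = rng.standard_normal((4, 4))                    # stand-in for learned weights
fused, attn = fuse_stages(maps, w)
print(fused.shape)
```

Under these assumptions, setting `w` to all zeros makes every stage equally weighted, so the fusion reduces to a plain average of the stage maps. That gives a simple baseline against which learned attention weights could be compared.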