Human Pose Estimation Based on Efficient and Lightweight High-Resolution Network (EL-HRNet)

Rui Li,An Yan,Shiqiang Yang,Duo He,Xin Zeng,Hongyan Liu
DOI: https://doi.org/10.3390/s24020396
IF: 3.9
2024-01-10
Sensors
Abstract:As an important direction in computer vision, human pose estimation has received extensive attention in recent years. A High-Resolution Network (HRNet) can achieve effective estimation results as a classical human pose estimation method. However, the complex structure of the model is not conducive to deployment under limited computer resources. Therefore, an improved Efficient and Lightweight HRNet (EL-HRNet) model is proposed. In detail, point-wise and grouped convolutions were used to construct a lightweight residual module, replacing the original 3 × 3 module to reduce the parameters. To compensate for the information loss caused by the network's lightweight nature, the Convolutional Block Attention Module (CBAM) is introduced after the new lightweight residual module to construct the Lightweight Attention Basicblock (LA-Basicblock) module to achieve high-precision human pose estimation. To verify the effectiveness of the proposed EL-HRNet, experiments were carried out using the COCO2017 and MPII datasets. The experimental results show that the EL-HRNet model requires only 5 million parameters and 2.0 GFlops calculations and achieves an AP score of 67.1% on the COCO2017 validation set. In addition, PCKh@0.5mean is 87.7% on the MPII validation set, and EL-HRNet shows a good balance between model complexity and human pose estimation accuracy.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
This paper focuses on the problem of human pose estimation in the field of computer vision. Current methods such as HRNet achieve high accuracy results, but their complex structures make it difficult to deploy with limited computational resources. Therefore, this paper proposes an improved efficient lightweight high-resolution network (EL-HRNet). EL-HRNet builds lightweight residual modules using point convolutions and grouped convolutions to reduce the number of parameters. It also introduces an attention mechanism - Convolutional Block Attention Module (CBAM) to compensate for the information loss caused by network lightweighting, thus achieving high-precision human pose estimation. The effectiveness of EL-HRNet is verified on the COCO2017 and MPII datasets, where it maintains low model complexity while achieving good pose estimation accuracy. The objective of this research is to balance model complexity with pose estimation accuracy and improve the efficiency of the model in practical applications.