Human Pose Estimation Based on Efficient and Lightweight High-Resolution Network (EL-HRNet)

Rui Li,An Yan,Shiqiang Yang,Duo He,Xin Zeng,Hongyan Liu

DOI: https://doi.org/10.3390/s24020396

IF: 3.9

2024-01-10

Sensors

Abstract:As an important direction in computer vision, human pose estimation has received extensive attention in recent years. A High-Resolution Network (HRNet) can achieve effective estimation results as a classical human pose estimation method. However, the complex structure of the model is not conducive to deployment under limited computer resources. Therefore, an improved Efficient and Lightweight HRNet (EL-HRNet) model is proposed. In detail, point-wise and grouped convolutions were used to construct a lightweight residual module, replacing the original 3 × 3 module to reduce the parameters. To compensate for the information loss caused by the network's lightweight nature, the Convolutional Block Attention Module (CBAM) is introduced after the new lightweight residual module to construct the Lightweight Attention Basicblock (LA-Basicblock) module to achieve high-precision human pose estimation. To verify the effectiveness of the proposed EL-HRNet, experiments were carried out using the COCO2017 and MPII datasets. The experimental results show that the EL-HRNet model requires only 5 million parameters and 2.0 GFlops calculations and achieves an AP score of 67.1% on the COCO2017 validation set. In addition, PCKh@0.5mean is 87.7% on the MPII validation set, and EL-HRNet shows a good balance between model complexity and human pose estimation accuracy.

engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation

What problem does this paper attempt to address?

This paper focuses on the problem of human pose estimation in the field of computer vision. Current methods such as HRNet achieve high accuracy results, but their complex structures make it difficult to deploy with limited computational resources. Therefore, this paper proposes an improved efficient lightweight high-resolution network (EL-HRNet). EL-HRNet builds lightweight residual modules using point convolutions and grouped convolutions to reduce the number of parameters. It also introduces an attention mechanism - Convolutional Block Attention Module (CBAM) to compensate for the information loss caused by network lightweighting, thus achieving high-precision human pose estimation. The effectiveness of EL-HRNet is verified on the COCO2017 and MPII datasets, where it maintains low model complexity while achieving good pose estimation accuracy. The objective of this research is to balance model complexity with pose estimation accuracy and improve the efficiency of the model in practical applications.

Human Pose Estimation Based on Efficient and Lightweight High-Resolution Network (EL-HRNet)

X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention

Context-Guided Adaptive Network for Efficient Human Pose Estimation.

A-HRNet: Attention Based High Resolution Network for Human Pose Estimation

Lightweight high-resolution network based on adaptive cross-dimensional weighting for human pose estimation

Adaptively Fusing Complete Multi-resolution Features for Human Pose Estimation.

EG-HRNet: an Efficient High-Resolution Network Using Ghost-Modules for Human Pose Estimation

Lite-HRNet: A Lightweight High-Resolution Network

An improved lightweight high-resolution network based on multi-dimensional weighting for human pose estimation

Lightweight high-performance pose recognition network: HR-LiteNet

Multi-Stage HRNet: Multiple Stage High-Resolution Network for Human Pose Estimation

Research on Lightweight High-resolution Network Human Pose Estimation Based on Self-attention

Simple and Lightweight Human Pose Estimation

Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention

Optimized S2E Attention Block based Convolutional Network for Human Pose Estimation

Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation

BiHRNet: A Binary high-resolution network for Human Pose Estimation

DSC-HRNet: a lightweight teaching pose estimation model with depthwise separable convolution and deep high-resolution representation learning in computer-aided education

Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation

Ghost attentional down net: An effective lightweight top-down network for human pose estimation

HEViTPose: High-Efficiency Vision Transformer for Human Pose Estimation