Abstract:Human pose estimation plays a critical role in human-centred vision applications. Its influence extends to various aspects of daily life, from healthcare diagnostics and sports training to augmented reality experiences and gesture-controlled interfaces. While current approaches have achieved impressive accuracy, their high model complexity and slow detection speeds significantly limit their deployment on edge devices with limited computing power, such as mobile phones and IoT devices. In this paper, we introduce a novel lightweight network for 2D human pose estimation, called lightweight stochastic depth network (LSDNet). Our approach is based on the observation that the majority of HRNet's parameters are located in the middle and later stages in the network. We reduce some unnecessary branches to significantly reduce these parameters. This is achieved by leveraging the Bernoulli distribution to randomly remove these redundant branches, which improves the network's efficiency while also increasing its robustness. To further reduce the network's parameter count, we introduce two lightweight blocks with simple yet effective architectures. These blocks achieve significant parameter reduction while maintaining good accuracy. Furthermore, we leverage coordinate attention to effectively fuse features from different branches and scales. This mechanism captures both inter-channel dependencies and spatial context, enabling the network to accurately localize keypoints across the human body. We evaluated the effectiveness of our method on the MPII and COCO datasets, demonstrating superior results on human pose estimation compared to popular lightweight networks. Our code is available at: https://github.com/illusory2333/LSDNet.

LGCANet: lightweight hand pose estimation network based on HRNet

Context-Guided Adaptive Network for Efficient Human Pose Estimation.

X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention

Real-Time Facial Landmark Detection by Attention-driven Lightweight Network

Ghost attentional down net: An effective lightweight top-down network for human pose estimation

Lightweight high-performance pose recognition network: HR-LiteNet

Human Pose Estimation Based on Efficient and Lightweight High-Resolution Network (EL-HRNet)

A Lightweight Hand Attitude Estimation Method Based on GCN Feature Enhancement

Research on Lightweight High-resolution Network Human Pose Estimation Based on Self-attention

EG-HRNet: an Efficient High-Resolution Network Using Ghost-Modules for Human Pose Estimation

Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation

LSDNet: lightweight stochastic depth network for human pose estimation

A Lightweight Hand-Gesture Recognition Network With Feature Fusion Prefiltering and FMCW Radar Spatial Angle Estimation

A Robust Context Attention Network for Human Hand Detection

Human Pose Estimation Based on Lightweight Multi-Scale Coordinate Attention

HAN: An Efficient Hierarchical Self-Attention Network for Skeleton-Based Gesture Recognition

InterNet+: A Light Network for Hand Pose Estimation

A-HRNet: Attention Based High Resolution Network for Human Pose Estimation

Lite-HRNet: A Lightweight High-Resolution Network

FGDSNet: A Lightweight Hand Gesture Recognition Network for Human Robot Interaction

Lightweight high-resolution network based on adaptive cross-dimensional weighting for human pose estimation