PLPose: A Bottom-up Lightweight Pose Estimation Detection Model

Qi Xiao,Ru Zhao,Gang Shi,Dapeng Deng
DOI: https://doi.org/10.1109/cvidl62147.2024.10603654
2024-01-01
Abstract:Bottom-up pose estimation model has important practical significance for resource-constrained scenarios. The task involves detecting human skeletal keypoints and processing limb correlation. Detecting human skeletal keypoints is a finegrained localization task, while limb correlation requires a large receptive field to capture the dependency between keypoints. Therefore, to enhance the performance of the pose estimation model requires comprehensive consideration of both fine-grained features and the size of the receptive field. In this paper, we investigate a precise lightweight openpose (PLPose) using mobilenetv3 as a backbone network. The feature extraction and region focusing effects at key points of the human body are enhanced by the design and introduction of encoder-decoder blocks. While maintaining a large receptive field, multi-scale skip connections are introduced to transfer information from the underlying features to the reconstructed high-level features, enabling a more comprehensive feature representation of the model. Finally, the accuracy of keypoint localisation is improved by weighting the feature map according to the spatial distribution tendency of the keypoint location through the multi-channel synergistic fusion mechanism. The results show that the optimisation strategy adopted in this paper strikes a good balance between accuracy, model size and real-time performance in the pose estimation task.
What problem does this paper attempt to address?