Channel sifted model for pose estimation
Shuren Zhou, Liang Peng
DOI: https://doi.org/10.1007/s10489-022-04091-1
IF: 5.3
2022-09-03
Applied Intelligence
Abstract: In recent years, human pose estimation, also called keypoint estimation, has been a research hotspot in the field of computer vision. However, most methods focus on achieving high accuracy and ignore the computational cost. These methods usually build their networks from deeper layers and more complex structures, which causes a surge in computational cost and makes them difficult to apply in real environments with real-time requirements. Some methods try to reduce the computational cost by lowering the precision of parameters or using simpler structures. However, they achieve only low performance, so they are also difficult to apply in real environments. Inspired by lightweight network design, this paper proposes a channel sifted method for human pose estimation, called the channel sifted network (CSN). A lightweight ResNet is used as the backbone, and a channel scoring module (CSM) is applied. Both parts aim to balance the computational cost and the prediction accuracy of the network; the lower the computational cost, the faster the inference. In the experiments, we first train and test the network on the COCO2017 and MPII datasets, and then demonstrate the effectiveness of the lightweight backbone and the CSM through an ablation study. Compared with other lightweight models, CSN achieves higher performance.
computer science, artificial intelligence