SimpleCut: A Simple and Strong 2D Model for Multi-Person Pose Estimation

Tewodros Legesse Munea,Chenhui Yang,Chenxi Huang,Mohammed A. M. Elhassan,Qingkai Zhen
DOI: https://doi.org/10.1016/j.cviu.2022.103509
IF: 4.886
2022-01-01
Computer Vision and Image Understanding
Abstract:This article proposes a simple and efficient multi-person pose estimation model which follows a bottomup approach and is based on a few deconvolutional layers added on a U-net lookalike ResNet featuremap. SimpleCut contains four independent modules: joints module, coordinates (coords) module, main-joint pairing module, and other-joints pairing module. The joints module builds a score on joints of each individual on the image, whereas the coords module encodes the location of those joints, and both the pairing modules generate image-conditioned pairing of the joints on a small scale. The pairing modules help set up the proposals into a variable number of consistent body part configurations by an optimization strategy that efficiently brings significant speed-up factors. We demonstrated that simultaneously inferring these bottom-up representations of detection and association encode global context sufficiently well to allow a greedy parse to attain highquality results with low computational cost. SimpleCut evaluated on three publicly available large-scale dataset benchmarks such as ms-coco, lspet, and mpii human pose dataset.
What problem does this paper attempt to address?