Pose-Guided Human Parsing By An And/Or Graph Using Pose-Context Features

Fangting Xia,Jun Zhu,Peng Wang,Alan L. Yuille
DOI: https://doi.org/10.1609/aaai.v30i1.10460
2016-01-01
Abstract:Parsing human into semantic parts is crucial to human-centric analysis. In this paper, we propose a human parsing pipeline that uses pose cues, i.e., estimates of human joint locations, to provide pose-guided segment proposals for semantic parts. These segment proposals are ranked using standard appearance cues, deep-learned semantic feature, and a novel pose feature called pose-context. Then these proposals are selected and assembled using an And-Or graph to output a parse of the person. The And-Or graph is able to deal with large human appearance variability due to pose, choice of clothes, etc. We evaluate our approach on the popular Penn-Fudan pedestrian parsing dataset, showing that it significantly outperforms the state-of-the-arts, and perform diagnostics to demonstrate the effectiveness of different stages of our pipeline.
What problem does this paper attempt to address?