Pose Graph Parsing Network for Human-Object Interaction Detection

Zhan Su,Yuting Wang,Qing Xie,Ruiyun Yu
DOI: https://doi.org/10.1016/j.neucom.2021.12.085
IF: 6
2021-01-01
Neurocomputing
Abstract:The detection of interactions between humans and objects is one of the core issues in the area of scene understanding in image analysis. The conventional method is to pair the human body with the object as an entity and pay attention to the human spatial area and object. However, this method does not consider two key aspects: humans use certain body parts to interact with objects, and correlations exist between different body parts. Thus, in this paper, we propose a pose graph parsing network (PGPN) for human-object interaction detection. Specifically, we construct a multibranch network to study high-level semantic features. In addition to emphasizing the appearance area of each instance in an image, feature propagation based on a pose graph is further adopted to consider the features of correlation between different body parts. Furthermore, a branch refines and captures the relationship between human parts and an object using a human pose. We validate this approach on the V-COCO and HICO-DET datasets and compare it with the state-of-the-arts. In comparison to other models, the PGPN superior and significantly improves the performance of human-object interaction detection.
What problem does this paper attempt to address?