Discovering HOI Semantics from Massive Image Data

Mingguang Zheng,Shouhong Wan,Peiquan Jin
DOI: https://doi.org/10.1007/978-3-030-86475-0_25
2021-01-01
Abstract:Human-Object Interaction (HOI) plays an important role in human-centric scene understanding. However, the commonly used two-stage methods have large computational costs and a slow inferring speed. The existing one-stage methods detect HOIs by detecting the central points or the union boxes of human and objects, which need to process a large scale of regions and many unnecessary features. In this paper, we propose a novel one-stage method for discovering HOI semantics from massive image data. In particular, we present two new designs in our method, namely action classification and displacement prediction. Further, we design a special HOI score calculation strategy, which can decay the HOI score of the results that have bad matching result. We evaluate our method on the popular HICO-DET benchmark and compare our proposal with a number of existing approaches. The results show that our method outperforms existing methods in discovering HOI semantics. abstract environment.
What problem does this paper attempt to address?