Exploring Depth Information for Spatial Relation Recognition

Xuewei Ding, Yehao Li, Yingwei Pan, Dan Zeng, Ting Yao
DOI: https://doi.org/10.1109/MIPR49039.2020.00065
2020-01-01
Abstract: It is widely believed that modeling the relative depth between objects helps in recognizing the spatial relations between pairs of objects in images, especially relations such as "behind" and "in front of." Nevertheless, there has been no evidence supporting this idea for spatial relation recognition. In this paper, we present a novel Depth-guided Spatial Relation Recognizer (DSRR) that predicts the spatial predicate for an object pair using the relative depth information between the two objects. Specifically, DSRR uses an off-the-shelf depth estimator to predict the depth information for each object. The depth cues for each pair of objects are then integrated with language (object name) and 2D (bounding box coordinates) cues to perform spatial relation reasoning. Extensive experiments on the SpatialSense dataset validate our proposal, and superior results are reported in comparison with state-of-the-art models.
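To make the three cue types concrete, the following is a minimal sketch of how depth, language, and 2D cues for one object pair might be assembled into a single feature vector. It is not the DSRR architecture: the helper names (`mean_depth`, `box_cues`, `name_cues`, `pair_features`), the one-hot name encoding, the use of mean depth per bounding box, and the "larger value = farther away" convention are all assumptions made for illustration. The toy `depth_map` stands in for the output of an off-the-shelf depth estimator.

```python
from statistics import mean

def mean_depth(depth_map, box):
    """Average depth inside box = (x0, y0, x1, y1); x1 and y1 are exclusive."""
    x0, y0, x1, y1 = box
    return mean(depth_map[y][x] for y in range(y0, y1) for x in range(x0, x1))

def box_cues(box, width, height):
    """2D cue: bounding box coordinates normalized by the image size."""
    x0, y0, x1, y1 = box
    return [x0 / width, y0 / height, x1 / width, y1 / height]

def name_cues(name, vocab):
    """Language cue: one-hot encoding of the object name (assumed encoding)."""
    return [1.0 if name == word else 0.0 for word in vocab]

def pair_features(depth_map, subj, obj, vocab):
    """Concatenate language, 2D, and depth cues for a (subject, object) pair."""
    height, width = len(depth_map), len(depth_map[0])
    d_subj = mean_depth(depth_map, subj["box"])
    d_obj = mean_depth(depth_map, obj["box"])
    # Relative depth: negative means the subject is closer than the object
    # under the assumed "larger = farther" convention.
    depth = [d_subj, d_obj, d_subj - d_obj]
    language = name_cues(subj["name"], vocab) + name_cues(obj["name"], vocab)
    boxes = (box_cues(subj["box"], width, height)
             + box_cues(obj["box"], width, height))
    return language + boxes + depth

# Toy 4x4 depth map: the left half is near (1.0), the right half is far (5.0).
depth_map = [[1.0, 1.0, 5.0, 5.0] for _ in range(4)]
vocab = ["cat", "sofa"]  # hypothetical object vocabulary
subj = {"name": "cat", "box": (0, 0, 2, 4)}
obj = {"name": "sofa", "box": (2, 0, 4, 4)}

features = pair_features(depth_map, subj, obj, vocab)
print(features)
```

In a full system, a vector like this would be passed to a classifier over spatial predicates; the relative depth term is the kind of signal that could separate "in front of" from "behind" when the 2D boxes alone are ambiguous.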