Depth Privileged Object Detection in Indoor Scenes Via Deformation Hallucination.

Zhijie Zhang,Yan Liu,Junjie Chen,Li Niu,Liqing Zhang
DOI: https://doi.org/10.1609/aaai.v35i4.16459
2021-01-01
Proceedings of the AAAI Conference on Artificial Intelligence
Abstract:RGB-D object detection has achieved significant advance, because depth provides complementary geometric information to RGB images. Considering that depth images are unavailable in some scenarios, we focus on depth privileged object detection in indoor scenes, where the depth images are only available in the training stage. Under this setting, one prevalent research line is modality hallucination, in which depth image and depth feature are common hallucination targets. In contrast, we choose to hallucinate depth deformation, which benefits a lot from rich geometric information in depth data. Specifically, we employ the deformable convolutional layer with augmented offsets to perform geometric deformation, because the offsets enable flexibly sampling over the object and transforming to a canonical shape for ease of object detection. In addition, we design a quality-based weighted transfer loss to avoid negative transfer of depth deformation. Experimental results on NYUDv2 and SUN RGB-D demonstrate the effectiveness of our method against the state-of-the-art methods for depth privileged object detection.
What problem does this paper attempt to address?