2D-to-3D Projection for Monocular and Multi-View 3D Multi-class Object Detection in Indoor Scenes

D. D. Rukhovich,
DOI: https://doi.org/10.17587/prin.12.459-469
2021-12-16
PROGRAMMNAYA INGENERIA
Abstract:In this paper, we propose a novel method of joint 3D object detection and room layout estimation. The proposed method surpasses all existing methods of 3D object detection from monocular images on the indoor SUN RGB-D dataset. Moreover, the proposed method shows competitive results on the ScanNet dataset in multi-view mode. Both these datasets are collected in various residential, administrative, educational and industrial spaces, and altogether they cover almost all possible use cases. Moreover, we are the first to formulate and solve a problem of multi-class 3D object detection from multi-view inputs in indoor scenes. The proposed method can be integrated into the controlling systems of mobile robots. The results of this study can be used to address a navigation task, as well as path planning, capturing and manipulating scene objects, and semantic scene mapping.
What problem does this paper attempt to address?