Deep learning based 3D target detection for indoor scenes

Ying Liu,Du Jiang,Chao Xu,Ying Sun,Guozhang Jiang,Bo Tao,Xiliang Tong,Manman Xu,Gongfa Li,Juntong Yun
DOI: https://doi.org/10.1007/s10489-022-03888-4
IF: 5.3
2022-08-17
Applied Intelligence
Abstract:3D target detection is a research hotspot in recent years. In the field of autonomous driving, 3D target detection is mainly targeted at outdoor scenes that the camera height is constant. In a few indoor scenes, 3D target detection is mostly at the category level. However, it is difficult to generate instance-level 3D target detection datasets. In complex indoor scenes, instance-level 3D target detection is used as the research object in this paper. The indoor 3D target detection dataset is constructed by Aruco marker. A pixel-by-pixel key point voting network for joint semantic segmentation of RGB images is established, and a new key point assumption strategy is proposed. Combined with depth images, the key point detection is extended to three dimensions, and the bit pose is optimized by the ICP algorithm. The evaluation metrics and visualization of the model are analyzed and compared. It is tested and visualized under validation set, truncated validation set and unlabeled. The generalization of the method in this paper is proved, and 3D target detection in indoor scene based on RGB image and RGB-D image is achieved.
computer science, artificial intelligence
What problem does this paper attempt to address?