Toward Real-Time 3D Object Recognition: A Lightweight Volumetric CNN Framework Using Multitask Learning

Shuaifeng Zhi,Yongxiang Liu,Xiang Li,Yulan Guo
DOI: https://doi.org/10.1016/j.cag.2017.10.007
IF: 1.821
2018-01-01
Computers & Graphics
Abstract:3D data are becoming increasingly popular and easier to access, making 3D information increasingly important for object recognition. Although volumetric convolutional neural networks (CNNs) have been exploited to recognize 3D objects and have achieved notable progress, their computational cost is too high for real-time applications. In this paper, we propose a lightweight volumetric CNN architecture (namely, LightNet) to address the real-time 3D object recognition problem leveraging on multitask learning. We use LightNet to simultaneously predict class and orientation labels from complete and partial shapes. In contrast to the earlier version of this method presented at 3DOR 2017, this extended version introduces batch normalization and better training strategies to improve the recognition accuracy, and also includes more experiments on the newly released large-scale ShapeNet Core55 dataset. Our model has been evaluated on three publicly available benchmarks of complete 3D CAD shapes and incomplete point clouds. Experimental results show that our model achieves the state-of-the-art 3D object recognition performance among shallow volumetric CNNs with the smallest number of training parameters. It is also demonstrated that our method can perform accurate object recognition in real time (less than 6 ms). (C) 2017 Elsevier Ltd. All rights reserved.
What problem does this paper attempt to address?