Scene Recognition and Object Detection in a Unified Convolutional Neural Network on a Mobile Manipulator

Hao Sun,Zehui Meng,Pey Yuen Tao,Marcelo H. Ang
DOI: https://doi.org/10.1109/icra.2018.8460535
2018-05-01
Abstract:Environment understanding, object detection and recognition are crucial skills for robots operating in the real world. In this paper, we propose a Convolutional Neural Network with multi-task objectives: object detection and scene classification in one unified architecture. The proposed network reasons globally about an image to understand the scene, hypothesize object locations, and encodes global scene features with regional object features to improve object recognition. We evaluate our network on the standard SUN RGBD dataset. Experiments show that our approach outperforms state-of-the-arts. Network predictions are further transformed into continuous robot beliefs to ensure temporal coherence and extended to 3D space for robotics applications. We embed the whole framework in Robot Operating System, and evaluate its performance on a real robot for semantic mapping and grasp detection.
What problem does this paper attempt to address?