SegICP: Integrated Deep Semantic Segmentation and Pose Estimation

Jay M. Wong,Vincent Kee,Tiffany Le,Syler Wagner,Gian-Luca Mariottini,Abraham Schneider,Lei Hamilton,Rahul Chipalkatty,Mitchell Hebert,David M.S. Johnson,Jimmy Wu,Bolei Zhou,Antonio Torralba
DOI: https://doi.org/10.1109/IROS.2017.8206470
2017-09-06
Abstract:Recent robotic manipulation competitions have highlighted that sophisticated robots still struggle to achieve fast and reliable perception of task-relevant objects in complex, realistic scenarios. To improve these systems' perceptive speed and robustness, we present SegICP, a novel integrated solution to object recognition and pose estimation. SegICP couples convolutional neural networks and multi-hypothesis point cloud registration to achieve both robust pixel-wise semantic segmentation as well as accurate and real-time 6-DOF pose estimation for relevant objects. Our architecture achieves 1cm position error and <5^\circ$ angle error in real time without an initial seed. We evaluate and benchmark SegICP against an annotated dataset generated by motion capture.
Robotics,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?