A Robot Grasp Relationship Detection Network Based on the Fusion of Multiple Features

Jianning Chi, Xingrui Wu, Changqing Ma, Xiaosheng Yu, Chengdong Wu
DOI: https://doi.org/10.1109/ccdc52312.2021.9602785
2021-01-01
Abstract: Grasping is one of the main ways robots interact with the real world. Many recent approaches apply deep learning to grasp detection and can successfully detect one or more grasp locations from an RGB image. In a scene with multiple objects, however, the robot also needs the relationships among the objects to guide it in grasping them. In this paper, we present a new deep convolutional neural network approach that detects all potential objects and predicts the grasp relationships among them from an RGB image. For each object pair, three branches generate not only visual features but also spatial masks and semantic embedding vectors. We then integrate these features as input to predict the pair's grasp relationship. Experimental results show that the proposed approach outperforms the state of the art, achieving 73.78% accuracy on the Visual Manipulation Relationship Dataset (VMRD).
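The per-pair fusion step described in the abstract can be illustrated with a small sketch. The abstract does not say how the three branches are combined, so this example assumes plain concatenation. The grid resolution, the feature sizes, and the names `box_mask` and `fuse_pair_features` are hypothetical choices for illustration, not the authors' implementation.

```python
from typing import List, Sequence, Tuple

# (x_min, y_min, x_max, y_max) in pixels; this box format is an assumption.
Box = Tuple[float, float, float, float]


def box_mask(box: Box, image_size: Tuple[int, int], grid: int = 8) -> List[int]:
    """Rasterize a bounding box onto a grid x grid binary mask (row-major)."""
    width, height = image_size
    x0, y0, x1, y1 = box
    mask = []
    for row in range(grid):
        for col in range(grid):
            # Centre of this grid cell in image coordinates.
            cx = (col + 0.5) * width / grid
            cy = (row + 0.5) * height / grid
            mask.append(1 if x0 <= cx <= x1 and y0 <= cy <= y1 else 0)
    return mask


def fuse_pair_features(
    visual: Sequence[float],
    box_a: Box,
    box_b: Box,
    embed_a: Sequence[float],
    embed_b: Sequence[float],
    image_size: Tuple[int, int],
    grid: int = 8,
) -> List[float]:
    """Concatenate visual, spatial, and semantic features for one object pair.

    Concatenation is an assumed fusion rule; the source only says the
    features are integrated before the relationship is predicted.
    """
    spatial = box_mask(box_a, image_size, grid) + box_mask(box_b, image_size, grid)
    return (
        list(visual)
        + [float(v) for v in spatial]
        + list(embed_a)
        + list(embed_b)
    )


# Toy inputs: a 4-value visual feature, two boxes, two 2-value class embeddings.
fused = fuse_pair_features(
    visual=[0.1, 0.2, 0.3, 0.4],
    box_a=(0, 0, 32, 32),
    box_b=(32, 32, 64, 64),
    embed_a=[1.0, 0.0],
    embed_b=[0.0, 1.0],
    image_size=(64, 64),
    grid=4,
)
print(len(fused))  # 4 visual + 2 * 16 mask + 2 * 2 embedding values = 40
```

In a full network, a fused vector like this would feed a classifier that outputs the grasp relationship for the pair; that classifier is omitted here because the abstract gives no details about it.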