Efficient Fully Convolutional Network and Optimization Approach for Robotic Grasping Detection Based on RGB-D Images

Wei Nie, Xinwu Liang
DOI: https://doi.org/10.1007/s12204-023-2615-x
2023-01-01
Journal of Shanghai Jiaotong University (Science)
Abstract: Robot grasp detection is a fundamental vision task for robots. Deep learning-based methods have shown excellent results in enhancing grasp detection for model-free objects in unstructured scenes. Most popular approaches explore deep network models and exploit RGB-D images, which combine colour and depth data, to obtain richer feature representations. However, current work struggles to balance accuracy against real-time performance, pays inadequate attention to the differences between RGB and depth feature distributions, and rarely handles cases in which the predicted grasp fails. We propose an efficient fully convolutional network that predicts pixel-level antipodal grasp parameters from RGB-D images. A structure with hierarchical feature fusion is built from multiple lightweight feature extraction blocks. A feature fusion module with 3D global attention is used to select the complementary information in the RGB and depth images. Additionally, a grasp configuration optimization method based on the local grasp path is proposed to cope with possible failures in the grasps predicted by the model. Extensive experiments on two public grasping datasets, Cornell and Jacquard, demonstrate that the approach improves the performance of grasping unknown objects.
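
To make "pixel-level antipodal grasp parameters" concrete, the following is a minimal sketch of how per-pixel output maps could be decoded into a single grasp. The abstract does not specify the output encoding, so everything here is an assumption: the four maps (quality, cos 2θ, sin 2θ, width), the function name `decode_grasp`, and the toy values are hypothetical, and the (cos 2θ, sin 2θ) angle encoding is a convention used in some pixel-wise grasp detectors rather than a detail confirmed by this paper.

```python
import math

def decode_grasp(quality, cos2t, sin2t, width):
    """Return (row, col, angle, width) for the highest-quality pixel.

    All four inputs are H x W nested lists. This output layout is an
    assumption for illustration, not the paper's documented format.
    """
    rows, cols = len(quality), len(quality[0])
    # Grasp centre: the pixel with the highest predicted quality score.
    row, col = max(
        ((r, c) for r in range(rows) for c in range(cols)),
        key=lambda rc: quality[rc[0]][rc[1]],
    )
    # Antipodal grasps are symmetric under a 180-degree rotation, so the
    # angle is assumed to be encoded as (cos 2θ, sin 2θ) and halved here.
    angle = 0.5 * math.atan2(sin2t[row][col], cos2t[row][col])
    return row, col, angle, width[row][col]

# Toy 4 x 5 maps (hypothetical values): one confident pixel at (2, 3).
H, W = 4, 5
theta = math.pi / 6
quality = [[0.0] * W for _ in range(H)]
quality[2][3] = 0.9
cos2t = [[math.cos(2 * theta)] * W for _ in range(H)]
sin2t = [[math.sin(2 * theta)] * W for _ in range(H)]
width = [[30.0] * W for _ in range(H)]

grasp = decode_grasp(quality, cos2t, sin2t, width)
```

In this sketch the grasp is read off at a single best pixel; the optimization step described in the abstract would operate after such a decoding, but its details are not given here and are not modelled.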