Abstract: To better solve the visual detection problem of a manipulator grasping non-cooperative targets, we propose a grasp pose detection method based on pixel-level prediction and feature fusion. An improved U2net network serves as the backbone for feature extraction and feature fusion of the input image, and a grasp prediction layer detects the grasp pose at each pixel. To adapt U2net to grasp pose detection and improve its detection performance, we simplify its network structure to increase detection speed and control the sampling depth, while retaining some shallow features during feature fusion to enhance its feature extraction capability. We introduce depthwise separable convolution in the grasp prediction layer, which further fuses the features extracted by the backbone to obtain more expressive predictive feature maps. Focal loss is selected as the loss function to address the imbalance between positive and negative samples in network training. We use the Cornell dataset for training and testing, label the images at the pixel level, and replace labels that are not conducive to actual grasping. This adaptation makes the dataset better suited to network training and testing while meeting the real-world grasping requirements of the manipulator. The image-wise and object-wise evaluation results are 95.65% and 91.20%, respectively, and the detection speed is 0.007 s/frame. We also apply the method in actual manipulator grasping experiments. The results show that our method improves on previous methods in both accuracy and speed, and has strong generalization ability and portability.
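
To make the role of focal loss in pixel-wise training concrete, the following is a minimal sketch of the standard binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), applied to a flat list of per-pixel grasp probabilities. It is not the paper's implementation: the function name `focal_loss`, the flat-list input, and the defaults `alpha=0.25` and `gamma=2.0` are assumptions chosen for illustration, since the abstract does not state the hyperparameters used.

```python
import math


def focal_loss(probs, labels, alpha=0.25, gamma=2.0, eps=1e-7):
    """Mean binary focal loss over per-pixel predictions.

    probs  : predicted probabilities that each pixel is a valid grasp (0..1)
    labels : ground-truth labels, 1 for a grasp pixel and 0 for background
    alpha, gamma : illustrative defaults (assumed, not taken from the paper)
    """
    total = 0.0
    for p, y in zip(probs, labels):
        # Clamp to avoid log(0) for saturated predictions.
        p = min(max(p, eps), 1.0 - eps)
        # p_t is the probability assigned to the true class.
        p_t = p if y == 1 else 1.0 - p
        # alpha_t weights positives by alpha and negatives by (1 - alpha).
        alpha_t = alpha if y == 1 else 1.0 - alpha
        # (1 - p_t)^gamma shrinks the contribution of well-classified pixels.
        total += -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(probs)


if __name__ == "__main__":
    # One rare positive pixel among many easy background pixels.
    probs = [0.05] * 9 + [0.30]
    labels = [0] * 9 + [1]
    print(focal_loss(probs, labels))
```

With `gamma=0` the modulating factor disappears and the expression reduces to an alpha-weighted cross-entropy. Larger values of `gamma` shift the loss toward hard pixels, which is why this loss suits grasp maps where background pixels far outnumber grasp pixels.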

Detection Method of Manipulator Grasp Pose Based on RGB-D Image