Abstract:Accurately detecting the appropriate grasp configurations is the central task for the robot to grasp an object. Existing grasp detection methods usually overlook the depth image or only regard it as a two-dimensional distance image, which makes it difficult to capture the three-dimensional structural characteristics of target object. In this article, we transform the depth image to point cloud and propose a two-stage grasp detection method based on candidate grasp detection from RGB image and spatial feature rescoring from point cloud. Specifically, we first adopt the recently proposed high-performance rotation object detection method for aerial images, named R3Det, to grasp detection task, obtaining the candidate grasp boxes and their appearance scores. Then, point clouds within each candidate grasp box are normalized and evaluated to get the point cloud quality scores, which are fused with the established point cloud quantity scoring model to obtain spatial scores. Finally, appearance scores and their corresponding spatial scores are combined to output high-quality grasp detection results. The proposed method effectively fuses three types of grasp scoring modules, thus is called Score Fusion Grasp Net. Besides, we propose and adopt top-k grasp metric to effectively reflect the success rate of algorithm in actual grasp execution. Score Fusion Grasp Net obtains 98.5% image-wise accuracy and 98.1% object-wise accuracy on Cornell Grasp Dataset, which exceeds the performances of state-of-the-art methods. We also use the robotic arm to conduct physical grasp experiments on 15 kinds of household objects and 11 kinds of adversarial objects. The results show that the proposed method still has a high success rate when facing new objects.

Lightweight Pixel-Wise Generative Robot Grasping Detection Based on RGB-D Dense Fusion

LiteGrasp: A Light Robotic Grasp Detection Via Semi-Supervised Knowledge Distillation

Efficient Grasp Detection Network with Gaussian-Based Grasp Representation for Robotic Manipulation

A New Robotic Grasp Detection Method Based on RGB-D Deep Fusion.

Real-Time Pixel-Wise Grasp Detection Based on RGB-D Feature Dense Fusion

Efficient Fully Convolutional Network and Optimization Approach for Robotic Grasping Detection Based on RGB-D Images

Robotic Grasp Detection Method Based on Lightweight Feature Fusion Convolutional Neural Network

A robot grasping detection network based on flexible selection of multi-modal feature fusion structure

High-performance Pixel-level Grasp Detection Based on Adaptive Grasping and Grasp-aware Network

Bilateral Cross-Modal Fusion Network for Robot Grasp Detection

Rotation adaptive grasping estimation network oriented to unknown objects based on novel RGB-D fusion strategy

RGB Matters: Learning 7-DoF Grasp Poses on Monocular RGBD Images

DSC-GraspNet: A Lightweight Convolutional Neural Network for Robotic Grasp Detection

Grasp Detection Via Visual Rotation Object Detection and Point Cloud Spatial Feature Scoring

GraspFusionNet: a Two-Stage Multi-Parameter Grasp Detection Network Based on RGB–XYZ Fusion in Dense Clutter

Detection Method of Manipulator Grasp Pose Based on RGB-D Image

A pixel-level grasp detection method based on Efficient Grasp Aware Network

A Robot Grasp Relationship Detection Network Based on the Fusion of Multiple Features

Robust Robot Grasp Detection in Multimodal Fusion

A Vision-based Robot Grasping System

A Real-Time Grasping Detection Network Architecture for Various Grasping Scenarios