Abstract:Achieving precise grasping of specific targets in complex and crowded scenes is of great importance. Existing works often rely on the combination of pushing and grasping strategies to separate the target from surrounding obstacles. However, in crowded environments with limited space, traditional combined push-grasp methods struggle to effectively separate the target from dense obstacles due to insufficient action execution space. To address the problem of target grasping in space-constrained scenes, this paper proposes a target grasping algorithm based on policy search mechanism. By intelligently identifying and grasping the obstacles around the target, the complexity of the workspace is reduced, thereby improving the action efficiency and the success rate of target grasping. Firstly, a Q-value prediction network is trained based on deep reinforcement learning to extract features from input images and predict the Q-value of each pixel point to determine the best grasping point. Secondly, to improve the network's prediction ability, the SE-DenseNet network structure is proposed for feature extraction and enhancing the network's sensitivity to key feature information. Finally, to verify the effectiveness of the proposed grasping framework, a space-constrained working scene is built in the CoppeliaSim simulation platform for experimental testing. The results show that the proposed framework achieves an action efficiency of 82.6% and a grasping success rate of 83.5%, significantly outperforming traditional methods. Moreover, the impact of each part of the algorithm on system performance is further verified through ablation studies, fully demonstrating the efficiency and reliability of the proposed framework.

Gated Self Attention Network for Efficient Grasping of Target Objects in Stacked Scenarios

A Multi-task Convolutional Neural Network for Autonomous Robotic Grasping in Object Stacking Scenes

A Semantic Robotic Grasping Framework Based on Multi-Task Learning in Stacking Scenes.

Robotic Grasping in Multi-Object Stacking Scenes Based on Visual Reasoning

Robotic Grasping Method Based on 3D Vision for Stacked Rectangular Objects

Secure Grasping Detection of Objects in Stacked Scenes Based on Single-Frame RGB Images

Task-Oriented Grasping In Object Stacking Scenes With Crf-Based Semantic Model

Efficient Grasp Detection Network with Gaussian-Based Grasp Representation for Robotic Manipulation

A two-stage grasp detection method for sequential robotic grasping in stacking scenarios

Grasp Manipulation Relationship Detection Based on Graph Sample and Aggregation

Visual Manipulation Relationship Detection based on Gated Graph Neural Network for Robotic Grasping

RPRG: Toward Real-time Robotic Perception, Reasoning and Grasping with One Multi-task Convolutional Neural Network.

Robot Grasping Based on Stacked Object Classification Network and Grasping Order Planning

A graph reasoning method for multi-object unordered stacking scenarios

Efficient-Learning Grasping and Pushing in Dense Stacking Via Mask Function and Pixel Overlap Rate

NG-Net: No-Grasp annotation grasp detection network for stacked scenes

Grasping Pose Detection for Loose Stacked Object Based on Convolutional Neural Network with Multiple Self-Powered Sensors Information

Robotic Arm Target Grasping Algorithm Based on Obstacle Removal in Space-Constrained Scenes

UPG: 3D Vision-Based Prediction Framework for Robotic Grasping in Multi-Object Scenes.

EGNet: Efficient Robotic Grasp Detection Network

Grasping with Occlusion-Aware Ally Method in Complex Scenes