Abstract:With sufficient practice, humans can grab objects they have never seen before through brain decision-making. However, the manipulators, which has a wide range of applications in industrial production, can still only grab specific objects. Because most of the grasp algorithms rely on prior knowledge such as hand-eye calibration results, object model features, and can only target specific types of objects. When the task scenario and the operation target change, it cannot perform effective redeployment. In order to solve the above problems, academia often uses reinforcement learning to train grasping algorithms. However, the method of reinforcement learning in the field of manipulators grasping mainly encounters these main problems: insufficient sample utilization, poor algorithm stability, and limited exploration. This article uses LfD, BC, and DDPG to improve sample utilization. Use multiple critics to integrate and evaluate input actions to solve the problem of algorithm instability. Finally, inspired by Thompson's sampling idea, the input action is evaluated from different angles, which increases the algorithm's exploration of the environment and reduces the number of interactions with the environment. EDDPG and EBDDPG algorithm is designed in the article. In order to further improve the generalization ability of the algorithm, this article does not use extra information that is difficult to obtain directly on the physical platform, such as the real coordinates of the target object and the continuous motion space at the end of the manipulator in the Cartesian coordinate system is used as the output of the decision. The simulation results show that, under the same number of interactions, the manipulators' success rate in grabbing 1000 random objects has increased more than double and reached state-of-the-art(SOTA) performance.

More Than a Feeling: Learning to Grasp and Regrasp using Vision and Touch

Ensemble Bootstrapped Deep Deterministic Policy Gradient For Vision-Based Robotic Grasping

Learning to Grasp Without Seeing

Grasping Using Tactile Sensing and Deep Calibration

Vision-Based Robotic Object Grasping—A Deep Reinforcement Learning Approach

Tactile Regrasp: Grasp Adjustments via Simulated Tactile Transformations

Automatic Grasping Using Tactile Sensing and Deep Calibration

Learning Task-Based Robotic Grasping with Vision, Haptics and Proprioception

Learning to Regrasp Using Visual–Tactile Representation-Based Reinforcement Learning

Humanoid Robot Grasping with a Soft Gripper Through a Learned Inverse Model of a Central Pattern Generator and Tactile Servoing

Reaching, Grasping and Re-grasping: Learning Multimode Grasping Skills

Learning Fine Pinch-Grasp Skills using Tactile Sensing from A Few Real-world Demonstrations

Leveraging Contact Forces for Learning to Grasp

Integrating High-Resolution Tactile Sensing into Grasp Stability Prediction

Learning Gentle Grasping from Human-Free Force Control Demonstration

A Grasp Pose is All You Need: Learning Multi-fingered Grasping with Deep Reinforcement Learning from Vision and Touch

The Role of Tactile Sensing in Learning and Deploying Grasp Refinement Algorithms

Learning a visuomotor controller for real world robotic grasping using simulated depth images

Learning Robust Grasping Strategy Through Tactile Sensing and Adaption Skill

Multifingered Grasping Based on Multimodal Reinforcement Learning

Planning Visual-Tactile Precision Grasps via Complementary Use of Vision and Touch