Efficient-Learning Grasping and Pushing in Dense Stacking Via Mask Function and Pixel Overlap Rate

Jie Lian,Juncheng Jiang,Chaochao Qiu,Qinghui Pan,Yongxiang Dong,Zhao Wang,Dong Wang
DOI: https://doi.org/10.1109/isas61044.2024.10552456
2024-01-01
Abstract:In the field of robotic grasping, grasping tightly stacked objects is a formidable challenge. The presence of non-target objects significantly increases the risk of grasping failure. To solve this problem, the aim is to take action while grasping, reduce clutter, and effectively complete grasping tasks. Based on deep reinforcement learning, the Efficient Learning Grasping(ELG) framework is proposed that integrates pushing actions alongside grasping to efficiently pick up objects from cluttered environments with reduced training time. However, in the process of grasping, the agent is rewarded only when the grasp is successful, so the data collection is extremely inefficient and spends a lot of time on the training of grasping strategy. In our work, two strategies are proposed to improve the training efficiency. First, two mask functions are introduced as prior knowledge to enable the agent to focus on more meaningful regions while ignoring some unnecessary regions, aiming at effective learning. Second, The Pixel Overlap Rate (POR) is introduced to quantify the environment clutter level, and set a new push action reward function based on the POR. The implementation of the above ideas in a simulation environment and transferred the learned models to the real world and verified them in practice. Compared with existing method, our method improves the training efficiency of the initial algorithm while having a higher success rate of grasping. Detailed grasping video at https://youtu.be/CqtSvaVl60M.
What problem does this paper attempt to address?