Vision-based robotic grasping using faster R-CNN–GRCNN dual-layer detection mechanism
Jianguo Duan,Liwen Zhuang,Qinglei Zhang,Jiyun Qin,Ying Zhou
DOI: https://doi.org/10.1177/09544054241249217
2024-05-19
Proceedings of the Institution of Mechanical Engineers Part B Journal of Engineering Manufacture
Abstract:Proceedings of the Institution of Mechanical Engineers, Part B: Journal of Engineering Manufacture, Ahead of Print. Visual grasping technology plays a crucial role in various robotic applications, such as industrial automation, warehousing, and logistics. However, current visual grasping methods face limitations when applied in industrial scenarios. Focusing solely on the workspace where the grasping target is located restricts the camera's ability to provide additional environmental information. On the other hand, monitoring the entire working area introduces irrelevant data and hinders accurate grasping pose estimation. In this paper, we propose a novel approach that combines a global camera and a depth camera to enable efficient target grasping. Specifically, we introduce a dual-layer detection mechanism based on Faster R-CNN–GRCNN. By enhancing the Faster R-CNN with attention mechanisms, we focus the global camera on the workpiece placement area and detect the target object within that region. When the robot receives the command to grasp the workpiece, the improved Faster R-CNN recognizes the workpiece and guides the robot towards the target location. Subsequently, the depth camera on the robot determines the grasping pose using Generative Residual Convolutional Neural Network and performs the grasping action. We validate the feasibility and effectiveness of our proposed framework through experiments involving collaborative assembly tasks using two robotic arms.
engineering, mechanical, manufacturing