A Grasp Pose Detection Network Based on the DeepLabv3+Semantic Segmentation Model

Qinjian Zhang,Xiangyan Zhang,Haiyuan Li
DOI: https://doi.org/10.1007/978-3-031-13841-6_67
2022-01-01
Abstract:Grasping is an important and fundamental action for the interaction between robots and the environment. However, because grasping is a complex system engineering, there is still much room for development. At present, many studies use regression to solve the problem of grasp detection or use the unstable 3D point cloud as input, which may cause poor results to a certain extent. In this paper, we propose to use semantic segmentation of pixel-level classification to solve the problem of grasp pose detection. We adopt a grasp detection method based on the DeepLabv3+ model, which includes semantic segmentation and post-processing. In the semantic segmentation part, the classification mask of the objects is predicted through the input RGB image, and then the predicted objects of different classifications are fitted with the minimum bounding directed rectangle to obtain the two-dimensional grasp pose, and the final three-dimensional grasping pose is calculated through the conversion of the input depth image. On the validation dataset, we use the indicator of semantic segmentation to evaluate the proposed network and achieve a great result. In addition, the simulation robot experiment further verifies the effectiveness of the network.
What problem does this paper attempt to address?