FastGNet: An Efficient 6-DOF Grasp Detection Method with Multi-Attention Mechanisms and Point Transformer Network

Zichao Ding,Aimin Wang,Maosen Gao,Jiazhe Li
DOI: https://doi.org/10.1088/1361-6501/ad1cc5
IF: 2.398
2024-01-11
Measurement Science and Technology
Abstract:A pivotal technology for autonomous robot grasping is efficient and accurate grasp pose detection, which enables robotic arms to grasp objects in cluttered environments without human intervention. However, most existing methods rely on PointNet or CNN as backbones for grasp pose prediction, which may lead to unnecessary computational overhead on invalid grasp points or background information. Consequently, performing efficient grasp pose detection for graspable points in complex scenes becomes a challenge. In this paper, we propose FastGNet, an end-to-end model that combines multiple attention mechanisms with the Transformer architecture to generate 6-DOF grasp poses efficiently. Our approach involves a novel sparse point cloud voxelization technique, preserving the complete mapping between points and voxels while generating positional embeddings for the Transformer network. By integrating unsupervised and supervised attention mechanisms into the grasp model, our method significantly improves the performance of focusing on graspable target points in complex scenes. The effectiveness of FastGNet is validated on the large-scale GraspNet-1Billion dataset. Our approach outperforms previous methods and achieves relatively fast inference times, highlighting its potential to advance autonomous robot grasping capabilities.
engineering, multidisciplinary,instruments & instrumentation
What problem does this paper attempt to address?