FAT: Field-Aware Transformer for Point Cloud Segmentation with Adaptive Attention Fields

Junjie Zhou, Baolin Liu, Yongping Xiong, Chinwai Chiu, Fangyu Liu, Xiangyang Gong
DOI: https://doi.org/10.1109/tii.2024.3393572
IF: 12.3
2024-01-01
IEEE Transactions on Industrial Informatics
Abstract: Point cloud segmentation is crucial for various industrial applications, such as autonomous driving and robotics. Recent developments underscore the significant potential of transformer models in this field. However, existing attention mechanisms apply the same feature learning paradigm to all points equally, ignoring the considerable size differences among objects in a scene. To rectify this, we introduce the field-aware transformer (FAT), engineered to tailor effective receptive fields to objects of varying sizes. Our FAT achieves field-aware learning through two primary components: the multigranularity attention (MGA) scheme and the reattention module. The MGA scheme aggregates tokens from distant areas while preserving multiscale features within each attention layer. The reattention module dynamically adjusts, for each point, the attention scores assigned to the fine- and coarse-grained features output by MGA. Extensive experimental results demonstrate the effectiveness and efficiency of our FAT, which delivers state-of-the-art performance on both the Stanford 3D indoor scene dataset (S3DIS) and ScanNetV2.
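The per-point reweighting of fine- and coarse-grained features described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract gives no formulas, so the softmax gate, the dot-product scoring, and the names `reattention_blend`, `w_fine`, and `w_coarse` are all assumptions chosen only to show the general idea of letting each point decide how much to rely on each granularity.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a short list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def reattention_blend(fine, coarse, w_fine, w_coarse):
    """Blend one point's fine and coarse feature vectors.

    ASSUMPTION: w_fine and w_coarse are hypothetical learned scoring
    vectors; the paper's actual reattention module may differ entirely.
    """
    # Score each branch with a dot product against its scoring vector.
    score_fine = sum(f * w for f, w in zip(fine, w_fine))
    score_coarse = sum(c * w for c, w in zip(coarse, w_coarse))
    # Turn the two scores into per-point weights that sum to 1.
    a_fine, a_coarse = softmax([score_fine, score_coarse])
    # Weighted sum of the two granularities for this point.
    blended = [a_fine * f + a_coarse * c for f, c in zip(fine, coarse)]
    return blended, (a_fine, a_coarse)


# Toy usage: a point whose fine-grained branch scores higher than its
# coarse-grained branch ends up weighting the fine features more.
blended, weights = reattention_blend(
    fine=[2.0, 0.0], coarse=[0.0, 1.0], w_fine=[1.0, 1.0], w_coarse=[1.0, 1.0]
)
```

In this toy setting a small object would plausibly favor the fine-grained branch and a large object the coarse-grained one, which is the intuition behind adapting the receptive field per point.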