SIESEF-FusionNet: Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network for LiDAR Point Cloud Semantic Segmentation

Jiale Chen, Fei Xia, Jianliang Mao, Haoping Wang, Chuanlin Zhang
2024-11-11
Abstract: The ambiguity at the boundaries of different semantic classes in point cloud semantic segmentation often leads to incorrect decisions in intelligent perception systems, such as autonomous driving. Hence, accurate delineation of the boundaries is crucial for improving safety in autonomous driving. A novel spatial inter-correlation enhancement and spatially-embedded feature fusion network (SIESEF-FusionNet) is proposed in this paper, enhancing spatial inter-correlation by combining inverse distance weighting and angular compensation to extract more beneficial spatial information without causing redundancy. Meanwhile, a new spatial adaptive pooling module is also designed, embedding enhanced spatial information into semantic features for strengthening the context-awareness of semantic features. Experimental results demonstrate that 83.7% mIoU and 97.8% OA are achieved by SIESEF-FusionNet on the Toronto3D dataset, with performance superior to other baseline methods. A value of 61.1% mIoU is reached on the SemanticKITTI dataset, where a marked improvement in segmentation performance is observed. In addition, the effectiveness and plug-and-play capability of the proposed modules are further verified through ablation studies.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper addresses boundary ambiguity between different semantic categories in LiDAR point cloud semantic segmentation. In intelligent perception systems such as autonomous driving, this ambiguity can lead to incorrect decisions and so compromise the safety of the system. Accurately delineating these boundaries is therefore crucial for improving the safety of autonomous driving.

To tackle this challenge, the paper proposes a Spatial Inter-correlation Enhancement and Spatially-Embedded Feature Fusion Network (SIESEF-FusionNet). The network enhances spatial inter-correlation by combining inverse distance weighting and angular compensation, extracting more beneficial spatial information without causing redundancy. In addition, a new spatial adaptive pooling module embeds the enhanced spatial information into semantic features to strengthen their contextual awareness.

### Main Contributions

1. **ELSE Module**: An Enhanced Local Spatial Encoding (ELSE) module is proposed. It enhances the inter-correlation of local spatial information using inverse distance weighting and angular compensation, thereby mitigating the boundary ambiguity problem in point cloud semantic segmentation. To the best of the authors' knowledge, this is the first work to consider spatial inter-correlation in LiDAR semantic segmentation.
2. **SEAP Module**: A Spatial Embedding Adaptive Pooling (SEAP) module is designed to embed the enhanced local spatial encoding into local semantic features. It retains more features that combine spatial and semantic information, thereby enhancing the contextual awareness of local features.
3. **Plug-and-Play Compatibility**: The ELSE and SEAP modules are plug-and-play, so they can be integrated into existing point cloud segmentation networks to enhance the feature learning capability of the model.
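
The summary names inverse distance weighting and angular compensation but does not give the paper's formulas, so the following is only a minimal NumPy sketch of the general idea and not the authors' ELSE module. The function names (`idw_weights`, `relative_angles`, `encode_local`), the normalization of the weights, and the use of azimuth and elevation as the angular cue are all assumptions made for illustration.

```python
import numpy as np

def idw_weights(center, neighbors, eps=1e-8):
    """Normalized inverse-distance weights for the K neighbors of one point.

    Closer neighbors receive larger weights. Normalizing to sum to 1 is an
    assumption, not something stated in the summary.
    """
    dists = np.linalg.norm(neighbors - center, axis=1)  # shape (K,)
    inv = 1.0 / (dists + eps)                           # eps avoids division by zero
    return inv / inv.sum()

def relative_angles(center, neighbors):
    """Azimuth and elevation of each neighbor offset (hypothetical angular cue)."""
    off = neighbors - center
    azimuth = np.arctan2(off[:, 1], off[:, 0])
    elevation = np.arctan2(off[:, 2], np.linalg.norm(off[:, :2], axis=1))
    return azimuth, elevation

def encode_local(center, neighbors):
    """Concatenate distance-weighted offsets with the two angles: shape (K, 5)."""
    w = idw_weights(center, neighbors)
    az, el = relative_angles(center, neighbors)
    off = neighbors - center
    return np.concatenate([w[:, None] * off, az[:, None], el[:, None]], axis=1)

# Toy neighborhood: three neighbors at distances 1, 2 and 4 from the center.
center = np.array([0.0, 0.0, 0.0])
neighbors = np.array([[1.0, 0.0, 0.0],
                      [0.0, 2.0, 0.0],
                      [0.0, 0.0, 4.0]])
weights = idw_weights(center, neighbors)
features = encode_local(center, neighbors)
```

In this toy neighborhood the nearest neighbor gets the largest weight, which is the behavior inverse distance weighting is meant to provide. In a real network the resulting per-neighbor encoding would typically be passed through learned layers rather than used directly.
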
### Experimental Results

Experimental results show that SIESEF-FusionNet achieves 83.7% mIoU and 97.8% OA on the Toronto3D dataset, outperforming the baseline methods it is compared against. On the SemanticKITTI dataset it reaches 61.1% mIoU, a marked improvement in segmentation performance. Ablation studies further validate the effectiveness and plug-and-play capability of the proposed modules.

### Conclusion

With SIESEF-FusionNet, the paper targets the boundary ambiguity problem in point cloud semantic segmentation, with the aim of improving the safety and accuracy of applications such as autonomous driving. Its results on two outdoor LiDAR datasets, Toronto3D and SemanticKITTI, indicate its potential application value.
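
For readers unfamiliar with the two metrics reported above, the sketch below shows how mean intersection over union (mIoU) and overall accuracy (OA) are commonly computed from a confusion matrix. It is a generic illustration with made-up labels, not the evaluation code used in the paper, and the helper names are hypothetical.

```python
import numpy as np

def confusion_matrix(y_true, y_pred, num_classes):
    """Rows are ground-truth classes, columns are predicted classes."""
    idx = y_true * num_classes + y_pred
    counts = np.bincount(idx, minlength=num_classes ** 2)
    return counts.reshape(num_classes, num_classes)

def overall_accuracy(cm):
    """Fraction of points whose predicted class matches the ground truth."""
    return np.trace(cm) / cm.sum()

def mean_iou(cm):
    """Average of per-class TP / (TP + FP + FN), skipping absent classes."""
    tp = np.diag(cm).astype(float)
    union = cm.sum(axis=0) + cm.sum(axis=1) - tp
    valid = union > 0
    return (tp[valid] / union[valid]).mean()

# Toy labels for six points and three classes (illustrative only).
y_true = np.array([0, 0, 1, 1, 2, 2])
y_pred = np.array([0, 1, 1, 1, 2, 0])
cm = confusion_matrix(y_true, y_pred, num_classes=3)
oa = overall_accuracy(cm)   # 4 of 6 points correct
miou = mean_iou(cm)         # per-class IoU: 1/3, 2/3, 1/2
```

OA can stay high when frequent classes dominate the point count, whereas mIoU weights every class equally, which is why segmentation papers usually report both.
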