Strip and Asymmetric Aggregation Network for Unstructured Terrain Segmentation in Wild Environments

Wei Li,Shishun Tian,Yuhang Zhang,Muxin Liao,Guoguang Hua,Wenbin Zou
DOI: https://doi.org/10.1016/j.engappai.2024.109016
IF: 8
2024-01-01
Engineering Applications of Artificial Intelligence
Abstract:Recent studies have demonstrated the significant importance of two factors: global context and multi-level semantics, for semantic segmentation models. However, obtaining features that extract these two factors typically results in high computational cost, which poses challenges for unstructured terrain segmentation. In this paper, we propose a Strip and Asymmetric Aggregation Network (SAANet) to collect global context and multi-level semantics while ensuring considerable segmentation accuracy to distinguish secure and navigable areas in wild environments. The SAANet network mainly consists of two essential modules: Global Strip Module (GSM) and Asymmetric Fusion Module (AFM). GSM utilizes four stripe convolutions to capture long-range contextual information while maintaining lower computational complexity. AFM consists of two units: Asymmetric Fusion Unit (AFU) and Residual Aggregation Unit (RAU). AFU based on asymmetric structure leverages attention mechanisms to fuse discriminative semantic clues from different scales, aiming to enhance the recognition of objects with high inter-class similarity and obtain an effective multi-level semantic representation. RAU is used to enhance the significant semantic features from AFU to achieve impressive terrain segmentation results. Extensive experimental results on the Robot Unstructured Ground Driving (RUGD) and Rellis Campus of Texas A&M University (RELLIS) datasets demonstrate that SAANet outperforms other state-of-the-art methods in recognizing outdoor unstructured safe drivable terrain.
What problem does this paper attempt to address?