Mobile-SegFormer: A Lightweight Semantic Segmentation Network

Zhenyuan Lin, Weikun Li, Dahua Gao
DOI: https://doi.org/10.1007/978-981-97-5615-5_24
2024-01-01
Abstract: In recent years, the development of mobile vision applications has stimulated demand for lightweight semantic segmentation. However, current lightweight semantic segmentation networks suffer from low accuracy. This paper proposes Mobile-SegFormer, a lightweight semantic segmentation algorithm with high accuracy. Mobile-SegFormer uses an improved MobileNetV3 as the backbone network for feature extraction, which effectively reduces the number of parameters. To improve accuracy, the paper constructs a Spatial Pyramid Merge (SPM) module and a Spatial Pyramid Cascade (SPC) module. The SPM module serves as the fundamental building block for improving multi-scale learning in the MobileNetV3 backbone and effectively improves the segmentation accuracy of the model. To further improve segmentation performance, the SPC module is introduced in the decoding stage, allowing the model to make more effective use of high-level and low-level semantic feature information. The network was tested on the Cityscapes, CamVid, and ADE20K datasets, and the results show that Mobile-SegFormer achieves higher mIoU with fewer parameters and lower FLOPs on all three datasets.
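The abstract reports accuracy as mIoU (mean intersection over union) without defining it. As a minimal sketch of how that metric is commonly computed, and not the authors' evaluation code, the snippet below averages per-class IoU over flattened label lists. The function name `mean_iou`, its parameters, and the ignore label 255 are assumptions made for illustration.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are flat sequences of integer class labels of equal
    length. Pixels whose target equals ignore_index are skipped
    (255 is an assumed convention, not taken from the paper).
    """
    inter = [0] * num_classes  # per-class intersection counts
    union = [0] * num_classes  # per-class union counts
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        if p == t:
            # Correct pixel: counts once in both intersection and union.
            inter[t] += 1
            union[t] += 1
        else:
            # Wrong pixel: enlarges the union of both classes involved.
            union[p] += 1
            union[t] += 1
    # Classes that never occur are excluded from the average.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy labels for three classes (made-up values, for illustration only).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
```

For a dataset-level score, the per-class intersection and union counts are usually accumulated over all images before dividing, which this function does when the labels of all images are concatenated into one sequence.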