RSProtoSeg: High Spatial Resolution Remote Sensing Images Segmentation Based on Non-Learnable Prototypes

Wenjie Sun, Jie Zhang, Yujie Lei, Danfeng Hong
DOI: https://doi.org/10.1109/tgrs.2024.3404922
IF: 8.2
2024-06-12
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Semantic segmentation of high spatial resolution (HSR) remote sensing images presents unique challenges due to the imbalanced foreground–background distribution and large intraclass variance. This study proposes a novel semantic segmentation algorithm based on non-learnable prototypes, named RSProtoSeg. The approach optimizes the spatial relationships between foreground and background prototypes and among the prototypes within each class. Specifically, we propose a foreground–background distance optimization loss function to enhance sparsity between these prototypes, effectively mitigating the foreground–background distribution imbalance. Moreover, we introduce an online discrete clustering module that represents each class with a set of prototypes and adds an adaptive regularization penalty to promote a sparse structure and reduce intraclass variance. Evaluation on three remote sensing datasets (iSAID, ISPRS Potsdam, and Vaihingen) demonstrates significant accuracy improvements, aligning our approach with state-of-the-art methods. Our non-learnable prototype-based approach offers a promising solution for semantic segmentation in HSR remote sensing images.
imaging science & photographic technology; remote sensing; engineering, electrical & electronic; geochemistry & geophysics
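
The abstract's central idea, representing each class by a set of prototypes that are computed from feature embeddings rather than learned as network parameters, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`build_prototypes`, `predict`), the use of plain offline k-means in place of the paper's online discrete clustering module, and cosine similarity as the matching rule are all assumptions made for illustration. The foreground–background distance loss and the adaptive regularization penalty are not modelled here.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors to unit length so dot products equal cosine similarity."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)


def build_prototypes(embeddings, labels, num_classes, k, iters=10, seed=0):
    """Compute k prototypes per class as cluster centres of that class's embeddings.

    Assumption: simple k-means on the unit sphere stands in for the paper's
    online clustering. Each class must contain at least k embeddings.
    Returns an array of shape (num_classes, k, dim).
    """
    rng = np.random.default_rng(seed)
    dim = embeddings.shape[1]
    protos = np.zeros((num_classes, k, dim))
    for c in range(num_classes):
        feats = l2_normalize(embeddings[labels == c])
        # Initialise centres from k distinct samples of this class.
        centres = feats[rng.choice(len(feats), size=k, replace=False)]
        for _ in range(iters):
            # Assign each embedding to its most similar centre.
            assign = np.argmax(feats @ centres.T, axis=1)
            for j in range(k):
                members = feats[assign == j]
                if len(members):  # keep the old centre if a cluster is empty
                    centres[j] = l2_normalize(members.mean(axis=0))
        protos[c] = centres
    return protos


def predict(embeddings, protos):
    """Label each embedding with the class of its most similar prototype."""
    feats = l2_normalize(embeddings)
    # sims[n, c, j] = cosine similarity of embedding n to prototype j of class c.
    sims = np.einsum("nd,ckd->nck", feats, protos)
    return sims.max(axis=2).argmax(axis=1)
```

In a segmentation setting, `embeddings` would be the per-pixel feature vectors produced by a backbone network, flattened to shape `(num_pixels, dim)`, and `labels` the corresponding ground-truth class indices. Because the prototypes are derived from the data, only the backbone would carry trainable weights in such a setup.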