Abstract:<p>Due to its wide applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Benefiting from its hierarchical abstract ability, the deep semantic segmentation network (DSSN) has achieved tremendous success on RS image semantic segmentation and has gradually become the mainstream technology. However, the superior performance of DSSN highly depends on two conditions: (I) massive quantities of labeled training data exist; (II) the testing data seriously resemble the training data. In actual RS applications, it is difficult to fully meet these conditions due to the RS sensor variation and the distinct landscape variation in different geographic locations. To make DSSN fit the actual RS scenario, this paper exploits the cross-domain RS image semantic segmentation task, which means that DSSN is trained on one labeled dataset (i.e., the source domain) but is tested on another varied dataset (i.e., the target domain). In this setting, the performance of DSSN is inevitably very limited due to the data shift between the source and target domains. To reduce the disadvantageous influence of data shift, this paper proposes a novel objective function with multiple weakly-supervised constraints to learn DSSN for cross-domain RS image semantic segmentation. Through carefully examining the characteristics of cross-domain RS image semantic segmentation, multiple weakly-supervised constraints include the weakly-supervised transfer invariant constraint (WTIC), weakly-supervised pseudo-label constraint (WPLC) and weakly-supervised rotation consistency constraint (WRCC). Specifically, DualGAN is recommended to conduct unsupervised style transfer between the source and target domains to carry out WTIC. To make full use of the merits of multiple constraints, this paper presents a dynamic optimization strategy that dynamically adjusts the constraint weights of the objective function during the training process. With full consideration of the characteristics of the cross-domain RS image semantic segmentation task, this paper gives two cross-domain RS image semantic segmentation settings: (I) variation in geographic location and (II) variation in both geographic location and imaging mode. Extensive experiments demonstrate that our proposed method remarkably outperforms the state-of-the-art methods under both of these settings. The collected datasets and evaluation benchmarks have been made publicly available online (<a href="https://github.com/te-shi/MUCSS">https://github.com/te-shi/MUCSS</a>).</p>

Weakly Supervised Semantic Segmentation in Aerial Imagery via Explicit Pixel-Level Constraints

Weakly Supervised Semantic Segmentation in Aerial Imagery via Cross-Image Semantic Mining

One model is enough: Toward multiclass weakly supervised remote sensing image semantic segmentation

A Creative Weak Supervised Semantic Segmentation for Remote Sensing Images

Weakly Supervised Semantic Segmentation With Consistency-Constrained Multiclass Attention for Remote Sensing Scenes

Weakly Supervised Semantic Segmentation of Remote Sensing Images Based on Progressive Mining and Saliency-Enhanced Self-Attention

Semantic Attention and Structured Model for Weakly Supervised Instance Segmentation in Optical and SAR Remote Sensing Imagery

Superpixel Consistency Saliency Map Generation for Weakly Supervised Semantic Segmentation of Remote Sensing Images

Semantic Attention and Scale Complementary Network for Instance Segmentation in Remote Sensing Images

A Spectral–Spatial Context-Boosted Network for Semantic Segmentation of Remote Sensing Images

Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation

Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation

Simple and Efficient: A Semisupervised Learning Framework for Remote Sensing Image Semantic Segmentation

On the Effectiveness of Weakly Supervised Semantic Segmentation for Building Extraction From High-Resolution Remote Sensing Imagery

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.

Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models

Spatial Structure Constraints for Weakly Supervised Semantic Segmentation

Weakly Supervised Semantic Segmentation with a Multiscale Model

Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling

Towards Single Stage Weakly Supervised Semantic Segmentation