Abstract:<p>Owing to its wide range of applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Benefiting from its hierarchical abstraction ability, the deep semantic segmentation network (DSSN) has achieved tremendous success on RS image semantic segmentation and has gradually become the mainstream technology. However, the superior performance of DSSN depends heavily on two conditions: (I) massive quantities of labeled training data exist; (II) the testing data closely resemble the training data. In actual RS applications, it is difficult to fully meet these conditions because of variation in RS sensors and distinct landscape variation across geographic locations. To make DSSN fit the actual RS scenario, this paper explores the cross-domain RS image semantic segmentation task, in which DSSN is trained on one labeled dataset (i.e., the source domain) but is tested on a different dataset (i.e., the target domain). In this setting, the performance of DSSN is inevitably very limited due to the data shift between the source and target domains. To reduce the adverse influence of data shift, this paper proposes a novel objective function with multiple weakly-supervised constraints to learn DSSN for cross-domain RS image semantic segmentation. Based on a careful examination of the characteristics of cross-domain RS image semantic segmentation, the multiple weakly-supervised constraints comprise the weakly-supervised transfer invariant constraint (WTIC), the weakly-supervised pseudo-label constraint (WPLC) and the weakly-supervised rotation consistency constraint (WRCC). Specifically, DualGAN is recommended for unsupervised style transfer between the source and target domains in order to carry out WTIC. To make full use of the merits of multiple constraints, this paper presents a dynamic optimization strategy that dynamically adjusts the constraint weights of the objective function during the training process.
With full consideration of the characteristics of the cross-domain RS image semantic segmentation task, this paper defines two cross-domain RS image semantic segmentation settings: (I) variation in geographic location and (II) variation in both geographic location and imaging mode. Extensive experiments demonstrate that the proposed method remarkably outperforms the state-of-the-art methods under both settings. The collected datasets and evaluation benchmarks have been made publicly available online (<a href="https://github.com/te-shi/MUCSS">https://github.com/te-shi/MUCSS</a>).</p>
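
The abstract names a rotation consistency constraint (WRCC) but does not state its form. As a rough illustration of the general idea only, the sketch below measures how often a segmenter's prediction on a rotated image disagrees with the rotated prediction on the original image. The helper names `rot90` and `rotation_disagreement`, the hard-label comparison, and the toy predictors are all assumptions made for this example; they are not taken from the paper, which may define the constraint differently (for instance on soft probabilities).

```python
from typing import Callable, List

Grid = List[List[int]]  # a 2D image or label map, stored as rows of integers


def rot90(grid: Grid) -> Grid:
    """Rotate a 2D grid by 90 degrees counter-clockwise."""
    # Transpose with zip, then reverse the row order.
    return [list(row) for row in zip(*grid)][::-1]


def rotation_disagreement(predict: Callable[[Grid], Grid], image: Grid) -> float:
    """Fraction of pixels where predict(rot90(image)) != rot90(predict(image)).

    Hypothetical measure: 0.0 means the predictor is consistent under this
    rotation for this image; larger values mean more inconsistency.
    """
    pred_of_rotated = predict(rot90(image))
    rotated_pred = rot90(predict(image))
    total = sum(len(row) for row in rotated_pred)
    differing = sum(
        a != b
        for row_a, row_b in zip(pred_of_rotated, rotated_pred)
        for a, b in zip(row_a, row_b)
    )
    return differing / total


def threshold_predictor(image: Grid) -> Grid:
    """Toy pointwise segmenter: label 1 where the pixel value exceeds 5."""
    return [[1 if value > 5 else 0 for value in row] for row in image]


def top_row_predictor(image: Grid) -> Grid:
    """Toy position-dependent segmenter: label 1 on the first row only."""
    return [[1 if r == 0 else 0 for _ in row] for r, row in enumerate(image)]


image = [[1, 9], [7, 2]]
consistent_score = rotation_disagreement(threshold_predictor, image)
inconsistent_score = rotation_disagreement(top_row_predictor, image)
```

In this toy setup the pointwise predictor is consistent under rotation, while the position-dependent one is not. A training objective in this spirit would penalize the second kind of behavior on unlabeled target-domain images, which is why such a term needs no ground-truth labels.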

Weakly Supervised Training of Universal Visual Concepts for Multi-domain Semantic Segmentation

In Defense Of Multi-Source Omni-Supervised Efficient Convnet For Robust Semantic Segmentation In Heterogeneous Unseen Domains

Omnisupervised Omnidirectional Semantic Segmentation

Semi-Supervised Learning for Visual Bird's Eye View Semantic Segmentation

MSeg: A Composite Dataset for Multi-domain Semantic Segmentation

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Learning 3D Semantic Segmentation with only 2D Image Supervision

Learning deep semantic segmentation network under multiple weakly-supervised constraints for cross-domain remote sensing image semantic segmentation

Multi-dataset Pretraining: A Unified Model for Semantic Segmentation

Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation

Self-supervised Semantic Segmentation Grounded in Visual Concepts

Training Semantic Segmentation on Heterogeneous Datasets

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning

Unsupervised cross domain semantic segmentation with mutual refinement and information distillation

Multi-Target Unsupervised Domain Adaptation for Semantic Segmentation without External Data

An Empirical Study on Multi-domain Robust Semantic Segmentation

Scaling up Multi-domain Semantic Segmentation with Sentence Embeddings

Learning Open-vocabulary Semantic Segmentation Models from Natural Language Supervision

Contrastive Learning and Self-Training for Unsupervised Domain Adaptation in Semantic Segmentation

On Boosting Semantic Street Scene Segmentation with Weak Supervision

Multichannel Semantic Segmentation with Unsupervised Domain Adaptation