Abstract:Because of its wide potential applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Until now, deep semantic segmentation network (DSSN) has achieved a certain degree of success on semantic segmentation of RS imagery and can obviously outperform the traditional methods based on hand-crafted features. As a classic data-driven technique, DSSN can be trained by an end-to-end mechanism and is competent for employing low-level and mid-level cues (i.e., the discriminative image structure) to understand RS images. However, its interpretability and reliability are poor due to the nature weakness of the data-driven deep learning methods. By contrast, human beings have an excellent inference capacity and can reliably interpret RS imagery with the basic RS domain knowledge. Ontological reasoning is an ideal way to imitate and employ the domain knowledge of human beings. However, it is still rarely explored and adopted in the RS domain. As a solution of the aforementioned critical limitation of DSSN, this study proposes a collaboratively boosting framework (CBF) to combine the data-driven deep learning module and knowledge-guided ontology reasoning module in an iterative manner. The deep learning module adopts the DSSN architecture and takes the integration of the original image and inferred channels as the input of the DSSN. In addition, the ontology reasoning module is composed of intra- and extra-taxonomy reasoning. More specifically, the intra-taxonomy reasoning directly corrects misclassifications of the deep learning module based on the domain knowledge, which is the key to improve the classification performance. The extra-taxonomy reasoning aims to generate the inferred channels beyond the current taxonomy to improve the discriminative performance of DSSN in the original RS image space. On the one hand, benefiting from the referred channels from the ontology reasoning module, the deep learning module using the integration of the original image and referred channels can achieve better classification performance than only using the original image. On the other hand, better classification results from the deep learning module further improve the performance of the ontology reasoning module. As a whole, the deep learning and ontology reasoning modules are mutually boosted in the iterations. Extensive experiments on two publicly open RS datasets such as UCM and ISPRS Potsdam show that our proposed CBF can outperform the competitive baselines with a large margin.

Deep Semantic Understanding of High Resolution Remote Sensing Image

A lightweight and stochastic depth residual attention network for remote sensing scene classification

Semantic Relation Model and Dataset for Remote Sensing Scene Understanding

Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation

Hierarchical Self-Attention Embedded Neural Network With Dense Connection for Remote-Sensing Image Semantic Segmentation

Research on semantic segmentation algorithm of multispectral remote sensing image based on deep learning

Object-Enhanced Semantic Segmentation Model for High-Resolution Remote Sensing Images

A Multi-Modal High Spatial Resolution Aerial Imagery Scene Classification Model with Visual Enhancement

Exploring Models and Data for Remote Sensing Image Caption Generation

Semantic Segmentation With Attention Mechanism for Remote Sensing Images

Recurrent Attention and Semantic Gate for Remote Sensing Image Captioning

Improving Interpretability of Deep Neural Networks with Semantic Information

SSNet: A Novel Transformer and CNN Hybrid Network for Remote Sensing Semantic Segmentation

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge.

Combining deep learning and ontology reasoning for remote sensing image semantic segmentation

RS3Mamba: Visual State Space Model for Remote Sensing Image Semantic Segmentation

Remote sensing image captioning via Variational Autoencoder and Reinforcement Learning

Multi-modal Remote Sensing Image Description Based on Word Embedding and Self-Attention Mechanism

Multi-path cyclic residual network for semantic segmentation in remote sensing images

RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation