Abstract:Because of its wide potential applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Until now, deep semantic segmentation network (DSSN) has achieved a certain degree of success on semantic segmentation of RS imagery and can obviously outperform the traditional methods based on hand-crafted features. As a classic data-driven technique, DSSN can be trained by an end-to-end mechanism and is competent for employing low-level and mid-level cues (i.e., the discriminative image structure) to understand RS images. However, its interpretability and reliability are poor due to the nature weakness of the data-driven deep learning methods. By contrast, human beings have an excellent inference capacity and can reliably interpret RS imagery with the basic RS domain knowledge. Ontological reasoning is an ideal way to imitate and employ the domain knowledge of human beings. However, it is still rarely explored and adopted in the RS domain. As a solution of the aforementioned critical limitation of DSSN, this study proposes a collaboratively boosting framework (CBF) to combine the data-driven deep learning module and knowledge-guided ontology reasoning module in an iterative manner. The deep learning module adopts the DSSN architecture and takes the integration of the original image and inferred channels as the input of the DSSN. In addition, the ontology reasoning module is composed of intra- and extra-taxonomy reasoning. More specifically, the intra-taxonomy reasoning directly corrects misclassifications of the deep learning module based on the domain knowledge, which is the key to improve the classification performance. The extra-taxonomy reasoning aims to generate the inferred channels beyond the current taxonomy to improve the discriminative performance of DSSN in the original RS image space. On the one hand, benefiting from the referred channels from the ontology reasoning module, the deep learning module using the integration of the original image and referred channels can achieve better classification performance than only using the original image. On the other hand, better classification results from the deep learning module further improve the performance of the ontology reasoning module. As a whole, the deep learning and ontology reasoning modules are mutually boosted in the iterations. Extensive experiments on two publicly open RS datasets such as UCM and ISPRS Potsdam show that our proposed CBF can outperform the competitive baselines with a large margin.

Integrating knowledge-based sparse representation for image detection

Joint CRF and Locality-Consistent Dictionary Learning for Semantic Segmentation.

Optimizing Spatial Relationships in GCN to Improve the Classification Accuracy of Remote Sensing Images

ISDNet: Integrating Shallow and Deep Networks for Efficient Ultra-high Resolution Segmentation

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection

Combining deep learning and ontology reasoning for remote sensing image semantic segmentation

SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-Based 3-D Object Detection

Learning to Reduce Information Bottleneck for Object Detection in Aerial Images

Hybrid Knowledge Routed Modules for Large-scale Object Detection

Implicit Kernel Sparse Shape Representation: A Sparse-Neighbors-Based Objection Segmentation Framework

Semantic-Context Graph Network for Point-based 3D Object Detection

Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection

Learning Coarse-To-Fine Sparselets For Efficient Object Detection And Scene Classification

A Stage-Adaptive Selective Network with Position Awareness for Semantic Segmentation of LULC Remote Sensing Images

Semantic Segmentation of Remote Sensing Images by Interactive Representation Refinement and Geometric Prior-Guided Inference

A deep learning based framework for remote sensing image ground object segmentation

A Hybrid Framework for Referring Image Segmentation: Dual-Decoder Model with SAM Complementation

An efficient concept detection system via sparse ensemble learning.

An Effective and Lightweight Hybrid Network for Object Detection in Remote Sensing Images

Semantic segmentation for remote sensing images via dense feature extraction and companion loss neural network

Global Context Dependencies Aware Network for Efficient Semantic Segmentation of Fine-Resolution Remoted Sensing Images