Abstract:Because of its wide potential applications, remote sensing (RS) image semantic segmentation has attracted increasing research interest in recent years. Until now, deep semantic segmentation network (DSSN) has achieved a certain degree of success on semantic segmentation of RS imagery and can obviously outperform the traditional methods based on hand-crafted features. As a classic data-driven technique, DSSN can be trained by an end-to-end mechanism and is competent for employing low-level and mid-level cues (i.e., the discriminative image structure) to understand RS images. However, its interpretability and reliability are poor due to the nature weakness of the data-driven deep learning methods. By contrast, human beings have an excellent inference capacity and can reliably interpret RS imagery with the basic RS domain knowledge. Ontological reasoning is an ideal way to imitate and employ the domain knowledge of human beings. However, it is still rarely explored and adopted in the RS domain. As a solution of the aforementioned critical limitation of DSSN, this study proposes a collaboratively boosting framework (CBF) to combine the data-driven deep learning module and knowledge-guided ontology reasoning module in an iterative manner. The deep learning module adopts the DSSN architecture and takes the integration of the original image and inferred channels as the input of the DSSN. In addition, the ontology reasoning module is composed of intra- and extra-taxonomy reasoning. More specifically, the intra-taxonomy reasoning directly corrects misclassifications of the deep learning module based on the domain knowledge, which is the key to improve the classification performance. The extra-taxonomy reasoning aims to generate the inferred channels beyond the current taxonomy to improve the discriminative performance of DSSN in the original RS image space. On the one hand, benefiting from the referred channels from the ontology reasoning module, the deep learning module using the integration of the original image and referred channels can achieve better classification performance than only using the original image. On the other hand, better classification results from the deep learning module further improve the performance of the ontology reasoning module. As a whole, the deep learning and ontology reasoning modules are mutually boosted in the iterations. Extensive experiments on two publicly open RS datasets such as UCM and ISPRS Potsdam show that our proposed CBF can outperform the competitive baselines with a large margin.

Discriminative semantic region selection for fine-grained recognition.

Joint CRF and Locality-Consistent Dictionary Learning for Semantic Segmentation.

Learning Regions and Descriptors for Fine-grained Recognition

Discriminative Region Enhancing and Suppression Network for Fine-Grained Visual Categorization

Learning a Discriminative Feature Network for Semantic Segmentation

Selective Sparse Sampling for Fine-Grained Image Recognition

Combining deep learning and ontology reasoning for remote sensing image semantic segmentation

Discriminative Features Reconstruction Network For Semantic Segmentation

Semantic-Guided Information Alignment Network for Fine-Grained Image Recognition

Region-based Adaptive Association Learning for Robust Image Scene Recognition

A Top-Down Manner-Based DCNN Architecture for Semantic Image Segmentation.

Discriminative Segment Focus Network for Fine-grained Video Action Recognition

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Region based Ensemble Learning Network for Fine-grained Classification

Spatial-aware Collaborative Region Mining for Fine-Grained Recognition

Selecting Discriminative Features for Fine-Grained Visual Classification

Deep Feature Selection-And-Fusion for RGB-D Semantic Segmentation

On the Selective and Invariant Representation of DCNN for High-Resolution Remote Sensing Image Recognition

Aerial image semantic segmentation using DCNN predicted distance maps

Learning Discriminative Visual Dictionary for Natural Scene Categorization

Fine-Grained Visual Classification Via Simultaneously Learning of Multi-regional Multi-grained Features