Abstract:Leveraging multi-view remote sensing images in scene classification tasks significantly enhances the precision of such classifications. This approach, however, poses challenges due to the simultaneous use of multi-view images, which often leads to a misalignment between the visual content and semantic labels, thus complicating the classification process. In addition, as the number of image viewpoints increases, the quality problem for remote sensing images further limits the effectiveness of multi-view image classification. Traditional scene classification methods predominantly employ SoftMax deep learning techniques, which lack the capability to assess the quality of remote sensing images or to provide explicit explanations for the network's predictive outcomes. To address these issues, this paper introduces a novel end-to-end multi-view decision fusion network specifically designed for remote sensing scene classification. The network integrates information from multi-view remote sensing images under the guidance of image credibility and uncertainty, and when the multi-view image fusion process encounters conflicts, it greatly alleviates the conflicts and provides more reasonable and credible predictions for the multi-view scene classification results. Initially, multi-scale features are extracted from the multi-view images using convolutional neural networks (CNNs). Following this, an asymptotic adaptive feature fusion module (AAFFM) is constructed to gradually integrate these multi-scale features. An adaptive spatial fusion method is then applied to assign different spatial weights to the multi-scale feature maps, thereby significantly enhancing the model's feature discrimination capability. Finally, an evidence decision fusion module (EDFM), utilizing evidence theory and the Dirichlet distribution, is developed. This module quantitatively assesses the uncertainty in the multi-perspective image classification process. Through the fusing of multi-perspective remote sensing image information in this module, a rational explanation for the prediction results is provided. The efficacy of the proposed method was validated through experiments conducted on the AiRound and CV-BrCT datasets. The results show that our method not only improves single-view scene classification results but also advances multi-view remote sensing scene classification results by accurately characterizing the scene and mitigating the conflicting nature of the fusion process.

Minimum Volume Simplex-Based Scene Representation and Attribute Recognition with Feature Fusion.

Scene Recognition with Objectness, Attribute and Category Learning

Remote Sensing Scene Classification Using Sparse Representation-Based Framework with Deep Feature Fusion

Enhanced Multi-Scale Feature Adaptive Fusion Sparse Convolutional Network for Large-Scale Scenes Semantic Segmentation

Multi-Level Ensemble Network for Scene Recognition.

Fusing Deep Local And Global Features For Remote Sensing Image Scene Classification

An Attribute-Based High-Level Image Representation for Scene Classification

Few-Shot Remote Sensing Scene Classification with Multi-Metric Fusion

Deep feature fusion through adaptive discriminative metric learning for scene recognition

Multi-Channel and Multi-Scale Mid-Level Image Representation for Scene Classification

Remote Sensing Image Scene Classification Based on Deep Multi-branch Feature Fusion Network

Adjacent-Atrous Mechanism for Expanding Global Receptive Fields: An End-to-End Network for Multiattribute Scene Analysis in Remote Sensing Imagery

Fusion by Synthesizing: A Multi-View Deep Neural Network for Zero-Shot Recognition

Scene Recognition Based on Feature Learning from Multi-Scale Salient Regions

Scene image categorization algorithm based on multi-level features representation

MVFF: Multi-view Feature Fusion for Few-shot Remote Sensing Image Scene Classification

Multilayer Feature Fusion Network for Scene Classification in Remote Sensing

G-MS2F: GoogLeNet Based Multi-Stage Feature Fusion of Deep CNN for Scene Recognition.

A Multi-Scale Approach For Remote Sensing Scene Classification Based On Feature Maps Selection And Region Representation

Multi-View Scene Classification Based on Feature Integration and Evidence Decision Fusion

Locally Supervised Deep Hybrid Model for Scene Recognition