Abstract: Advances in satellite image sensors have enabled the acquisition of high-resolution remote sensing (HRRS) images. However, interpreting these images accurately is challenging because of their complexity, as is obtaining the computational power needed to do so. This manuscript proposes a multi-stream convolutional neural network (CNN) fusion framework that integrates multiple image scales and multiple CNNs for HRRS image recognition. Pre-trained CNNs were used to learn and extract semantic features from multi-scale HRRS images; feature extraction with pre-trained CNNs is more efficient than training a CNN from scratch or fine-tuning one. Discriminative canonical correlation analysis (DCCA) was used to fuse the deep features extracted across CNNs and image scales. DCCA reduced the dimension of the CNN features while providing a discriminative representation by maximizing the within-class correlation and minimizing the between-class correlation. The proposed model was evaluated on the NWPU-RESISC45 and UC Merced datasets. The accuracy obtained with DCCA was 10% and 6% higher than that obtained with discriminant correlation analysis (DCA) on the NWPU-RESISC45 and UC Merced datasets, respectively. The advantage of DCCA was more pronounced on NWPU-RESISC45 because this dataset has richer within-class variability. While both DCA and DCCA minimize between-class correlation, only DCCA maximizes within-class correlation and therefore attains better accuracy. The proposed framework achieved higher accuracy than all state-of-the-art frameworks involving unsupervised learning and pre-trained CNNs, and 2–3% higher accuracy than the majority of fine-tuned CNNs. It also offers a computational time advantage, requiring only 13 s for training on NWPU-RESISC45, compared to a day for fine-tuning existing CNNs. Thus, the proposed framework achieves a favourable balance between efficiency and accuracy in HRRS image recognition.
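The DCCA fusion step described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the commonly used DCCA formulation, in which the two feature sets are mean-centered, so the within-class and between-class correlation matrices sum to zero and maximizing the former also minimizes the latter. The function names `fit_dcca` and `inv_sqrt`, the regularization term `reg`, the synthetic data, and the choice to fuse by concatenation are all assumptions made for illustration.

```python
import numpy as np

def inv_sqrt(S):
    """Inverse square root of a symmetric positive-definite matrix."""
    vals, vecs = np.linalg.eigh(S)
    return (vecs / np.sqrt(vals)) @ vecs.T

def fit_dcca(X, Y, labels, dim, reg=1e-3):
    """Sketch of DCCA for two feature sets X (n, p) and Y (n, q).

    Returns the feature means and projection matrices Wx (p, dim), Wy (q, dim).
    `dim` should not exceed (number of classes - 1), the rank of Cw.
    """
    mx, my = X.mean(axis=0), Y.mean(axis=0)
    Xc, Yc = X - mx, Y - my
    classes = np.unique(labels)
    # Within-class correlation: sum over classes of the outer product of class sums.
    sum_x = np.stack([Xc[labels == c].sum(axis=0) for c in classes])
    sum_y = np.stack([Yc[labels == c].sum(axis=0) for c in classes])
    Cw = sum_x.T @ sum_y
    # Regularized scatter matrices (reg is a hypothetical stabilizer, not from the source).
    Sxx = Xc.T @ Xc + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc + reg * np.eye(Y.shape[1])
    Kx, Ky = inv_sqrt(Sxx), inv_sqrt(Syy)
    # Whitened within-class correlation; its leading singular vectors give the projections.
    U, _, Vt = np.linalg.svd(Kx @ Cw @ Ky, full_matrices=False)
    Wx = Kx @ U[:, :dim]
    Wy = Ky @ Vt.T[:, :dim]
    return mx, my, Wx, Wy

# Synthetic stand-ins for deep features from two CNNs (or two image scales).
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 30)            # 3 classes, 30 samples each
X = rng.normal(size=(90, 8)) + labels[:, None]  # "CNN A" features
Y = rng.normal(size=(90, 6)) - labels[:, None]  # "CNN B" features

mx, my, Wx, Wy = fit_dcca(X, Y, labels, dim=2)
# Fuse by concatenating the projected features (summation is another common choice).
fused = np.hstack([(X - mx) @ Wx, (Y - my) @ Wy])
```

In this sketch the fused vector has `2 * dim` entries per image, far fewer than the raw CNN features, and would then be passed to a classifier. With more than two feature sets, one option is to apply the same step pairwise, though how the paper chains the fusions is not stated in the abstract.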

DRC: Discrete Representation Classifier with Salient Features Via Fixed-prototype

Efficient Classification Using Salient Regions

Dynamic Convolution Covariance Network Using Multi-Scale Feature Fusion for Remote Sensing Scene Image Classification

Accurate salient object detection via dense recurrent connections and residual-based hierarchical feature integration.

Improving the Separability of Deep Features with Discriminative Convolution Filters for RSI Classification.

Duplex-Hierarchy Representation Learning for Remote Sensing Image Classification

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN

Deep Discriminative Representation Learning with Attention Map for Scene Classification

Towards Learning Spatially Discriminative Feature Representations

DRCNN: Dynamic Routing Convolutional Neural Network for Multi-View 3D Object Recognition

A Remote-Sensing Scene-Image Classification Method Based on Deep Multiple-Instance Learning with a Residual Dense Attention ConvNet

Multi-Scale and Multi-Network Deep Feature Fusion for Discriminative Scene Classification of High-Resolution Remote Sensing Images

DECOR: Dynamic Decoupling and Multi-Objective Optimization for Long-tailed Remote Sensing Image Classification

A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling

Learning Implicit Class Knowledge for RGB-D Co-Salient Object Detection With Transformers

DBCvT: Double Branch Convolutional Transformer for Medical Image Classification

An Efficient Hyperspectral Image Classification Method Using Deep Fusion of 3-D Discrete Wavelet Transform and CNN.

High-Resolution Remote Sensing Image Classification Method Based on Convolutional Neural Network and Restricted Conditional Random Field

A Robust Feature Downsampling Module for Remote-Sensing Visual Tasks

Query-Expanded Collaborative Representation Based Classification with Class-Specific Prototypes for Object Recognition

CR-CAM: Generating explanations for deep neural networks by contrasting and ranking features