Abstract:Convolutional neural networks (CNN) have attracted tremendous attention in the remote sensing community due to its excellent performance in different domains. Especially for remote sensing scene classification, the CNN-based methods have brought a great breakthrough. However, it is not feasible to fully design and train a new CNN model for remote sensing scene classification, as this usually requires a large number of training samples and high computational costs. To alleviate these limitations of fully training a new model, some work attempts to use the pretrained CNN models as feature extractors to build feature representation of scene images for classification and has achieved impressive results. In this scheme, how to construct feature representation of scene image via the pretrained CNN model becomes the key process. Existing studies paid a little attention to build more discriminative feature representation by exploring the potential benefits of multilayer features from a single CNN model and different feature representations from multiple CNN models. To this end, this paper presents a fusion strategy to build the feature representation of the scene images by integrating multilayer features of a single pretrained CNN model, and extends it to a framework of multiple CNN models. For these purposes, a multiscale improved Fisher kernel coding method is used to build feature representation of the scene images on convolutional layers, and a feature fusion approach based on two feature subspace learning methods [principal component analysis (PCA)/spectral regression kernel discriminant analysis and PCA/spectral regression kernel locality preserving projection] is proposed to construct final fused features for scene classification. For validation and comparison purposes, the proposed approaches are evaluated with two challenging high-resolution remote sensing datasets and shows the competitive performance compared with existing state-of-the-art baselines such as fully trained CNN models, fine tuning CNN models, and other related works.

Combing Triple-Part Features of Convolutional Neural Networks for Scene Classification in Remote Sensing

A lightweight and stochastic depth residual attention network for remote sensing scene classification

Fusing Deep Local And Global Features For Remote Sensing Image Scene Classification

Remote Sensing Scene Classification Using Sparse Representation-Based Framework with Deep Feature Fusion

Attention-Aware Deep Feature Embedding for Remote Sensing Image Scene Classification

Fusing Deep Features by Kernel Collaborative Representation for Remote Sensing Scene Classification

A Deep Neural Network Combined With Context Features for Remote Sensing Scene Classification

Transferring Deep Convolutional Neural Networks for the Scene Classification of High-Resolution Remote Sensing Imagery

Scene Classification Of High Resolution Remote Sensing Images Using Convolutional Neural Networks

Effective Multiscale Residual Network With High-Order Feature Representation for Optical Remote Sensing Scene Classification

When CNNs Meet Vision Transformer: A Joint Framework for Remote Sensing Scene Classification

Feature and Model Level Fusion of Pretrained CNN for Remote Sensing Scene Classification

Aggregating Rich Hierarchical Features for Scene Classification in Remote Sensing Imagery

Mining Hierarchical Information of CNNs for Scene Classification of VHR Remote Sensing Images.

Multi-Scale and Multi-Network Deep Feature Fusion for Discriminative Scene Classification of High-Resolution Remote Sensing Images

Scene Classification Based On A Deep Random-Scale Stretched Convolutional Neural Network

Integrating Multilayer Features of Convolutional Neural Networks for Remote Sensing Scene Classification.

Using Cnn-Based High-Level Features for Remote Sensing Scene Classification

Feature Sparsity in Convolutional Neural Networks for Scene Classification of Remote Sensing Image.

Scene Classification of High-Resolution Remote Sensing Image Using Transfer Learning with Multi-model Feature Extraction Framework.

Scene Classification Based on Multiscale Convolutional Neural Network.