Abstract:With more detailed spatial information being represented in very-high-resolution (VHR) remote sensing images, stringent requirements are imposed on accurate image classification. Due to the diverse land objects with intraclass variation and interclass similarity, efficient and fine classification of VHR images especially in complex scenes are challenging. Even for some popular deep learning (DL) frameworks, geometric details of land objects may be lost in deep feature levels, so it is difficult to maintain the highly detailed spatial information (e.g., edges, small objects) only relying on the last high-level layer. Moreover, many of the newly developed DL methods require massive well-labeled samples, which inevitably deteriorates the model generalization ability under the few-shot learning. Therefore, in this article, a lightweight shallow-to-deep feature fusion network (SDF 2 N) is proposed for VHR image classification, where the traditional machine learning (ML) and DL schemes are integrated to learn rich and representative information to improve the classification accuracy. In particular, the shallow spectral–spatial features are first extracted and then a novel triple-stage fusion (TSF) module is designed to learn the saliency and discriminative information at different levels for classification. The TSF module includes three feature fusion stages, that is, low-level spectral–spatial feature fusion, middle-level multiscale feature fusion, and high-level multilayer feature fusion. The proposed SDF 2 N takes the advantage of the shallow-to-deep features, which can extract representative and complementary information from crossing layers. It is important to note that even with limited training samples, the SDF 2 N still can achieve satisfying classification performance. Experimental results obtained on three real VHR remote sensing datasets including two multispectral and one airborne hyperspectral - mages covering complex urban scenarios confirm the effectiveness of the proposed approach compared with the state-of-the-art methods.

Learning Deep Classifiers With Deep Features

Research on Image Classification Method of Features of Combinatorial Convolution

Decompose Learning: Combine Feature Extraction and Classification

Two-level Hierarchical Feature Learning for Image Classification

Hierarchical Gate Network for Fine-Grained Visual Recognition.

Melanoma Classification in Dermoscopy Images via Ensemble Learning on Deep Neural Network

Classification and Representation Joint Learning Via Deep Networks.

Embedding Label Structures for Fine-Grained Feature Representation

Exemplar Based Deep Discriminative and Shareable Feature Learning for Scene Image Classification

A Shallow-to-Deep Feature Fusion Network for VHR Remote Sensing Image Classification

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition.

Multi-scale and Discriminative Part Detectors Based Features for Multi-label Image Classification.

HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition

Two-Stage Selective Ensemble of CNN via Deep Tree Training for Medical Image Classification

Group Based Deep Shared Feature Learning for Fine-grained Image Classification

A Deep Neural Network Combined With Context Features for Remote Sensing Scene Classification

MDFN: Multi-scale deep feature learning network for object detection

Hierarchical Self-Distilled Feature Learning for Fine-Grained Visual Categorization

Classifying airborne LiDAR point clouds via deep features learned by a multi-scale convolutional neural network

A Remote-Sensing Scene-Image Classification Method Based on Deep Multiple-Instance Learning with a Residual Dense Attention ConvNet

When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs