Abstract: As a content management technique, remote sensing (RS) scene classification (RSSC) has long attracted researchers’ attention. Over the past decades, many successful methods have been proposed. Nevertheless, they presuppose large labeled datasets, which is a demanding requirement in practice. To resolve this conflict, developing RSSC models with the help of few-shot learning (FSL) has become popular. Because prior knowledge is lacking, most existing few-shot RSSC models focus on the learning algorithm. However, they pay little attention to the complex contents within RS scenes and the intricate interclass/intraclass relations between RS scenes, which degrades their performance. In this article, we propose a new few-shot RSSC model named multipretext-task prototypes guided dynamic contrastive learning network (MPCL-Net). MPCL-Net consists of a multipretext tasks generation submodule, a deep feature learning submodule, and a joint optimization submodule. First, in the multipretext tasks generation submodule, two RS-oriented pretext tasks are constructed under the self-supervised learning (SSL) framework to explore multiscale and rotation-invariant information from RS scenes. Second, a simple convolutional neural network (CNN) is developed in the deep feature learning submodule to transform the RS scenes into visual features. Third, three loss functions are formulated and integrated in the joint optimization submodule. Their goals are to fully capture the diverse land covers within RS scenes and to compact intraclass samples while separating interclass samples under limited supervision. Finally, MPCL-Net is trained in a meta-learning manner. Positive results on three public RS scene datasets confirm that MPCL-Net is helpful for RSSC tasks in the few-shot scenario. Our source code is available at https://github.com/TangXu-Group/Remote-Sensing-Images-Classification/tree/main/MPCL.
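
To make the ideas in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It assumes a rotation pretext task of the common "predict the rotation angle" form and a standard prototype-based few-shot classifier (class means of support embeddings, nearest-prototype assignment). The abstract does not specify the actual pretext tasks, CNN, or loss functions, so every name below (`rotate90`, `rotation_pretext`, `class_prototypes`, `nearest_prototype`) and the toy list-based image format are hypothetical.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical toy format: a single-band image as a 2-D grid of numbers.
Image = List[List[float]]


def rotate90(img: Image) -> Image:
    """Rotate a 2-D grid by 90 degrees clockwise."""
    return [list(row) for row in zip(*img[::-1])]


def rotation_pretext(img: Image) -> List[Tuple[Image, int]]:
    """Return four views rotated by 0, 90, 180, 270 degrees.

    Each view carries a pseudo-label 0..3 that a self-supervised head
    could be trained to predict (an assumed pretext-task design).
    """
    views = []
    current = [list(row) for row in img]
    for label in range(4):
        views.append((current, label))
        current = rotate90(current)
    return views


def class_prototypes(
    embeddings: Sequence[Sequence[float]], labels: Sequence[int]
) -> Dict[int, List[float]]:
    """Average the support embeddings of each class into one prototype."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for vec, lab in zip(embeddings, labels):
        acc = sums.setdefault(lab, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
        counts[lab] = counts.get(lab, 0) + 1
    return {lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()}


def nearest_prototype(
    query: Sequence[float], prototypes: Dict[int, List[float]]
) -> int:
    """Assign a query embedding to the class with the closest prototype."""

    def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(prototypes, key=lambda lab: squared_distance(query, prototypes[lab]))


if __name__ == "__main__":
    scene = [[1.0, 2.0], [3.0, 4.0]]
    for view, pseudo_label in rotation_pretext(scene):
        print(pseudo_label, view)

    # A toy 2-way 2-shot episode with hand-written 2-D "embeddings".
    support = [[0.0, 0.0], [0.0, 2.0], [10.0, 10.0], [12.0, 10.0]]
    support_labels = [0, 0, 1, 1]
    protos = class_prototypes(support, support_labels)
    print(nearest_prototype([9.0, 9.0], protos))
```

In a meta-trained model, embeddings like these would come from the CNN feature extractor, and the prototypes would be recomputed for every sampled episode. The plain lists here stand in for those learned features purely for illustration.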

Supplementary for APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP

APPLeNet: Visual Attention Parameterized Prompt Learning for Few-Shot Remote Sensing Image Generalization using CLIP

LPNet: A Remote Sensing Scene Classification Method Based on Large Kernel Convolution and Parameter Fusion

A lightweight and stochastic depth residual attention network for remote sensing scene classification

Few-Shot Scene Classification of Optical Remote Sensing Images Leveraging Calibrated Pretext Tasks

Multi-pretext-task Prototypes Guided Dynamic Contrastive Learning Network for Few-shot Remote Sensing Scene Classification

RS-SSKD: Self-Supervision Equipped with Knowledge Distillation for Few-Shot Remote Sensing Scene Classification

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing

Attention-Based Contrastive Learning for Few-Shot Remote Sensing Image Classification

SCL-MLNet: Boosting Few-Shot Remote Sensing Scene Classification via Self-Supervised Contrastive Learning

SPNet: Siamese-Prototype Network for Few-Shot Remote Sensing Image Scene Classification

Attention Based Network for Remote Sensing Scene Classification

DLA-MatchNet for Few-Shot Remote Sensing Image Scene Classification

PatternNet: A benchmark dataset for performance evaluation of remote sensing image retrieval

Adaptive Discriminative Regions Learning Network for Remote Sensing Scene Classification

Scene Classification of Remote Sensing Images Based on Saliency Dual Attention Residual Network

Class-level Prototype Guided Multi-Scale Feature Learning for Remote Sensing Scene Classification with Limited Labels

Scene Classification with Recurrent Attention of VHR Remote Sensing Images

Remote Sensing Image Classification with the SEN12MS Dataset

An Efficient and Lightweight Convolutional Neural Network for Remote Sensing Image Scene Classification