Few-shot remote sensing image scene classification: Recent advances, new baselines, and future trends
Chunping Qiu,Xiaoyu Zhang,Xiaochong Tong,Naiyang Guan,Xiaodong Yi,Ke Yang,Junjie Zhu,Anzhu Yu
DOI: https://doi.org/10.1016/j.isprsjprs.2024.02.005
IF: 12.7
2024-03-01
ISPRS Journal of Photogrammetry and Remote Sensing
Abstract:Remote sensing image scene classification (RSI-SC) is crucial for various high-level applications, including RSI retrieval, image captioning, and object detection. Deep learning-based methods can accurately predict scene categories. However, these approaches often require numerous labeled samples for training, limiting their practicality in real-world RS applications with scarce label resources. In contrast, few-shot remote sensing image scene classification (FS-RSI-SC) has garnered substantial research interest owing to its potential to mitigate the need for extensive training samples. In recent years, there has been a surge in studies on FS-RSI-SC. This paper presents a comprehensive overview of FS-RSI-SC research, categorizing existing methods into two groups. The first group comprises approaches based on data augmentation, transfer learning, metric learning, and meta-learning. Our analysis reveals that most existing FS-RSI-SC methods fall into the meta-learning category, employing attention mechanisms, self-supervised learning (SSL), and feature fusion techniques for enhanced performance. Additionally, transfer learning-based methods consistently outperform other approaches in this category. The second group is centered around large-scale pre-training, which has demonstrated remarkable competitiveness across various tasks, including FS-RSI-SC. This special group of methods has shown considerable potential and is expected to attract more attention with the increasing popularity of large-scale pre-training and the unimodal and multimodal foundation models. Moreover, we proposed a pipeline that harnesses the capabilities of powerful large vision-language models (VLMs) as image encoders, establishing new baselines for FS-RSI-SC on commonly used datasets under standard experimental settings. Our empirical results validated the effectiveness of utilizing large VLMs and highlighted their potential for FS-RSI-SC. Through a joint analysis of state-of-the-art methods and our experiments with VLMs, we identified the prevailing challenges in FS-RSI-SC and outlined promising directions for future research.
imaging science & photographic technology,remote sensing,geography, physical,geosciences, multidisciplinary