Abstract: Onboard land cover classification provides ever-updating land cover information, supporting various intelligent satellite applications that demand timely autonomous decision-making based on current and continuous land cover data. However, due to space, weight, and power constraints, satellites possess limited computational resources, rendering them unable to execute conventional land cover classification networks. In response to this challenge, we have designed a lightweight network for land cover classification featuring two efficient transformer attention mechanisms enhanced by multigranularity tokens. Diverging from traditional transformer attention mechanisms that solely capture token-to-token correlations at a single granularity, our approach splits the tokens into four segments and uses atrous convolutions across various dilation rates to aggregate token segments from diverse receptive fields, forming token segment combinations that encompass not only point information but also information from patches of varying sizes. These multigranularity tokens are subsequently processed through the windowed squeeze axial transformer attention (WSATA) and multigranularity bilevel routing attention (MGBRA) for feature enhancement. In another aspect, empirical observations reveal that prediction errors are more prone to manifest on land covers of small extent; however, conventional methods treat all pixels uniformly. This realization motivates us to propose a novel network-agnostic loss named connected component loss (CCL), which specifically targets small-scale land covers and their boundaries. Quantitative metrics and visual interpretations from comprehensive experiments confirm that our method attains state-of-the-art accuracy on two land cover classification datasets while exhibiting significantly faster inference speed than other lightweight networks, underscoring the practical potential of our method on embedded systems.
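The abstract does not give the formula for the connected component loss, so the following is only a minimal sketch of the general idea it describes: find connected regions in the ground-truth label map and give pixels in small regions a larger weight in a per-pixel cross-entropy. The function names, the 4-connectivity, the `small_area` threshold, and the `boost` factor are all assumptions made for illustration and are not taken from the paper; the boundary term mentioned in the abstract is not modeled here.

```python
import math
from collections import deque


def connected_components(labels):
    """Group 4-connected pixels of the same class.

    Returns (component id per pixel, list of component sizes).
    """
    h, w = len(labels), len(labels[0])
    comp = [[-1] * w for _ in range(h)]
    sizes = []
    for r in range(h):
        for c in range(w):
            if comp[r][c] != -1:
                continue  # pixel already belongs to a component
            cid = len(sizes)
            comp[r][c] = cid
            queue = deque([(r, c)])
            count = 0
            while queue:
                y, x = queue.popleft()
                count += 1
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < h and 0 <= nx < w
                            and comp[ny][nx] == -1
                            and labels[ny][nx] == labels[r][c]):
                        comp[ny][nx] = cid
                        queue.append((ny, nx))
            sizes.append(count)
    return comp, sizes


def small_region_weights(labels, small_area=4, boost=2.0):
    """Hypothetical weighting: pixels in components of at most
    `small_area` pixels get weight `boost`, all others get 1.0."""
    comp, sizes = connected_components(labels)
    return [[boost if sizes[cid] <= small_area else 1.0 for cid in row]
            for row in comp]


def weighted_cross_entropy(probs, labels, weights):
    """Weight-normalized cross-entropy; probs[r][c] is a list of
    class probabilities for pixel (r, c)."""
    total, norm = 0.0, 0.0
    for r, row in enumerate(labels):
        for c, cls in enumerate(row):
            p = max(probs[r][c][cls], 1e-12)  # guard against log(0)
            total += -weights[r][c] * math.log(p)
            norm += weights[r][c]
    return total / norm


# Toy label map: a single-pixel land cover surrounded by another class.
toy_labels = [[0, 0, 0],
              [0, 1, 0],
              [0, 0, 0]]
toy_weights = small_region_weights(toy_labels)
toy_probs = [[[0.5, 0.5] for _ in range(3)] for _ in range(3)]
toy_loss = weighted_cross_entropy(toy_probs, toy_labels, toy_weights)
```

In this toy case only the isolated center pixel is up-weighted, so a misprediction there would cost more than one on the large surrounding region. Because the weights depend only on the labels, a scheme of this kind can be attached to any segmentation network, which is consistent with the abstract's description of the loss as network-agnostic.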

A Novel Lightweight Attention-Discarding Transformer for High-Resolution SAR Image Classification

A Novel Transformer Network with a CNN-Enhanced Cross-Attention Mechanism for Hyperspectral Image Classification

Towards SAR Automatic Target Recognition: Multi-Category SAR Image Classification Based on Light Weight Vision Transformer

A Lightweight Transformer Network for Hyperspectral Image Classification

ViT-LSLA: Vision Transformer with Light Self-Limited-Attention

High Resolution SAR Image Classification Using Global-Local Network Structure Based on Vision Transformer and CNN

A Lightweight Dual-Branch Swin Transformer for Remote Sensing Scene Classification

Hyperspectral Image Classification Using Groupwise Separable Convolutional Vision Transformer Network

Deep Hierarchical Vision Transformer for Hyperspectral and LiDAR Data Classification

Lightweight Structure-aware Transformer Network for VHR Remote Sensing Image Change Detection

A novel dual-granularity lightweight transformer for vision tasks

Lite Vision Transformer with Enhanced Self-Attention

A lightweight hybrid vision transformer network for radar-based human activity recognition

A lightweight transformer with linear self‐attention for defect recognition

Hierarchical Attention Transformer for Hyperspectral Image Classification

LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition

Joint Classification of Hyperspectral and LiDAR Data Based on Adaptive Gating Mechanism and Learnable Transformer

Contrastive Learning With Context-Augmented Transformer for Change Detection in SAR Images

CViTF-Net: A Convolutional and Visual Transformer Fusion Network for Small Ship Target Detection in Synthetic Aperture Radar Images

A Lightweight Transformer With Multigranularity Tokens and Connected Component Loss for Land Cover Classification