Transfer Learning with Pretrained Remote Sensing Transformers

Anthony Fuller, Koreen Millard, James R. Green

DOI: https://doi.org/10.48550/arXiv.2209.14969

2022-09-29

Abstract: Although the remote sensing (RS) community has begun to pretrain transformers (intended to be fine-tuned on RS tasks), it is unclear how these models perform under distribution shifts. Here, we pretrain a new RS transformer, called SatViT-V2, on 1.3 million satellite-derived RS images, then fine-tune it (along with five other models) to investigate how it performs on distributions not seen during training. We split an expertly labeled land cover dataset into 14 datasets based on source biome. We train each model on each biome separately and test them on all other biomes. In all, this amounts to 1638 biome transfer experiments. After fine-tuning, we find that SatViT-V2 outperforms SatViT-V1 by 3.1% on in-distribution (matching biomes) and 2.8% on out-of-distribution (mismatching biomes) data. Additionally, we find that initializing fine-tuning from the linear probed solution (i.e., leveraging LPFT [1]) improves SatViT-V2's performance by another 1.2% on in-distribution and 2.4% on out-of-distribution data. Next, we find that pretrained RS transformers are better calibrated under distribution shifts than non-pretrained models and leveraging LPFT results in further improvements in model calibration. Lastly, we find that five measures of distribution shift are moderately correlated with biome transfer performance. We share code and pretrained model weights (https://github.com/antofuller/SatViT).
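The train-on-one-biome, test-on-every-biome protocol described in the abstract can be sketched as a simple enumeration of ordered biome pairs. This is only an illustrative sketch: the biome names are hypothetical placeholders, and `transfer_pairs` and `summarize` are helper names invented here, not functions from the authors' repository. With 14 biomes there are 14 matching (in-distribution) pairs and 14 × 13 = 182 mismatching (out-of-distribution) pairs per model configuration.

```python
from itertools import product
from statistics import mean


def transfer_pairs(biomes):
    """Split all ordered (train, test) biome pairs into ID and OOD groups."""
    in_dist = [(b, b) for b in biomes]
    out_dist = [(s, t) for s, t in product(biomes, repeat=2) if s != t]
    return in_dist, out_dist


def summarize(accuracy, biomes):
    """Average a {(train_biome, test_biome): accuracy} table per group."""
    in_dist, out_dist = transfer_pairs(biomes)
    return {
        "in_distribution": mean(accuracy[p] for p in in_dist),
        "out_of_distribution": mean(accuracy[p] for p in out_dist),
    }


# Hypothetical placeholder names; the actual biome labels are not listed here.
biomes = [f"biome_{i:02d}" for i in range(14)]
in_dist, out_dist = transfer_pairs(biomes)
print(len(in_dist), len(out_dist))  # prints "14 182"
```

Reporting the two averages separately mirrors how the abstract states its gains, once for matching biomes and once for mismatching biomes.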

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

This paper investigates how pretrained remote sensing (RS) transformers perform, and how well calibrated they are, under distribution shifts. Specifically, the researchers focus on the following points:

1. **The impact of pretraining on performance**: The researchers pretrain a new RS transformer, SatViT-V2, and study how well it transfers between biomes. In particular, they ask whether pretraining improves performance on distributions not seen during training.
2. **The effect of Linear Probe Fine-Tuning (LPFT)**: The researchers test whether initializing fine-tuning from the linear-probed solution improves performance under distribution shifts while maintaining or improving calibration.
3. **Model calibration**: The researchers evaluate whether the probabilities predicted by pretrained models reflect the true uncertainty of their predictions when transferring between biomes, that is, whether a model's predicted confidence matches how often its predictions are actually correct.
4. **Measurement of distribution shift**: The researchers examine five measures of distribution shift and how strongly each correlates with biome transfer performance. Such measures could help anticipate how a model will perform on a target dataset.

Through these studies, the authors aim to give RS researchers and practitioners practical guidance on applying transfer learning with large pretrained transformer models.
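To make the calibration question above concrete, here is a minimal sketch of expected calibration error (ECE), one common way to compare predicted confidence against observed accuracy. Whether the paper reports this exact metric or binning scheme is an assumption; the function name, the 10-bin default, and the half-open binning are choices made for this illustration.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Weighted average of |accuracy - mean confidence| over confidence bins.

    confidences: predicted probability of the chosen class, each in [0, 1].
    correct: booleans, True where the prediction matched the label.
    """
    if len(confidences) != len(correct) or not confidences:
        raise ValueError("inputs must be non-empty and of equal length")

    # Equal-width bins [k/n_bins, (k+1)/n_bins); confidence 1.0 joins the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        index = min(int(conf * n_bins), n_bins - 1)
        bins[index].append((conf, hit))

    total = len(confidences)
    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, h in members if h) / len(members)
        ece += (len(members) / total) * abs(accuracy - mean_conf)
    return ece


# Four predictions at 90% confidence, three of them correct:
# accuracy is 0.75, so the gap to the stated confidence is about 0.15.
print(expected_calibration_error([0.9, 0.9, 0.9, 0.9], [True, True, True, False]))
```

A lower value means the stated confidences track the observed accuracy more closely; computing it separately on matching and mismatching biomes would show how calibration changes under shift.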

An Empirical Study of Remote Sensing Pretraining

Lightweight, Pre-trained Transformers for Remote Sensing Timeseries

Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery

Transformer based ensemble deep learning approach for remote sensing natural scene classification

ExPLoRA: Parameter-Efficient Extended Pre-Training to Adapt Vision Transformers under Domain Shifts

Onboard Satellite Image Classification for Earth Observation: A Comparative Study of ViT Models

Transferring CNN With Adaptive Learning for Remote Sensing Scene Classification

Advancing Plain Vision Transformer Toward Remote Sensing Foundation Model

Transformers for Remote Sensing: A Systematic Review and Analysis

Boosting Vanilla Lightweight Vision Transformers Via Re-parameterization

Multi-label remote sensing classification with self-supervised gated multi-modal transformers

SatlasPretrain: A Large-Scale Dataset for Remote Sensing Image Understanding

Automated classification of remote sensing satellite images using deep learning based vision transformer

Vision Transformer for Multispectral Satellite Imagery: Advancing Landcover Classification

Vision Transformer With Contrastive Learning for Remote Sensing Image Scene Classification

Self-supervised Vision Transformers for Land-cover Segmentation and Classification

Cross-resolution land cover classification using outdated products and transformers

Deep Learning Model Transfer in Forest Mapping using Multi-source Satellite SAR and Optical Images

On the Surprising Effectiveness of Attention Transfer for Vision Transformers