Multi-Year Time Series Transfer Learning: Application of Early Crop Classification

Matej Račič,Krištof Oštir,Anže Zupanc,Luka Čehovin Zajc
DOI: https://doi.org/10.3390/rs16020270
IF: 5
2024-01-11
Remote Sensing
Abstract:Crop classification is an important task in remote sensing with many applications, such as estimating yields, detecting crop diseases and pests, and ensuring food security. In this study, we combined knowledge from remote sensing, machine learning, and agriculture to investigate the application of transfer learning with a transformer model for variable length satellite image time series (SITS). The objective was to produce a map of agricultural land, reduce required interventions, and limit in-field visits. Specifically, we aimed to provide reliable agricultural land class predictions in a timely manner and quantify the necessary amount of reference parcels to achieve these outcomes. Our dataset consisted of Sentinel-2 satellite imagery and reference crop labels for Slovenia spanning over years 2019, 2020, and 2021. We evaluated adaptability through fine-tuning in a real-world scenario of early crop classification with limited up-to-date reference data. The base model trained on a different year achieved an average F1 score of 82.5% for the target year without having a reference from the target year. To increase accuracy with a new model trained from scratch, an average of 48,000 samples are required in the target year. Using transfer learning, the pre-trained models can be efficiently adapted to an unknown year, requiring less than 0.3% (1500) samples from the dataset. Building on this, we show that transfer learning can outperform the baseline in the context of early classification with only 9% of the data after 210 days in the year.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to apply transfer learning for early crop classification in satellite image time series (SITS) over many years. Specifically, the research aims to process satellite image time - series data of different lengths by using the Transformer model to achieve the following goals: 1. **Generate agricultural land maps**: Generate detailed distribution maps of agricultural land by early identification of crop types. 2. **Reduce the need for intervention**: Reduce the need for field visits by providing timely and reliable crop - classification predictions. 3. **Quantify the number of reference plots**: Determine the minimum number of reference plots required to achieve the above - mentioned goals. ### Research background - **Importance of crop classification**: Crop classification has many applications in remote sensing, such as estimating yield, detecting crop diseases and pests, and ensuring food security. - **Limitations of existing methods**: Most existing studies use data from a single year for training and testing, which limits the generalization ability of the model. Changes in meteorological conditions can affect the development of vegetation, resulting in different time - series characteristics of the same crop in different years. - **Advantages of transfer learning**: Through transfer learning, historical data can be used to pre - train the model and then fine - tune it in the target year, thereby improving the performance of the model, especially when the reference data is limited. ### Research methods - **Dataset**: The research used Sentinel - 2 satellite images and reference crop labels in Slovenia from 2019 to 2021. - **Model architecture**: The SITS - BERT model based on Transformer was adopted, which can process time - series data of different lengths and learn general spectral - temporal features using a self - supervised pre - training scheme. - **Experimental design**: - **Temporal generalization**: Evaluate the generalization ability of the model between different years. - **Under - sampling**: Study the impact of a limited sample size on the performance of the model. - **Transfer learning**: Use the pre - trained model and limited data in the target year for fine - tuning to optimize the performance of the model. - **Early classification**: Combine the above experiments to evaluate the impact of limited time information on the performance of the model, simulating the situation where new observation data gradually increases in practical applications. ### Main results - **Temporal generalization**: The model performs well in the training years and can also maintain high performance in unseen years. - **Under - sampling**: As the number of samples increases, the performance of the model gradually improves. When the number of samples reaches 48,000, the performance of the model trained from scratch exceeds that of the baseline model. - **Transfer learning**: After fine - tuning the pre - trained model in the target year, less than 0.3% of the samples are required to achieve good performance. - **Early classification**: In practical applications, the model can provide reliable crop - classification predictions early in the growing season, and only 9% of the data is required to exceed the performance of the baseline model after 210 days. ### Conclusion This research demonstrates the effectiveness and advantages of transfer learning in early crop classification, especially when the reference data is limited. By using historical data to pre - train the model and then fine - tuning it in the target year, the performance and generalization ability of the model can be significantly improved. This method provides strong support for crop monitoring and management in practical applications.