WeatherFormer: A Pretrained Encoder Model for Learning Robust Weather Representations from Small Datasets

Adib Hasan,Mardavij Roozbehani,Munther Dahleh
2024-05-23
Abstract:This paper introduces WeatherFormer, a transformer encoder-based model designed to learn robust weather features from minimal observations. It addresses the challenge of modeling complex weather dynamics from small datasets, a bottleneck for many prediction tasks in agriculture, epidemiology, and climate science. WeatherFormer was pretrained on a large pretraining dataset comprised of 39 years of satellite measurements across the Americas. With a novel pretraining task and fine-tuning, WeatherFormer achieves state-of-the-art performance in county-level soybean yield prediction and influenza forecasting. Technical innovations include a unique spatiotemporal encoding that captures geographical, annual, and seasonal variations, adapting the transformer architecture to continuous weather data, and a pretraining strategy to learn representations that are robust to missing weather features. This paper for the first time demonstrates the effectiveness of pretraining large transformer encoder models for weather-dependent applications across multiple domains.
Computer Vision and Pattern Recognition,Artificial Intelligence,Machine Learning,Atmospheric and Oceanic Physics
What problem does this paper attempt to address?
This paper introduces a pre-training model called WEATHER FORMER, which aims to learn robust weather features from a small amount of observations. In order to address the challenges of modeling complex weather dynamics in fields such as agriculture, epidemiology, and climate science with small data sets, this model is pre-trained based on satellite measurements and can handle continuous weather data while capturing geographical, yearly, and seasonal variations. WEATHER FORMER achieves state-of-the-art performance in soybean yield prediction and influenza prediction in New York City, and its potential can be applied to multiple domains such as crop yield, disease outbreaks, and environmental phenomenon prediction. The paper also proposes a new pre-training strategy to address the issue of pre-training large models on continuous weather data. By pre-training the model on large-scale meteorological datasets and then fine-tuning on small-scale datasets, the predictive accuracy of downstream tasks can be improved.