GPTCast: a weather language model for precipitation nowcasting

Gabriele Franch,Elena Tomasi,Rishabh Wanjari,Virginia Poli,Chiara Cardinali,Pier Paolo Alberoni,Marco Cristoforetti

2024-09-24

Abstract:This work introduces GPTCast, a generative deep-learning method for ensemble nowcast of radar-based precipitation, inspired by advancements in large language models (LLMs). We employ a GPT model as a forecaster to learn spatiotemporal precipitation dynamics using tokenized radar images. The tokenizer is based on a Quantized Variational Autoencoder featuring a novel reconstruction loss tailored for the skewed distribution of precipitation that promotes faithful reconstruction of high rainfall rates. The approach produces realistic ensemble forecasts and provides probabilistic outputs with accurate uncertainty estimation. The model is trained without resorting to randomness, all variability is learned solely from the data and exposed by model at inference for ensemble generation. We train and test GPTCast using a 6-year radar dataset over the Emilia-Romagna region in Northern Italy, showing superior results compared to state-of-the-art ensemble extrapolation methods.

Machine Learning,Atmospheric and Oceanic Physics

What problem does this paper attempt to address?

The paper aims to address the problem of short-term precipitation forecasting (nowcasting), particularly for early warnings of water-related disasters such as floods and landslides caused by sudden precipitation. Traditional methods like the Lagrangian extrapolation have limitations in handling convective precipitation, while Numerical Weather Prediction (NWP) models can predict the probability and intensity of precipitation over larger areas but have limited accuracy on smaller geographic and temporal scales. Therefore, this paper proposes a new method called GPTCast, which is a generative deep learning approach that applies large language model (LLM) techniques from the natural language processing domain to the tokenization and spatiotemporal prediction of radar precipitation images. Specifically, GPTCast utilizes a tokenizer based on Vector Quantized Variational Autoencoder (VQ-VAE) to learn how to convert radar images into discrete representations and employs an improved loss function (MWAE) to better reconstruct high rainfall rate scenarios. Then, the method uses an autoregressive model similar to the GPT-2 architecture for spatiotemporal forecasting, thereby generating reliable ensemble forecasts and providing probabilistic outputs with accurate uncertainty estimates. In this way, GPTCast can learn all variations from the data without introducing randomness and generate ensemble forecasts during inference. Experimental results show that GPTCast outperforms existing state-of-the-art ensemble extrapolation methods on a 6-year radar dataset from the Emilia-Romagna region in Italy.

GPTCast: a weather language model for precipitation nowcasting

Generative Modeling of High-resolution Global Precipitation Forecasts

Precipitation nowcasting with generative diffusion models

Precipitation Nowcasting Using Physics Informed Discriminator Generative Models

Skilful precipitation nowcasting using deep generative models of radar

Self-clustered GAN for precipitation nowcasting

Exploring Geometric Deep Learning For Precipitation Nowcasting

Extreme Precipitation Nowcasting using Transformer-based Generative Models

CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling

Global Precipitation Nowcasting of Integrated Multi-satellitE Retrievals for GPM: A U-Net Convolutional LSTM Architecture

A Spatio-Temporal Graph-Guided Convolutional LSTM for Tropical Cyclones Precipitation Nowcasting.

Skilful Precipitation Nowcasting Using NowcastNet

Integrating Spatio-Temporal and Generative Adversarial Networks for Enhanced Nowcasting Performance

CLGAN: a generative adversarial network (GAN)-based video prediction model for precipitation nowcasting

Convcast: An embedded convolutional LSTM based architecture for precipitation nowcasting using satellite data

Precipitaion Nowcasting using Deep Neural Network

PrecipLSTM: A Meteorological Spatiotemporal LSTM for Precipitation Nowcasting

DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting

A conditional Generative Adversarial network model for the Weather4Cast 2024 Challenge

Precipitation Nowcasting: Leveraging bidirectional LSTM and 1D CNN