An ensemble of data-driven weather prediction models for operational sub-seasonal forecasting

Jonathan A. Weyn,Divya Kumar,Jeremy Berman,Najeeb Kazmi,Sylwester Klocek,Pete Luferenko,Kit Thambiratnam
2024-03-23
Abstract:We present an operations-ready multi-model ensemble weather forecasting system which uses hybrid data-driven weather prediction models coupled with the European Centre for Medium-range Weather Forecasts (ECMWF) ocean model to predict global weather at 1-degree resolution for 4 weeks of lead time. For predictions of 2-meter temperature, our ensemble on average outperforms the raw ECMWF extended-range ensemble by 4-17%, depending on the lead time. However, after applying statistical bias corrections, the ECMWF ensemble is about 3% better at 4 weeks. For other surface parameters, our ensemble is also within a few percentage points of ECMWF's ensemble. We demonstrate that it is possible to achieve near-state-of-the-art subseasonal-to-seasonal forecasts using a multi-model ensembling approach with data-driven weather prediction models.
Atmospheric and Oceanic Physics,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the following key issues: 1. **Extending Forecast Time**: Many existing data-driven weather forecasting models have not been tested on the sub-seasonal to seasonal (S2S) time scales of 2 to 6 weeks. Although these time scales are crucial for applications such as agriculture and risk management, existing models may not generalize well. 2. **Probabilistic Forecasting**: Existing models primarily aim for the accuracy of deterministic forecasts, while probabilistic forecasts are often more useful for decision-making. Although some papers have developed ensemble models, these models use different methods and have not been compared. 3. **Training Loss Function**: Data-driven weather models trained with regression loss functions produce overly smooth deterministic forecasts, limiting the model's ability to capture various weather phenomena. This makes it difficult to evaluate the actual skill of different models through metrics optimized by ensemble forecasts, such as Root Mean Square Error (RMSE). 4. **Combining Model Architecture Advantages**: Data-driven weather forecasting models have always sought to be the most accurate, but no method has been proposed to leverage the advantages of various model architectures or to combine data-driven and traditional Numerical Weather Prediction (NWP) models. To address these issues, the paper proposes an ensemble of data-driven weather prediction models for operational sub-seasonal forecasting. This ensemble model has been operationalized, tested for extended forecast times, and provides probabilistic forecasts. Additionally, multiple models based on various architectures were trained to demonstrate the feasibility of a multi-model data-driven approach, and a hybrid ensemble method combining data-driven and traditional NWP models was proposed.