A Deep Learning Approach to Probabilistic Forecasting of Weather

Nick Rittler,Carlo Graziani,Jiali Wang,Rao Kotamarthi
DOI: https://doi.org/10.48550/arXiv.2203.12529
2022-03-25
Abstract:We discuss an approach to probabilistic forecasting based on two chained machine-learning steps: a dimensional reduction step that learns a reduction map of predictor information to a low-dimensional space in a manner designed to preserve information about forecast quantities; and a density estimation step that uses the probabilistic machine learning technique of normalizing flows to compute the joint probability density of reduced predictors and forecast quantities. This joint density is then renormalized to produce the conditional forecast distribution. In this method, probabilistic calibration testing plays the role of a regularization procedure, preventing overfitting in the second step, while effective dimensional reduction from the first step is the source of forecast sharpness. We verify the method using a 22-year 1-hour cadence time series of Weather Research and Forecasting (WRF) simulation data of surface wind on a grid.
Machine Learning,Atmospheric and Oceanic Physics
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to address the issue of probabilistic forecasting in weather prediction. Specifically, the authors propose a deep learning-based approach to achieve probabilistic forecasting through two consecutive machine learning steps: 1. **Dimensionality Reduction Step**: First, a dimensionality reduction mapping is learned to map high-dimensional predictive information to a low-dimensional space while retaining as much information about the forecast quantity as possible. 2. **Density Estimation Step**: Then, using the probabilistic machine learning technique of Normalizing Flows, the joint probability density of the reduced-dimensional predictive information and the forecast quantity is calculated. By re-normalizing this joint density, the conditional forecast distribution is obtained. ### Main Challenges 1. **High-Dimensional Data Processing**: Weather forecast data typically has high dimensionality, making direct probabilistic forecasting computationally challenging. 2. **Information Retention**: During the dimensionality reduction process, it is necessary to retain as much information related to the forecast quantity as possible. 3. **Model Calibration**: The forecast model needs to be calibrated to ensure that its probability distribution accurately reflects the uncertainty in the actual data. ### Method Overview 1. **Dimensionality Reduction Step**: - A neural network is used to learn a dimensionality reduction mapping \( T \), which maps high-dimensional predictive information \( X \) to a low-dimensional representation \( T(X) \). - The optimal dimensionality reduction mapping \( T \) is selected by maximizing the mutual information \( I(Y; T(X)) \). 2. **Density Estimation Step**: - The joint distribution \( P(Y, T(X)) \) is modeled using the Normalizing Flows technique. - By re-normalizing the joint distribution, the conditional forecast distribution \( P(Y | T(X)) \) is obtained. ### Validation Method The authors validated the proposed method using a 22-year, hourly time series dataset of surface wind speed grid data simulated by the Weather Research and Forecasting (WRF) model. ### Conclusion The method, through effective dimensionality reduction and non-parametric density estimation, can improve forecast accuracy while maintaining model calibration. This provides a new effective approach for probabilistic forecasting in weather prediction.