Abstract: Data‐driven approaches, most prominently deep learning, have become powerful tools for prediction in many domains. A natural question is whether data‐driven methods could also be used to predict global weather patterns days in advance. First studies show promise, but the lack of a common data set and evaluation metrics makes intercomparison between studies difficult. Here we present a benchmark data set for data‐driven medium‐range weather forecasting (specifically 3–5 days), a topic of high scientific interest for atmospheric and computer scientists alike. We provide data derived from the ERA5 archive that have been processed to facilitate their use in machine learning models. We propose simple and clear evaluation metrics that will enable a direct comparison between different methods. Further, we provide baseline scores from simple linear regression techniques, deep learning models, and purely physical forecasting models. The data set is publicly available at https://github.com/pangeo-data/WeatherBench, and the companion code is reproducible and comes with tutorials for getting started. We hope that this data set will accelerate research in data‐driven weather forecasting.

WeatherBench provides a new benchmark to test data‐driven approaches to weather forecasting. Traditional weather models are based on the discretized equations governing the atmosphere. They perform very well for many tasks but are still found lacking for some others. Data‐driven approaches, such as deep learning, learn directly from the best available observations and could potentially produce better forecasts. In this paper, we define a benchmark task, predicting pressure and temperature across the globe 3 and 5 days ahead, which will hopefully lead to progress in data‐driven weather prediction and foster collaboration across disciplines.
Benchmarks with strong baselines are a key ingredient for rapid progress on a problem. Here, we define a benchmark for data‐driven global, medium‐range weather prediction. The data are processed for convenient use in machine learning models, and a quickstart guide is provided.

WeatherBench: A Benchmark Data Set for Data‐Driven Weather Forecasting
