Data Assimilation using ERA5, ASOS, and the U-STN model for Weather Forecasting over the UK

Wenqi Wang,Jacob Bieker,Rossella Arcucci,César Quilodrán-Casas
2024-01-15
Abstract:In recent years, the convergence of data-driven machine learning models with Data Assimilation (DA) offers a promising avenue for enhancing weather forecasting. This study delves into this emerging trend, presenting our methodologies and outcomes. We harnessed the UK's local ERA5 850 hPa temperature data and refined the U-STN12 global weather forecasting model, tailoring its predictions to the UK's climate nuances. From the ASOS network, we sourced T2m data, representing ground observations across the UK. We employed the advanced kriging method with a polynomial drift term for consistent spatial resolution. Furthermore, Gaussian noise was superimposed on the ERA5 T850 data, setting the stage for ensuing multi-time step synthetic observations. Probing into the assimilation impacts, the ASOS T2m data was integrated with the ERA5 T850 dataset. Our insights reveal that while global forecast models can adapt to specific regions, incorporating atmospheric data in DA significantly bolsters model accuracy. Conversely, the direct assimilation of surface temperature data tends to mitigate this enhancement, tempering the model's predictive prowess.
Atmospheric and Oceanic Physics,Machine Learning
What problem does this paper attempt to address?
This paper discusses how to improve the accuracy of weather forecasting in the UK by utilizing data assimilation (DA) techniques combined with ERA5, ASOS (Automatic Surface Observation System) data, and the U-STN model. In the study, the authors integrate machine learning models with data assimilation methods to enhance the adaptability of global weather forecast models to specific regional climate characteristics. They simulate multi-time-step observation data by adding Gaussian noise and compare the impacts of different data sources (including ground observation data and synthetic observation data) on the data assimilation process. The research finds that incorporating atmospheric data (such as the 850 hPa temperature from ERA5) into DA can significantly improve model accuracy, while directly assimilating surface temperature data (such as the 2-meter temperature data from ASOS) may weaken this enhancement and reduce the predictive ability of the model. This could be due to the mismatch between ground and atmospheric data and variations in data distribution characteristics. The paper proposes three improvement directions: optimizing data sources and interpolation methods to better match temperature data at different levels, adjusting model hyperparameters to enhance performance, and combining ground observation data with models that utilize multiple layers of atmospheric pressure levels (e.g., graph neural networks) to reduce interpolation errors and improve prediction accuracy. In conclusion, the paper aims to address how to optimize weather forecast models for the UK region using data assimilation and machine learning. It experimentally reveals the differences in the effects of different types of observation data in data assimilation and proposes strategies for future model performance improvements.