Transformer-enhanced spatiotemporal neural network for post-processing of precipitation forecasts

Mingheng Jiang, Bin Weng, Jiazhen Chen, Tianqiang Huang, Feng Ye, Lijun You
DOI: https://doi.org/10.1016/j.jhydrol.2024.130720
IF: 6.4
2024-01-27
Journal of Hydrology
Abstract: Numerical Weather Prediction (NWP) models are extensively utilized worldwide and have played a pivotal role in weather forecasting. Precipitation is subject to various intricate factors, rendering it one of the most challenging variables to predict. Additional post-processing steps are required to reduce biases and achieve reliable predictions for precipitation-related decision-making. In this work, we propose a transformer-enhanced spatiotemporal neural network called TransLSTMUNet for short- and medium-range precipitation post-processing. Firstly, TransLSTMUNet employs convolutional operators to extract localized meteorological features. Secondly, it capitalizes on the transformer architecture to enrich these extracted features with a broader, global perspective of spatial information. Thirdly, TransLSTMUNet leverages ConvLSTM to further enhance the features with temporal information. Furthermore, to address the challenges posed by the imbalanced distribution of precipitation intensity, we design a novel loss function called quantile weighted mean squared error (QWMSE). QWMSE simultaneously considers both normal and intense precipitation during the model's training phase. In the experiments, the THORPEX Interactive Grand Global Ensemble (TIGGE) dataset provided by the European Centre for Medium-Range Weather Forecasts (ECMWF) is used as input for post-processing the precipitation forecasts. The experiments show that the precipitation forecasts post-processed by TransLSTMUNet exhibit the best overall performance compared with eight post-processing baselines. TransLSTMUNet also significantly improves on the raw TIGGE forecasts. Specifically, it enhances the accuracy (ACC) metric by 12.14 % and increases the threat scores (TS) for 24-hour accumulated precipitation at thresholds of 0.1 mm, 10.0 mm, 25.0 mm, and 50.0 mm by 8.30 %, 9.77 %, 31.60 %, and 51.25 %, respectively.
By effectively integrating the strengths of convolutional and transformer methodologies, the proposed TransLSTMUNet model offers a novel approach for post-processing precipitation forecasting. This model design has the potential to inspire various other research avenues within the hydrological domain and beyond.
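The threat score (TS) reported in the abstract is commonly defined as hits / (hits + misses + false alarms) for a given precipitation threshold. The following is a minimal sketch of that standard definition, not the authors' evaluation code. The function name `threat_score`, the sample values, and the choice to count an event when the amount is greater than or equal to the threshold are assumptions made for illustration.

```python
def threat_score(forecast, observed, threshold):
    """Threat score (critical success index) for one precipitation threshold.

    An "event" here means accumulated precipitation >= threshold (assumed
    convention; the abstract does not state the comparison used).
    """
    hits = misses = false_alarms = 0
    for f, o in zip(forecast, observed):
        f_event = f >= threshold
        o_event = o >= threshold
        if f_event and o_event:
            hits += 1          # forecast and observation both exceed threshold
        elif o_event:
            misses += 1        # observed event that was not forecast
        elif f_event:
            false_alarms += 1  # forecast event that did not occur
    denom = hits + misses + false_alarms
    # Correct negatives are ignored; TS is undefined if no events occur at all.
    return hits / denom if denom else float("nan")


# Hypothetical 24-hour accumulations in mm at six grid points (made-up values).
forecast = [0.0, 12.0, 30.0, 5.0, 60.0, 0.2]
observed = [0.0, 15.0, 8.0, 11.0, 55.0, 0.0]

# Thresholds taken from the abstract: 0.1, 10.0, 25.0, and 50.0 mm.
scores = {t: threat_score(forecast, observed, t) for t in (0.1, 10.0, 25.0, 50.0)}
for t, s in scores.items():
    print(f"TS at {t} mm: {s:.2f}")
```

Because TS ignores correct negatives, it stays informative for rare heavy-precipitation events, which is why improvements at the 25.0 mm and 50.0 mm thresholds are reported separately from overall accuracy.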
Geosciences, Multidisciplinary; Water Resources; Engineering, Civil
What problem does this paper attempt to address?