Self-Supervised Pre-Training for Precipitation Post-Processor

Sojung An,Junha Lee,Jiyeon Jang,Inchae Na,Wooyeon Park,Sujeong You
2024-02-20
Abstract:Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation post-processor consists of (i) employing self-supervised pre-training, where the parameters of the encoder are pre-trained on the reconstruction of the masked variables of the atmospheric physics domain; and (ii) conducting transfer learning on precipitation segmentation tasks (the target domain) from the pre-trained encoder. In addition, we introduced a heuristic labeling approach to effectively train class-imbalanced datasets. Our experiments on precipitation correction for regional NWP show that the proposed method outperforms other approaches.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Increasing lead time for precipitation forecasts**: To prevent hazardous weather events, it is crucial to obtain sufficient lead time for local precipitation forecasts. Climate change caused by global warming has increased the difficulty of accurately predicting severe precipitation events, such as heavy rain. 2. **Addressing data imbalance issues**: Data imbalance is common in precipitation measurement data, especially in the prediction of extreme weather events like heavy rainfall. The extreme imbalance of samples in the dataset poses a challenge for precipitation forecasting. To address these issues, the researchers propose a deep learning-based post-processing method for precipitation, which includes: - **Self-supervised pre-training**: Pre-training encoder parameters by masking variables in the atmospheric physics domain. - **Transfer learning**: Applying the pre-trained encoder to the precipitation segmentation task (target domain). - **Heuristic labeling method**: Introducing a continuous labeling method to effectively train the class-imbalanced dataset, reducing uncertainty by smoothing probability values. Experimental results show that the proposed method outperforms other methods in precipitation correction for regional numerical weather prediction (NWP) models.