Interpolation of mountain weather forecasts by machine learning

Kazuma Iwase, Tomoyuki Takenawa
2024-08-14
Abstract: Recent advances in numerical simulation methods based on physical models, and their combination with machine learning, have improved the accuracy of weather forecasts. However, accuracy decreases in complex terrain such as mountainous regions, because these methods usually use grid cells several kilometers on a side and simple machine learning models. While deep learning has also made significant progress in recent years, it is difficult to apply directly in a way that makes use of the physical knowledge built into the simulations. This paper proposes a method that uses machine learning to interpolate future weather in mountainous regions from forecast data for the surrounding plains and past observed data, with the aim of improving weather forecasts in mountainous regions. We focus on mountainous regions in Japan and predict temperature and precipitation, mainly using LightGBM as the machine learning model. Despite the use of a small dataset, through feature engineering and model tuning, our method partially achieves improvements in RMSE with significantly less training time.
Atmospheric and Oceanic Physics, Machine Learning
What problem does this paper attempt to address?
The paper aims to address the low accuracy of weather forecasts in mountainous regions. Existing Numerical Weather Prediction (NWP) methods lose accuracy in complex terrain such as mountainous areas, because they typically use grid cells several kilometers in size and combine them with simple machine learning models, neither of which captures mountain conditions well. To solve this problem, the authors propose a method that uses machine learning to "interpolate" future weather in mountainous regions. The method consists of the following steps:

1. **Data Source**: The study focuses on predicting the temperature at the summit of Mount Fuji and the precipitation in the Hakone area in the Kanto region of Japan. It uses data provided by the Japan Meteorological Agency, including meteorological observations and forecast data for the surrounding plain areas.
2. **Model Selection**: LightGBM is used as the machine learning model for prediction. The model takes current observations, forecast data for the surrounding plain areas, and past observations as inputs to improve the accuracy of weather forecasts in mountainous regions.
3. **Feature Engineering**: The features include past observational data, future forecast data (only from surrounding areas), and time series features such as differencing and moving averages.
4. **Loss Function**: In addition to Mean Squared Error (MSE), a linear combination of binary cross-entropy and MSE was also tried as the loss function, to reduce prediction bias in non-rainfall situations.

Experimental results show that for temperature prediction at Mount Fuji, the proposed method significantly outperforms existing weather forecasting services; for precipitation prediction in the Hakone area, it also performs well at certain times.
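The time series features named in the Feature Engineering step (differencing and moving averages) can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names, the lag of 1, the window of 3, the feature names, and the sample temperatures are all assumptions made up for illustration, and only the Python standard library is used.

```python
from statistics import fmean


def difference(series, lag=1):
    """Return series[t] - series[t - lag]; the first `lag` entries are None."""
    head = [None] * lag
    tail = [series[t] - series[t - lag] for t in range(lag, len(series))]
    return head + tail


def moving_average(series, window=3):
    """Trailing mean over the last `window` values; None until the window is full."""
    out = []
    for t in range(len(series)):
        if t + 1 < window:
            out.append(None)
        else:
            out.append(fmean(series[t + 1 - window:t + 1]))
    return out


def build_features(observed, plain_forecast, lag=1, window=3):
    """Build one feature row per time step from mountain observations
    and a forecast series for a nearby plain (hypothetical layout)."""
    diff = difference(observed, lag)
    avg = moving_average(observed, window)
    return [
        {
            "obs": observed[t],
            "obs_diff": diff[t],
            "obs_ma": avg[t],
            "plain_forecast": plain_forecast[t],
        }
        for t in range(len(observed))
    ]


# Hypothetical hourly temperatures in degrees Celsius (not real data).
summit_obs = [-5.0, -4.0, -2.0, -3.0, -1.0]
plain_fc = [12.0, 13.0, 15.0, 14.0, 16.0]
features = build_features(summit_obs, plain_fc)
```

Both helpers look only backward in time, so each row uses values that would already be known at that step. The leading `None` entries mark steps where the lag or window is not yet filled; they would need to be dropped or passed to the model as missing values.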
Additionally, the study explores the impact of feature extraction on training time and overfitting, as well as the effect of different loss function choices on prediction results. Overall, the method can improve the accuracy of weather forecasts in mountainous regions to a certain extent.
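To make the loss function comparison concrete, the following is a rough sketch of a linear combination of binary cross-entropy and MSE. The summary does not say how the paper weights the two terms or how a rain probability is obtained, so the weight `alpha`, the `rain_threshold`, and the separate `pred_rain_prob` input are assumptions of this sketch, and the sample values are invented.

```python
import math


def mse(y_true, y_pred):
    """Mean squared error between observed and predicted amounts."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_pred)) / len(y_true)


def binary_cross_entropy(labels, probs, eps=1e-12):
    """Mean binary cross-entropy; probabilities are clipped to avoid log(0)."""
    total = 0.0
    for label, prob in zip(labels, probs):
        prob = min(max(prob, eps), 1.0 - eps)
        total -= label * math.log(prob) + (1 - label) * math.log(1.0 - prob)
    return total / len(labels)


def combined_loss(rain_mm, pred_mm, pred_rain_prob, alpha=0.5, rain_threshold=0.0):
    """alpha * BCE(rain / no rain) + (1 - alpha) * MSE(amount).

    Assumption: an hour counts as rainy when the observed amount
    exceeds `rain_threshold`.
    """
    labels = [1 if y > rain_threshold else 0 for y in rain_mm]
    bce_term = binary_cross_entropy(labels, pred_rain_prob)
    mse_term = mse(rain_mm, pred_mm)
    return alpha * bce_term + (1.0 - alpha) * mse_term


# Hypothetical hourly precipitation in millimeters (not real data).
rain_mm = [0.0, 0.0, 2.0, 5.0]
pred_mm = [0.0, 1.0, 2.0, 4.0]
pred_rain_prob = [0.1, 0.4, 0.8, 0.9]
loss = combined_loss(rain_mm, pred_mm, pred_rain_prob, alpha=0.5)
```

With `alpha=0` this reduces to plain MSE. Raising `alpha` puts more weight on separating dry hours from rainy ones, which is the kind of non-rainfall bias the combined loss is meant to address.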