A Novel Fusion-Based Methodology for Drought Forecasting

Huihui Zhang, Hugo A. Loaiciga, Tobias Sauter
DOI: https://doi.org/10.3390/rs16050828
IF: 5
2024-02-29
Remote Sensing
Abstract: Accurate drought forecasting is necessary for effective agricultural and water resource management and for early risk warning. Various machine learning models have been developed for drought forecasting. This work developed and tested a fusion-based ensemble model, namely, the stacking (ST) model, that integrates extreme gradient boosting (XGBoost), random forest (RF), and light gradient boosting machine (LightGBM) for drought forecasting. Additionally, the ST model employs the SHapley Additive exPlanations (SHAP) algorithm to interpret the relationship between variables and forecasting results. Multi-source data that encompass meteorological, vegetation, anthropogenic, landcover, climate teleconnection patterns, and topological characteristics were incorporated in the proposed ST model. The ST model forecasts the one-month lead standardized precipitation evapotranspiration index (SPEI) at a 12-month scale. The proposed ST model was applied and tested in the German federal states of Brandenburg and Berlin. The results show that the ST model outperformed the reference persistence model, XGBoost, RF, and LightGBM, achieving an average coefficient of determination (R²) value of 0.845 in each month in 2018. The spatiotemporal Moran's I method indicates that the ST model captures non-stationarity in modeling the statistical association between predictors and the meteorological drought index and outperforms the other three models (i.e., XGBoost, RF, and LightGBM). Global sensitivity analysis indicates that the ST model is influenced by a combination of environmental variables, with the most sensitive being the preceding drought indices. The accuracy and versatility of the ST model indicate that this is a promising approach for forecasting drought and other environmental phenomena.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
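The abstract compares the ST model against a reference persistence model and reports skill as a coefficient of determination (R²). As a rough illustration of what such a baseline involves, the sketch below forecasts each month's index as the previous month's value and scores it with R² = 1 - SS_res / SS_tot. This is a minimal sketch under stated assumptions: the paper's exact persistence setup and R² definition are not given here, the function names are hypothetical, and the SPEI values are invented for illustration only.

```python
def persistence_forecast(series):
    """One-month-lead persistence: the forecast for month t+1 is the value at month t.

    Returns forecasts aligned with series[1:].
    """
    return list(series[:-1])


def r_squared(observed, predicted):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical monthly SPEI-12 values for one grid cell (not data from the paper).
spei_12 = [-0.5, -0.7, -1.0, -1.2, -1.1, -1.4]

forecast = persistence_forecast(spei_12)  # forecasts for months 2..6
observed = spei_12[1:]                    # what actually happened in months 2..6
baseline_r2 = r_squared(observed, forecast)
print(f"persistence R^2 = {baseline_r2:.3f}")
```

Under this definition R² can be zero or negative when the baseline does no better than predicting the observed mean, which is why a persistence reference is a useful floor for judging a more complex model.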
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

This paper addresses meteorological drought prediction. Specifically:

1. **Research Background**:
   - Drought is a natural disaster that occurs in almost all regions of the world. In Europe in particular, severe droughts can have significant impacts on agriculture, society, and ecosystems.
   - Accurate drought forecasting is crucial for effective agricultural and water resource management and for early risk warning.
2. **Research Objectives**:
   - Develop a stacking method that integrates multiple machine learning models for meteorological drought prediction.
   - Improve prediction accuracy by integrating three algorithms: Extreme Gradient Boosting (XGBoost), Random Forest (RF), and Light Gradient Boosting Machine (LightGBM).
   - Use the SHapley Additive exPlanations (SHAP) algorithm to explain the relationship between variables and prediction results.
   - Combine multi-source data (meteorological, vegetation, human activity, land cover, climate teleconnection patterns, and topographical features) for prediction.
   - Predict the Standardized Precipitation Evapotranspiration Index (SPEI) at a 12-month scale with a lead time of one month, tested in the German federal states of Brandenburg and Berlin.
3. **Main Contributions**:
   - The proposed stacking model (ST model) achieved an average coefficient of determination (R²) value of 0.845 in 2018, outperforming the reference persistence model and the individual models (XGBoost, RF, and LightGBM).
   - Global sensitivity analysis indicates that the ST model is influenced by a combination of environmental variables, with the preceding drought indices being the most sensitive factor.
   - The accuracy and versatility of the model suggest that this is a promising approach for predicting drought and other environmental phenomena.