Ensemble tree model prediction of summer precipitation in north china based on predictor selection strategy

Kai Wang,Shujuan Hu,Deqian Li,Jianjun Peng,Zihan Hao,Wenping He,Zhihai Zheng
DOI: https://doi.org/10.1007/s00382-024-07223-0
IF: 4.901
2024-04-14
Climate Dynamics
Abstract:Selection of predictors is a key issue in using machine learning (ML) models to perform short-term climate prediction, and it is also one of the main constraints on improved model prediction skills. To investigate this problem, three tree models (Random Forest (RF), extreme gradient boosting (XGBoost), and adaptive boosting (AdaBoost)) along with a novel predictor screening strategy to forecast summer precipitation in North China. We firstly confirmed that the prediction results obtained using the predictor screening strategy outperformed those based on the feature importance ranking within ML itself. The correlation coefficients between the predicted values and the observed values were 0.75 and 0.47 for the two predictor screening schemes, respectively. Subsequently, through comparison with both the Beijing Climate Center Climate System Model (BCC_CSM) and the forecast system on dynamic and analogue skills (FODAS), developed by the National Climate Center (NCC) of China, it was determined that the prediction results of the ensemble tree model based on the presented predictor screening strategy were significantly superior than those obtained using either the BCC_CSM or the FODAS. During the test period, the ensemble tree model exhibited a correlation coefficient of 0.75 between predictions and observations, surpassing the correlation coefficients of BCC_CSM (0.27) and FODAS (0.54). The results demonstrate that the predictor screening strategy proposed in this study has potential application value for improving the prediction skill of RF, XGBoost, and AdaBoost models, and highlight that predictor selection is the key procedure for improved ML models.
meteorology & atmospheric sciences
What problem does this paper attempt to address?