An Improved GEV Boosting Method for Imbalanced Data Classification with Application to Short-Term Rainfall Prediction

Shuaida He, Zhouping Li, Xinwei Liu
DOI: https://doi.org/10.1016/j.jhydrol.2022.128882
IF: 6.4
2023-01-01
Journal of Hydrology
Abstract: This paper considers the imbalanced binary classification problem, focusing on short-term rainfall forecasting in arid and semi-arid regions. Specifically, we present a novel boosting-type method that uses the generalized extreme value (GEV) distribution as the link function and applies a gradient tree boosting algorithm to capture complex interactions among covariates. The proposed method has several appealing advantages: it can identify rare rainfall events while quantifying the associated uncertainties; it is data-driven, requiring no assumption on the relationship between the covariates and the rainfall event; and the fitted model is highly interpretable, making it a useful tool for studying rainfall mechanisms in arid and semi-arid regions. Experiments on two real-world datasets show that our approach outperforms competing methods.
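To make the idea of a GEV link concrete, the following is a minimal sketch rather than the authors' implementation. The abstract does not state the parametrization, so the sketch assumes one commonly used form in which the event probability is 1 - F(-eta; xi), where F is the standard GEV distribution function, eta is the model score (here it would come from the boosted trees) and xi is the shape parameter. The function names `gev_cdf` and `gev_response_prob` and the example values are hypothetical.

```python
import math


def gev_cdf(x: float, xi: float) -> float:
    """Standard GEV distribution function F(x; xi) = exp(-(1 + xi*x)^(-1/xi)).

    The xi -> 0 limit is the Gumbel distribution exp(-exp(-x)).
    Not hardened against overflow for extreme inputs.
    """
    if abs(xi) < 1e-12:
        return math.exp(-math.exp(-x))
    t = 1.0 + xi * x
    if t <= 0.0:
        # Outside the support: below the lower bound when xi > 0,
        # above the upper bound when xi < 0.
        return 0.0 if xi > 0 else 1.0
    return math.exp(-(t ** (-1.0 / xi)))


def gev_response_prob(eta: float, xi: float) -> float:
    """Assumed GEV link: P(event | eta) = 1 - F(-eta; xi)."""
    return 1.0 - gev_cdf(-eta, xi)


# Unlike the logistic link, the response curve is asymmetric around eta = 0,
# and the shape parameter xi controls how skewed it is.
for eta in (-2.0, 0.0, 2.0):
    logistic = 1.0 / (1.0 + math.exp(-eta))
    print(eta, round(logistic, 4), round(gev_response_prob(eta, -0.3), 4))
```

The asymmetry is the reason a skewed link of this kind is attractive for rare events such as rainfall in arid regions: the probability can approach 0 and 1 at different rates, which a symmetric logistic link cannot express.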