High performance machine learning approach for reference evapotranspiration estimation

Mohammed S. Aly,Saad M. Darwish,Ahmed A. Aly
DOI: https://doi.org/10.1007/s00477-023-02594-y
IF: 3.821
2023-11-04
Stochastic Environmental Research and Risk Assessment
Abstract:Abstract Accurate reference evapotranspiration (ET 0 ) estimation has an effective role in reducing water losses and raising the efficiency of irrigation water management. The complicated nature of the evapotranspiration process is illustrated in the amount of meteorological variables required to estimate ET 0 . Incomplete meteorological data is the most significant challenge that confronts ET 0 estimation. For this reason, different machine learning techniques have been employed to predict ET 0 , but the complicated structures and architectures of many of them make ET 0 estimation very difficult. For these challenges, ensemble learning techniques are frequently employed for estimating ET 0 , particularly when there is a shortage of meteorological data. This paper introduces a powerful super learner ensemble technique for ET 0 estimation, where four machine learning models: Extra Tree Regressor, Support Vector Regressor, K-Nearest Neighbor and AdaBoost Regression represent the base learners and their outcomes used as training data for the meta learner. Overcoming the overfitting problem that affects most other ensemble methods is a significant advantage of this cross-validation theory-based approach. Super learner performances were compared with the base learners for their forecasting capabilities through different statistical standards, where the results revealed that the super learner has better accuracy than the base learners, where different combinations of variables have been used whereas Coefficient of Determination (R 2 ) ranged from 0.9279 to 0.9994 and Mean Squared Error (MSE) ranged from 0.0026 to 0.3289 mm/day but for the base learners R 2 ranged from 0.5592 to 0.9977, and MSE ranged from 0.0896 to 2.0118 mm/day therefore, super learner is highly recommended for ET 0 prediction with limited meteorological data.
environmental sciences,engineering, environmental,water resources, civil,statistics & probability
What problem does this paper attempt to address?