Hybrid machine learning models for groundwater level prediction in a snow‐dominated region: An evaluation of EEMD, VMD and EWT decomposition techniques

Kadir Gezici,Okan Mert Katipoğlu,Selim Şengül
DOI: https://doi.org/10.1002/hyp.15169
IF: 3.784
2024-05-19
Hydrological Processes
Abstract:This study aims to investigate the variations in GWLs in a mountainous and snow‐dominated basin and select the most suitable model and methodology for GWL estimation using different machine learning and signal decomposition methods. All data decomposition approaches (VMD, EEMD and EWT) used in the study significantly improved the accuracy of GWL prediction by the ELM algorithm. The findings of the study can serve as a valuable tool for decision‐makers and policymakers, enabling them to make informed decisions regarding aquifer monitoring, irrigation planning and the effective management of water resources, particularly at the regional level. Water scarcity is a pressing issue, intensified by factors such as population growth and industrialization. Hence, it is crucial to monitor, conserve and analyse groundwater resources, which are essential sources of clean and usable water. This study examines changes in groundwater levels (GWLs) in northeastern Turkey's mountainous and snow‐covered area. The primary objective is to assess the effectiveness of integrated machine learning models, specifically, the extreme learning machine (ELM) technique combined with signal decomposition techniques such as ensemble empirical mode decomposition (EEMD), variational mode decomposition (VMD) and empirical wavelet transform (EWT) for monthly GWL prediction models. Seventy percent of the accessible data is allocated for training, while 30% is designated for testing. A correlation matrix involving precipitation, temperature, relative humidity and GWL parameters is generated with inputs that possess significant correlations being selected, such as GWLt−1, GWLt−2, RHt, RHt−1 and RHt−2. To evaluate model results, various metrics, including mean squared error, mean absolute error, mean absolute percentage error, mean bias error, bias factor, determination coefficient, Nash‐Sutcliffe efficiency, as well as tools such as box plots, Taylor diagrams and radar charts, are utilized to compare outcomes during the interpretation phase. The results of the analyses show that applying data decomposition methods such as EEMD, VMD and EWT significantly improves the performance of the ELM algorithm in predicting GWLs. VMD‐ELM is the most accurate for GWL forecasting among the approaches examined. The R2 values of the most successful models established in the two wells are 0.993 and 0.905. The outcomes of this research hold significance for decision‐makers and policymakers as it offers informative insights into aquifer surveillance, irrigation strategizing and efficient administration of water resources.
water resources
What problem does this paper attempt to address?