Evaluation of the support vector regression (SVR) and the random forest (RF) models accuracy for streamflow prediction under a data-scarce basin in Morocco

Bouchra Bargam, Abdelghani Boudhar, Christophe Kinnard, Hafsa Bouamri, Karima Nifa, Abdelghani Chehbouni
DOI: https://doi.org/10.1007/s42452-024-05994-z
2024-06-05
SN Applied Sciences
Abstract: Streamflow prediction is key to water resources management. It is especially important in semi-arid regions such as the Tensift river basin in Morocco, where water resources face severe drought and demand is continuously increasing. The present analysis evaluates two machine learning (ML) techniques, support vector regression (SVR) and random forest (RF), against multiple linear regression (MLR) for daily streamflow forecasting in the mountainous Rheraya sub-basin between 2003 and 2016. The results show that SVR performed best, followed by RF and MLR. In terms of mean performance, SVR exhibited a higher Nash–Sutcliffe efficiency score (NSE = 0.59) and a lower root mean squared error (RMSE = 1.18) compared to RF (NSE = 0.53, RMSE = 1.18) and MLR (NSE = 0.54, RMSE = 1.01). However, the available time series was too short to capture the full range of streamflow variability, which reduced prediction performance outside the calibration conditions. These findings suggest that ML algorithms, particularly SVR, can provide accurate streamflow estimates useful for water resources management when trained on a representative period. The results highlight the capacity of ML algorithms, specifically SVR, to improve streamflow prediction for enhanced water resource management in arid regions.
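For readers unfamiliar with the two scores quoted in the abstract, the following is a minimal sketch that assumes the standard textbook definitions of NSE and RMSE. The function names `nse` and `rmse` and the toy flow values are illustrative assumptions and do not come from the paper.

```python
import math

def nse(observed, simulated):
    """Nash-Sutcliffe efficiency: 1 - SSE / variance of observations about their mean.

    1.0 is a perfect fit; 0.0 means the model is no better than
    predicting the observed mean; negative values are worse than the mean.
    """
    mean_obs = sum(observed) / len(observed)
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    spread = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - sse / spread

def rmse(observed, simulated):
    """Root mean squared error, in the same units as the observations."""
    sse = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    return math.sqrt(sse / len(observed))

# Toy daily flows (hypothetical values, not data from the study).
obs = [1.0, 2.0, 3.0, 4.0]
sim = [1.5, 2.5, 3.5, 4.5]
print(nse(obs, sim), rmse(obs, sim))
```

NSE is dimensionless while RMSE carries the units of the flow series, so the two scores can rank models differently. Reporting both, as the abstract does, gives a fuller picture than either alone.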