A novel machine learning-based framework for the water quality parameters prediction using hybrid long short-term memory and locally weighted scatterplot smoothing methods

Ana Dodig, Elisa Ricci, Goran Kvascev, Milan Stojkovic
DOI: https://doi.org/10.2166/hydro.2024.273
IF: 3.058
2024-04-13
Journal of Hydroinformatics
Abstract: Water quality prediction is crucial for effective river management. Dissolved oxygen, conductivity, and chemical oxygen demand are key chemical indicators of water quality. Advances in machine learning (ML) and deep learning (DL) have made these methods widely used in this domain. Sophisticated DL techniques, especially long short-term memory (LSTM) networks, are required for accurate, real-time multi-step prediction. LSTM networks are effective in predicting water quality because they can handle long-term dependencies in sequential data. We propose a novel hybrid approach to predicting water quality parameters that combines DL with a data smoothing method. The Sava River at the Jamena hydrological station serves as a case study. Our workflow uses LSTM networks alongside the LOcally WEighted Scatterplot Smoothing (LOWESS) technique for data filtering. For comparison, a Support Vector Regressor (SVR) is used as the baseline method. Performance is evaluated using the Root Mean Squared Error (RMSE) and the Coefficient of Determination (R²). Results demonstrate that LSTM outperforms the baseline method, with an R² score of up to 0.9998 and an RMSE of 0.0230 on the test set for dissolved oxygen. Over a 5-day prediction period, our approach achieves an R² score of 0.9912 and an RMSE of 0.1610, confirming it as a reliable method for predicting water quality parameters several days ahead.
environmental sciences; computer science, interdisciplinary applications; engineering, civil; water resources
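The abstract names a LOWESS filtering step and two evaluation metrics (RMSE and R²) without showing how they are computed. The following is a minimal sketch of those pieces only, not the authors' implementation: the LOWESS here is a simplified variant (tricube-weighted local linear fit, no robustness iterations), the function names `rmse`, `r2_score`, and `lowess_smooth`, the `frac` parameter, and the synthetic series are all assumptions made for illustration, and the LSTM and SVR models themselves are not reproduced.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    n = len(y_true)
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / n)


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def lowess_smooth(x, y, frac=0.5):
    """Simplified LOWESS (assumption: no robustness iterations).

    For each x[i], fit a weighted straight line to the nearest
    ceil(frac * n) points using tricube weights, then evaluate it at x[i].
    """
    n = len(x)
    k = max(2, math.ceil(frac * n))  # number of neighbours per local fit
    smoothed = []
    for xi in x:
        dists = [abs(xj - xi) for xj in x]
        # Bandwidth: distance to the k-th nearest point (guard against 0).
        h = sorted(dists)[k - 1] or 1.0
        w = [(1 - (d / h) ** 3) ** 3 if d < h else 0.0 for d in dists]
        sw = sum(w)  # >= 1, because the point itself has weight 1
        mx = sum(wi * xj for wi, xj in zip(w, x)) / sw
        my = sum(wi * yj for wi, yj in zip(w, y)) / sw
        sxx = sum(wi * (xj - mx) ** 2 for wi, xj in zip(w, x))
        sxy = sum(wi * (xj - mx) * (yj - my) for wi, xj, yj in zip(w, x, y))
        # Fall back to the weighted mean if the local fit is degenerate.
        slope = sxy / sxx if sxx > 0 else 0.0
        smoothed.append(my + slope * (xi - mx))
    return smoothed


if __name__ == "__main__":
    # Synthetic, made-up series (NOT data from the paper): a slow
    # oscillation plus a deterministic high-frequency disturbance.
    days = list(range(30))
    clean = [8.0 + math.sin(d / 5.0) for d in days]
    noisy = [c + 0.3 * math.sin(7.0 * d) for c, d in zip(clean, days)]
    filtered = lowess_smooth(days, noisy, frac=0.3)
    print("RMSE raw vs clean:     ", rmse(clean, noisy))
    print("RMSE filtered vs clean:", rmse(clean, filtered))
    print("R2 filtered vs clean:  ", r2_score(clean, filtered))
```

In a pipeline like the one the abstract describes, a smoother of this kind would be applied to the measured series before it is passed to the predictor, and the two metric functions would compare predictions with observations on a held-out test set. For real use, an established implementation (for example the LOWESS routine in statsmodels) would be a more robust choice than this hand-written version.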