Abstract:Lakes play a crucial role in the global biogeochemical cycles through the transport, storage, and transformation of different biogeochemical compounds. Their regulatory service appears to be disproportionately important relative to their small areal extent, necessitating continuous monitoring. This study leverages the potential of optical remote sensing sensors, specifically Sentinel-2 Multispectral Imagery (MSI), to monitor and predict water quality parameters in lakes. Optically active parameters, such as chlorophyll a (CHL), total suspended matter (TSM), and colored dissolved matter (CDOM), can be directly detected using optical remote sensing sensors. However, the challenge lies in detecting non-optically active substances, which lack direct spectral characteristics. The capabilities of artificial intelligence applications can be used in the identification of optically non-active compounds from remote sensing data. This study aims to employ a machine learning approach (combining the Genetic Algorithm (GA) and Extreme Gradient Boost (XGBoost)) and in situ and Sentinel-2 Multispectral Imagery data to construct inversion models for 16 physical and biogeochemical water quality parameters including CHL, CDOM, TSM, total nitrogen (TN), total phosphorus (TP), phosphate (PO4), sulphate, ammonium nitrogen, 5-day biochemical oxygen demand (BOD5), chemical oxygen demand (COD), and the biomasses of phytoplankton and cyanobacteria, pH, dissolved oxygen (O2), water temperature (WT) and transparency (SD). GA_XGBoost exhibited strong predictive capabilities and it was able to accurately predict 10 biogeochemical and 2 physical water quality parameters. Additionally, this study provides a practical demonstration of the developed inversion models, illustrating their applicability in estimating various water quality parameters simultaneously across multiple lakes on five different dates. The study highlights the need for ongoing research and refinement of machine learning methodologies in environmental monitoring, particularly in remote sensing applications for water quality assessment. Results emphasize the need for broader temporal scopes, longer-term datasets, and enhanced model selection strategies to improve the robustness and generalizability of these models. In general, the outcomes of this study provide the basis for a better understanding of the role of lakes in the biogeochemical cycle and will allow the formulation of reliable recommendations for various applications used in the studies of ecology, water quality, the climate, and the carbon cycle.

Uncertainty assessment of optically active and inactive water quality parameters predictions using satellite data, deep and ensemble learnings

Spatiotemporal assessment of groundwater quality and quantity using geostatistical and ensemble artificial intelligence tools

Hybrid WT-CNN-GRU-based model for the estimation of reservoir water quality variables considering spatio-temporal features

Integration of band regression empirical water quality (BREWQ) model with deep learning algorithm in spatiotemporal modeling and prediction of surface water quality parameters

Robust clustering-based hybrid technique enabling reliable reservoir water quality prediction with uncertainty quantification and spatial analysis

Evaluation of River Water Quality Index Using Remote Sensing and Artificial Intelligence Models

Fuzzy Similarity Analysis of Effective Training Samples to Improve Machine Learning Estimations of Water Quality Parameters Using Sentinel-2 Remote Sensing Data

Using Ensemble Learning for Remote Sensing Inversion of Water Quality Parameters in Poyang Lake

Estimation of water quality variables based on machine learning model and cluster analysis-based empirical model using multi-source remote sensing data in inland reservoirs, South China

Enhancing the accuracy of metaheuristic neural networks in predicting underground water levels using meteorological data and remote sensing: A case study of Ardabil Plain, Iran

A Novel Approach to Uncertainty Quantification in Groundwater Table Modeling by Automated Predictive Deep Learning

Forecasting water quality variable using deep learning and weighted averaging ensemble models

Water Quality Prediction Based on an Improved ARIMA- RBF Model Facilitated by Remote Sensing Applications

Suitability Assessment of Cage Fish Farming Location in Reservoirs through Neural Networks-Based Remote Sensing Analysis

Enhancing groundwater quality prediction through ensemble machine learning techniques

Modeling Spatial Flood using Novel Ensemble Artificial Intelligence Approaches in Northern Iran

Water Resources Quality Indicators Monitoring by Nonlinear Programming and Simulated Annealing Optimization with Ensemble Learning Approaches

Estimation of the Biogeochemical and Physical Properties of Lakes Based on Remote Sensing and Artificial Intelligence Applications

An Ensemble Machine Learning Model to Estimate Urban Water Quality Parameters Using Unmanned Aerial Vehicle Multispectral Imagery

Study of water resources parameters using artificial intelligence techniques and learning algorithms: a survey

Estimation and uncertainty analysis of groundwater quality parameters in a coastal aquifer under seawater intrusion: a comparative study of deep learning and classic machine learning methods