Inversion Method for Chlorophyll-a Concentration in High-Salinity Water Based on Hyperspectral Remote Sensing Data

Nan Wang,Zhiguo Wang,Pingping Huang,Yongguang Zhai,Xiangli Yang,Jianyu Su
DOI: https://doi.org/10.3390/s24134181
IF: 3.9
2024-06-28
Sensors
Abstract:As one of the important lakes in the "One Lake and Two Seas" of the Inner Mongolia Autonomous Region, the monitoring of water quality in Lake Daihai has attracted increasing attention, and the concentration of chlorophyll-a directly affects the water quality, making the monitoring of chlorophyll-a concentration in Lake Daihai particularly crucial. Traditional methods of monitoring chlorophyll-a concentration are not only inefficient but also require significant human and material resources. Remote sensing technology has the advantages of wide coverage and short update cycles. For lakes such as Daihai with a high salinity content, salinity is considered a key factor when inverting the concentration of chlorophyll-a. In this study, machine learning models, including model stacking from ensemble learning, a ridge regression model, and a random forest model, were constructed. After comparing the training accuracy of the three models on Zhuhai-1 satellite data, the random forest model, which had the highest accuracy, was selected as the final training model. By comparing the accuracy changes before and after adding salinity factors to the random forest model, a high-precision model for inverting chlorophyll-a concentration in hypersaline lakes was obtained. The research results show that, without considering the salinity factor, the root mean square error (RMSE) of the model was 0.056, and the coefficient of determination (R2) was 0.64, indicating moderate model performance. After adding the salinity factor, the model accuracy significantly improved: the RMSE decreased to 0.047, and the R2 increased to 0.92. This study provides a solid basis for the application of remote sensing technology in hypersaline aquatic environments, confirming the importance of considering salinity when estimating chlorophyll-a concentration in hypersaline waters. This research helps us gain a deeper understanding of the water quality and ecosystem evolution in Daihai Lake.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: in a high - salinity water environment, how to accurately monitor the concentration of chlorophyll - a using hyperspectral remote sensing data. Specifically, the research focuses on the water quality monitoring problem of Daihai Lake in Inner Mongolia Autonomous Region. ### Problem Background Traditional methods for monitoring the concentration of chlorophyll - a are not only inefficient but also require a large amount of human and material resources. Remote sensing technology, on the other hand, has the advantages of large - scale coverage and short update cycles, and is suitable for water quality monitoring of large - area water bodies such as lakes. For lakes with high salinity like Daihai Lake, salinity is considered one of the key factors affecting the inversion of chlorophyll - a concentration. ### Research Objectives The objective of this study is to significantly improve the accuracy of the chlorophyll - a concentration prediction model by comprehensively considering various environmental parameters, especially salinity. Specifically, the research aims to develop and analyze multiple machine - learning algorithms to determine the most suitable model for predicting the concentration of chlorophyll - a. By comparing the changes in model accuracy before and after adding the salinity factor, accurate prediction of the concentration of chlorophyll - a is achieved. ### Main Methods 1. **Data Pre - processing**: - Use ENVI5.3 software for image reading, radiometric correction and atmospheric correction. - Use RPC information for orthorectification to ensure that the image matches the actual geographical location. 2. **Feature Selection**: - Perform correlation analysis and principal component analysis (PCA) on the pre - processed Zhuhai - 1 satellite data to screen out key feature bands. 3. **Model Construction and Evaluation**: - Construct multiple machine - learning models, including model stacking in ensemble learning, ridge regression model and random forest model. - Compare the accuracy performance of different models on the same data set, and finally select the random forest model with the highest accuracy. - After adding the salinity factor, re - perform PCA, obtain new feature bands, and further evaluate the inversion accuracy of the model. 4. **Result Verification**: - Establish the model through cross - validation, and compare the changes in model accuracy before and after adding the salinity factor. - The results show that when the salinity factor is not considered, the root - mean - square error (RMSE) of the model is 0.056, and the coefficient of determination (R²) is 0.64; after adding the salinity factor, the RMSE is reduced to 0.047, and R² is increased to 0.92. ### Research Significance This research provides a solid scientific basis for water quality monitoring and management in high - salinity water environments, and confirms the importance of considering salinity when estimating the concentration of chlorophyll - a in high - salinity water bodies. This is of great significance for in - depth understanding of the water quality status and ecosystem evolution of Daihai Lake.