A Novel Online Hydrological Data Quality Control Approach Based on Adaptive Differential Evolution

Qun Zhao,Shicheng Cui,Yuelong Zhu,Rui Li,Xudong Zhou
DOI: https://doi.org/10.3390/math12121821
IF: 2.4
2024-06-13
Mathematics
Abstract:The quality of hydrological data has a significant impact on hydrological models, where stable and anomaly-free hydrological time series typically yield more valuable patterns. In this paper, we conduct data analysis and propose an online hydrological data quality control method based on an adaptive differential evolution algorithm according to the characteristics of hydrological data. Taking into account the characteristics of continuity, periodicity, and seasonality, we develop a Periodic Temporal Long Short-Term Memory (PT-LSTM) predictive control model. Building upon the real-time nature of the data, we apply the Adaptive Differential Evolution algorithm to optimize PT-LSTM, creating an Online Composite Predictive Control Model (OCPT-LSTM) that provides confidence intervals and recommended values for control and replacement. The experimental results demonstrate that the proposed data quality control method effectively manages data quality; detects data anomalies; provides suggested values; reduces reliance on manual intervention; provides a solid data foundation for hydrological data analysis work; and helps hydrological personnel in water resource scheduling, flood control, and other related tasks. Meanwhile, the proposed method can also be applied to the analysis of time series data in other industries.
mathematics
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address several key issues in hydrological data quality control: 1. **Dependence on Manual Intervention**: - Currently, most hydrological data quality control methods are still in the theoretical research and modeling stage. Controlling data quality often requires manual intervention, lacking intelligent data quality control algorithms and models. 2. **Low Credibility of Short-term Missing Data Imputation**: - For short-term missing values caused by equipment failures, commonly used interpolation methods (such as the average method, weighted method, or simple spatial interpolation method) perform poorly, and the credibility of the imputed data is difficult to assess. 3. **Lack of Reliable Replacement Values for Anomalous Data**: - Many current methods detect anomalous data through basic checks and standard settings but fail to provide reliable replacement values for the anomalous data. To address these issues, this study proposes an intelligent hydrological data quality control method based on the continuity, periodicity, seasonality, and real-time nature of hydrological data. The aim is to reduce dependence on manual intervention, optimize hydrological data quality, and thereby improve the stability and accuracy of data mining models.