Data-Driven Anomaly Detection Approach for Time-Series Streaming Data

Minghu Zhang, Jianwen Guo, Xin Li, Rui Jin
DOI: https://doi.org/10.3390/s20195646
IF: 3.9
2020-10-02
Sensors
Abstract: Recently, wireless sensor networks (WSNs) have been extensively deployed to monitor environments. Sensor nodes are susceptible to fault generation due to hardware and software failures in harsh environments. Anomaly detection for the time-series streaming data of sensor nodes is a challenging but critical fault diagnosis task, particularly in large-scale WSNs. The data-driven approach is becoming essential for the goal of improving the reliability and stability of WSNs. We propose a data-driven anomaly detection approach in this paper, named median filter (MF)-stacked long short-term memory-exponentially weighted moving average (LSTM-EWMA), for time-series status data, including the operating voltage and panel temperature recorded by a sensor node deployed in the field. These status data can be used to diagnose device anomalies. First, a median filter (MF) is introduced as a preprocessor to handle obvious anomalies in input data. Then, stacked long short-term memory (LSTM) is employed for prediction. Finally, the exponentially weighted moving average (EWMA) control chart is employed as a detector for recognizing anomalies. We evaluate the proposed approach for the panel temperature and operating voltage of time-series streaming data recorded by wireless node devices deployed in harsh field conditions for environmental monitoring. Extensive experiments were conducted on real time-series status data. The results demonstrate that compared to other approaches, the MF-stacked LSTM-EWMA approach can significantly improve the detection rate (DR) and reduce the false rate (FR). The average DR and FR values with the proposed approach are 95.46% and 4.42%, respectively. MF-stacked LSTM-EWMA anomaly detection also achieves a better F2 score than that achieved by other methods. The proposed approach provides valuable insights for anomaly detection in WSNs by detecting anomalies in the time-series status data recorded by wireless sensor nodes.
engineering, electrical & electronic; chemistry, analytical; instruments & instrumentation
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper addresses anomaly detection in time-series streaming data within wireless sensor networks (WSNs). Specifically, it proposes a data-driven approach to detect anomalies in the status data of sensor nodes, including metrics such as operating voltage and panel temperature.

#### Main Objectives

- **Improve the reliability and stability of WSNs**: Detecting and diagnosing sensor node anomalies in a timely manner enhances the quality of environmental monitoring services.
- **Propose a new anomaly detection method**: The method combines a Median Filter (MF), a stacked Long Short-Term Memory (LSTM) network, and an Exponentially Weighted Moving Average (EWMA) control chart into a data-driven anomaly detector.

#### Key Issues

- **Device fault detection in large-scale WSNs**: Many WSN devices are prone to failure in harsh environments, so an effective automatic detection system is needed to identify these faults.
- **Anomaly analysis of status data**: The operating voltage and panel temperature of sensor nodes can be used to diagnose whether the equipment is functioning normally, which helps ensure the continuity and accuracy of environmental monitoring.

### Method Overview

- **Median Filter (MF) Preprocessing**: Handles obvious anomalies in the input data.
- **Stacked LSTM Prediction**: Predicts the status data of sensor nodes and calculates the residuals between the predicted values and the actual values.
- **EWMA Control Chart Detection**: Detects and reports anomalies in the residuals.

### Experimental Evaluation

- **Study Area**: Heihe River Basin, located in the northwest region of China, covering an area of approximately 143,000 square kilometers.
- **Dataset**: Panel temperature and operating voltage data recorded by the Arou Superstation from November 1, 2019, to November 10, 2019, at a collection interval of 10 minutes.
- **Evaluation Metrics**: Detection Rate (DR) and False Rate (FR). Experimental results show that the proposed MF-stacked LSTM-EWMA method significantly improves detection performance, with an average DR of 95.46% and an average FR of 4.42%.

### Conclusion

This paper proposes a new data-driven anomaly detection method that can effectively identify anomalies in the status data of sensor nodes in WSNs, thereby improving the reliability and stability of the system.
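
The three stages in the method overview and the evaluation metrics can be illustrated with a minimal Python sketch. This is not the authors' implementation. The stacked LSTM is replaced by a trivial stand-in predictor. The function names, the kernel size, and the EWMA parameters `lam` and `L` are illustrative assumptions. DR and FR are assumed here to be TP/(TP+FN) and FP/(FP+TN), which may differ from the paper's exact definitions. The control limits follow the standard EWMA chart formula, and F2 is the standard F-beta score with beta = 2.

```python
import numpy as np

def median_filter(x, kernel=3):
    """Sliding-window median with edge padding; kernel must be odd."""
    if kernel % 2 == 0:
        raise ValueError("kernel must be odd")
    x = np.asarray(x, dtype=float)
    half = kernel // 2
    padded = np.pad(x, half, mode="edge")
    return np.array([np.median(padded[i:i + kernel]) for i in range(len(x))])

def ewma_alarms(residuals, mu0, sigma, lam=0.2, L=3.0):
    """Flag points whose EWMA statistic leaves the time-varying control limits.

    mu0 and sigma are the in-control mean and standard deviation of the
    residuals, estimated from fault-free data (an assumption of this sketch).
    """
    residuals = np.asarray(residuals, dtype=float)
    alarms = np.zeros(len(residuals), dtype=bool)
    z = mu0  # EWMA statistic starts at the in-control mean
    for t, r in enumerate(residuals, start=1):
        z = lam * r + (1.0 - lam) * z
        half_width = L * sigma * np.sqrt(
            lam / (2.0 - lam) * (1.0 - (1.0 - lam) ** (2 * t))
        )
        alarms[t - 1] = abs(z - mu0) > half_width
    return alarms

def detection_metrics(y_true, y_pred):
    """Return DR, FR, and F2 under the definitions assumed in the lead-in."""
    y_true = np.asarray(y_true, dtype=bool)
    y_pred = np.asarray(y_pred, dtype=bool)
    tp = int(np.sum(y_true & y_pred))
    fp = int(np.sum(~y_true & y_pred))
    fn = int(np.sum(y_true & ~y_pred))
    tn = int(np.sum(~y_true & ~y_pred))
    dr = tp / (tp + fn) if tp + fn else 0.0
    fr = fp / (fp + tn) if fp + tn else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    f2 = 5 * precision * dr / (4 * precision + dr) if precision + dr else 0.0
    return {"DR": dr, "FR": fr, "F2": f2}

if __name__ == "__main__":
    # Synthetic voltage trace (hypothetical values, not the paper's data).
    rng = np.random.default_rng(0)
    voltage = 12.0 + rng.normal(0.0, 0.01, 300)
    labels = np.zeros(300, dtype=bool)
    voltage[150] += 1.0       # isolated spike, handled by the median filter
    voltage[200:220] -= 0.1   # sustained fault the detector should flag
    labels[200:220] = True

    filtered = median_filter(voltage, kernel=5)
    # Stand-in predictor: the paper uses a stacked LSTM; here the forecast is
    # simply the median of a fault-free reference window.
    predicted = np.full(300, np.median(filtered[:100]))
    residuals = filtered - predicted
    alarms = ewma_alarms(residuals, residuals[:100].mean(), residuals[:100].std())
    print(detection_metrics(labels, alarms))
```

A smaller `lam` makes the chart more sensitive to small sustained shifts but slower to react, while `L` trades detection rate against false rate. In practice both would be tuned on fault-free data from the node being monitored.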