Continuous Kernel-Based Outlier Detection Over Distributed Data Streams

Liang Su,Weihong Han,Peng Zou,Yan Jia
DOI: https://doi.org/10.1007/978-3-540-74767-3_32
2007-01-01
Abstract:Stream data are often transmitted over a distributed network, but in many cases, are too voluminous to be collected in a central location. Instead, we must perform distributed computations, guaranteeing high quality results in real-time even as new data arrive. In this paper, firstly, we formalize the problem of continuous outlier detection over distributed evolving data streams. Then, two novel outlier measures and algorithms are proposed which can identify outliers in a single pass. Furthermore, our experiments with synthetic and real data show that the proposed methods are both efficient and effective compared with existing outlier detection algorithms.
What problem does this paper attempt to address?