Anomaly Detection Based on Isolation Mechanisms: A Survey

Yang Cao,Haolong Xiang,Hang Zhang,Ye Zhu,Kai Ming Ting
2024-03-16
Abstract:Anomaly detection is a longstanding and active research area that has many applications in domains such as finance, security, and manufacturing. However, the efficiency and performance of anomaly detection algorithms are challenged by the large-scale, high-dimensional, and heterogeneous data that are prevalent in the era of big data. Isolation-based unsupervised anomaly detection is a novel and effective approach for identifying anomalies in data. It relies on the idea that anomalies are few and different from normal instances, and thus can be easily isolated by random partitioning. Isolation-based methods have several advantages over existing methods, such as low computational complexity, low memory usage, high scalability, robustness to noise and irrelevant features, and no need for prior knowledge or heavy parameter tuning. In this survey, we review the state-of-the-art isolation-based anomaly detection methods, including their data partitioning strategies, anomaly score functions, and algorithmic details. We also discuss some extensions and applications of isolation-based methods in different scenarios, such as detecting anomalies in streaming data, time series, trajectory, and image datasets. Finally, we identify some open challenges and future directions for isolation-based anomaly detection research.
Machine Learning
What problem does this paper attempt to address?
The paper primarily addresses the issue of anomaly detection in the context of big data, particularly the challenges faced by traditional anomaly detection algorithms in terms of efficiency and performance when dealing with large-scale, high-dimensional, and heterogeneous data. The paper focuses on unsupervised anomaly detection methods based on isolation mechanisms and explores their advantages in different application scenarios. Specifically, the core issues addressed by the paper are: 1. **Proposing anomaly detection methods based on isolation mechanisms**: These methods utilize the isolation characteristics between data points to identify anomalies. Anomalies are considered rare and different from normal data points, thus they can be more easily isolated through random partitioning. 2. **Reviewing existing technologies**: The paper reviews the latest anomaly detection methods based on isolation mechanisms, including their data partitioning strategies, anomaly scoring functions, and algorithmic details. 3. **Discussing extended applications**: The paper also discusses the application of isolation-based methods in various scenarios such as streaming data, time series, trajectory, and image datasets. 4. **Analyzing challenges and future directions**: Finally, the paper points out some open challenges and future research directions in the study of isolation-based anomaly detection. In short, this paper aims to provide researchers and practitioners with a comprehensive review of isolation-based anomaly detection methods, offering a framework to understand these novel and efficient anomaly detection techniques, and highlighting future development trends in this field.