ActiveAD: Enhancing Anomaly Detection in Tabular Data through Active Learning Strategies

Haoyan Luo, Xiaofan Gui, Wei Cao, Jiang Bian
Abstract:Detecting anomalies in tabular data is critical in many fields, including cybersecurity, finance, and healthcare. However, labeling data for anomaly detection is often labor-intensive and costly. Active learning (AL) emerges as a promising approach to mitigate these challenges, aiming to reduce the labeling cost while maintaining high detection performance. In our project, we propose a pipeline, ActiveAD, for active anomaly detection that combines various anomaly detection models and active learning querying strategies to improve the efficiency and effectiveness of identifying anomalies with limited labeled data. Benefiting from the recent study on anomaly detection benchmark, we also offer a comprehensive comparison of different active learning method performance on diverse datasets. Extensive experiments reveal the strengths and weaknesses of each method and the impact of outliers, providing valuable insights into their suitability under different conditions.
What problem does this paper attempt to address?