Research on Anomaly Detection Methodology Combining Large Language Models

Dengjiang Cai, Xingxin Leng, Lixuan Qiu, Tingting Zhang
DOI: https://doi.org/10.1109/CISCE62493.2024.10653263
2024-05-10
Abstract: With the development of big data and artificial intelligence technologies, anomaly detection has become increasingly important in data analysis. In this study, we propose a novel detection method called SemantEdge Detection (SED), which combines large language models with unsupervised anomaly detection algorithms to improve detection performance at semantic boundaries. Four unsupervised algorithms of different types, namely iForest, HBOS, KNN, and LODA, are applied to the CCF dataset and combined with the semantic understanding ability of large language models to verify experimentally how well anomalies are detected in boundary cases. The results show that the method improves the detection performance of unsupervised algorithms near boundary values. This study provides new research ideas for unsupervised anomaly detection and offers empirical evidence for applying large language models in the field of anomaly detection.
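The abstract does not describe how SED is implemented, so the following is only a minimal sketch of one plausible reading: an unsupervised detector scores every sample, and samples whose scores fall close to the decision threshold are deferred to a second-stage semantic judge. The scorer here is a plain k-nearest-neighbour distance (KNN is one of the four algorithms named above). The names `detect_with_boundary_review`, `threshold`, `margin`, and `judge` are hypothetical and do not come from the paper; `judge` stands in for a call to a large language model.

```python
import math
from typing import Callable, List, Sequence

Point = Sequence[float]


def knn_scores(points: Sequence[Point], k: int = 2) -> List[float]:
    """Score each point by its distance to its k-th nearest neighbour."""
    scores = []
    for i, p in enumerate(points):
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(dists[k - 1])
    return scores


def detect_with_boundary_review(
    points: Sequence[Point],
    threshold: float,
    margin: float,
    judge: Callable[[Point, float], bool],
    k: int = 2,
) -> List[bool]:
    """Label points as anomalous (True) or normal (False).

    Assumption (not from the paper): scores within `margin` of `threshold`
    count as boundary cases and are decided by `judge`, a placeholder for
    an LLM-based check. All other scores are decided by the threshold alone.
    """
    labels = []
    for p, score in zip(points, knn_scores(points, k)):
        if abs(score - threshold) <= margin:
            labels.append(bool(judge(p, score)))  # boundary case: defer
        else:
            labels.append(score > threshold)  # clear case: threshold decides
    return labels


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (1, 1), (3, 3), (10, 10)]
    # Stub judge that flags every boundary sample; a real system would
    # prompt a language model with the sample's semantic context here.
    print(detect_with_boundary_review(data, threshold=3.0, margin=1.0,
                                      judge=lambda p, s: True))
```

In this toy data only the point (3, 3) scores near the threshold, so it is the only sample passed to `judge`; the tight cluster and the distant point are decided by the threshold alone. Restricting the second stage to a narrow band keeps the number of model calls small, which is one reason such a two-stage design might be attractive.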
Computer Science