What problem does this paper attempt to address?

This paper addresses several key problems in anomaly detection, in particular how to identify anomalies effectively when supervision is limited or entirely absent. Specifically, it targets the following problems:

1. **Lack of supervision information**: In many anomaly detection applications, not all anomaly types are known in advance, so a detector must be able to identify new, unseen anomaly types during operation. Traditional supervised methods perform poorly in this setting.
2. **Requirement for online processing**: Anomaly detection usually involves large volumes of data in which only a few samples are anomalies. Classical Dictionary Learning (DL) methods struggle with such large-scale data sets, so methods that can process data online are needed.
3. **Low false positive rate**: In practical applications such as anti-money laundering and fraud detection, the false positive rate must be as low as possible to ensure the reliability of the system.

To solve these problems, the paper proposes two anomaly detection methods based on the dictionary-learning framework:

- **Semi-supervised online algorithm (TODDLeR)**: This method learns and classifies newly arrived signals online, given a small amount of labeled data. It extends the classical dictionary-learning objective function with a classifier and a label-consistency dictionary, which improves its ability to identify new anomaly types. Its optimization objective can be expressed as:

  \[ \min_{D,W,A} \|y - Dx\|^2_2+\alpha \|h - Wx\|^2_2+\beta \|q - Ax\|^2_2+\lambda_1 \|W - W_0\|^2_F+\lambda_2 \|A - A_0\|^2_F \]

  where \(y\) is the newly arrived signal, \(x\) is its sparse representation over the dictionary \(D\), \(W\) and \(A\) are the linear classifier and the label-consistency dictionary respectively, \(h\) and \(q\) are the estimated label and atom-assignment targets, \(\alpha\) and \(\beta\) weight the classification and label-consistency terms, and \(\lambda_1\) and \(\lambda_2\) are regularization parameters that keep \(W\) and \(A\) close to the reference matrices \(W_0\) and \(A_0\).
- **Unsupervised method**: This method applies when no labels are available at all. It infers the nature of samples from performance indicators gathered during the dictionary-learning process. Specifically, it gradually filters out the signals that are less likely to be anomalies until the set of potential anomalies remains. For example, normal and anomalous samples can be distinguished by their representation error or by atom popularity.

These methods have been tested in practical application scenarios such as financial fraud detection, showing good performance and a low false positive rate.
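To make the objective and the two unsupervised indicators concrete, here is a minimal Python sketch. It is not the paper's implementation. It only evaluates the objective above for one signal, assuming the sparse code \(x\) has already been computed by some sparse-coding step that is not shown. The function names (`toddler_objective`, `representation_error`, `atom_popularity`) are hypothetical, and treating atom popularity as "the number of signals that use an atom with a nonzero coefficient" is an assumption made for illustration. Plain lists are used so the block has no dependencies.

```python
def matvec(M, v):
    """Multiply matrix M (a list of rows) by vector v."""
    return [sum(m * c for m, c in zip(row, v)) for row in M]


def sq_dist(u, v):
    """Squared Euclidean distance ||u - v||_2^2."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def sq_frob(M, N):
    """Squared Frobenius norm ||M - N||_F^2."""
    return sum(sq_dist(r, s) for r, s in zip(M, N))


def toddler_objective(y, x, h, q, D, W, A, W0, A0, alpha, beta, lam1, lam2):
    """Evaluate the five-term objective for one signal y and a fixed code x."""
    return (sq_dist(y, matvec(D, x))              # reconstruction term
            + alpha * sq_dist(h, matvec(W, x))    # classification term
            + beta * sq_dist(q, matvec(A, x))     # label-consistency term
            + lam1 * sq_frob(W, W0)               # keeps W near W0
            + lam2 * sq_frob(A, A0))              # keeps A near A0


def representation_error(y, D, x):
    """Squared reconstruction error of y under dictionary D and code x."""
    return sq_dist(y, matvec(D, x))


def atom_popularity(codes, n_atoms):
    """Assumed definition: count the signals that use each atom (nonzero coefficient)."""
    counts = [0] * n_atoms
    for x in codes:
        for j, coeff in enumerate(x):
            if coeff != 0:
                counts[j] += 1
    return counts


# Toy data: 2-dimensional signals, 3 atoms, 2 classes (all values illustrative).
D = [[1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
W0 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
A0 = [row[:] for row in A]
y, x = [1.0, 0.5], [1.0, 0.0, 0.5]
h, q = [1.0, 0.0], [1.0, 0.0, 0.0]

value = toddler_objective(y, x, h, q, D, W, A, W0, A0,
                          alpha=2.0, beta=4.0, lam1=0.5, lam2=0.5)
error = representation_error(y, D, x)
popularity = atom_popularity([x, [0.0, 0.0, 2.0], [0.0, 0.0, 0.0]], n_atoms=3)
print(value, error, popularity)
```

In a filtering loop like the one described for the unsupervised method, signals with a small `representation_error` would be the natural candidates to discard as likely normal. The actual thresholds and update rules are specific to the paper and are not reproduced here.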

Unsupervised Dictionary Learning for Anomaly Detection

Detecting Anomalies In Encrypted Traffic Via Deep Dictionary Learning

Learning Discrimination from Contaminated Data: Multi-Instance Learning for Unsupervised Anomaly Detection

Fusing Dictionary Learning and Support Vector Machines for Unsupervised Anomaly Detection

Anomaly Detection with Selective Dictionary Learning

Toward Supervised Anomaly Detection

Learning Discriminative Features for Semi-Supervised Anomaly Detection

Deep Learning for Anomaly Detection: Challenges, Methods, and Opportunities

Self-Supervised Learning for Online Anomaly Detection in High-Dimensional Data Streams

Dynamic Threshold-based Two-layer Online Unsupervised Anomaly Detector

Deep Anomaly Detection and Search via Reinforcement Learning

Anomalous Example Detection in Deep Learning: A Survey

Toward Deep Supervised Anomaly Detection: Reinforcement Learning from Partially Labeled Anomaly Data

Learning Competitive and Discriminative Reconstructions for Anomaly Detection

Unsupervised Anomaly Detection with Rejection

Learning to Detect Interesting Anomalies

An Extreme Learning Machine for Unsupervised Online Anomaly Detection in Multivariate Time Series

How Low Can You Go? Surfacing Prototypical In-Distribution Samples for Unsupervised Anomaly Detection

Self-Supervised Learning for Anomaly Detection With Dynamic Local Augmentation

Online Model-based Anomaly Detection in Multivariate Time Series: Taxonomy, Survey, Research Challenges and Future Directions

Self-Supervised Time-Series Anomaly Detection Using Learnable Data Augmentation