Abstract: Log messages provide a valuable source of runtime information for ensuring the safety and consistency of systems. Recently, many machine learning and deep learning methods have been proposed to automatically detect anomalous log messages, obviating the need for manual detection by experts. However, we find that in practice, the effectiveness of existing learning-based methods is severely affected by incomplete information and distribution shift. Specifically, each log message can be parsed into a fixed number of key information fields, yet existing methods analyze log messages using only the log event information and ignore other fields that can be critical to anomaly detection. Further, the distribution of real-world log messages changes continuously due to the dynamic nature of the runtime environment; thus, a detection model conventionally trained under the unrealistic i.i.d. assumption may not deliver the expected, consistent performance. In this paper, we present RT-Log, a robust and transferable anomaly detection framework that addresses the above problems. To perform a comprehensive analysis of log messages, we introduce an adaptive relation modeling technique, which captures feature interactions among log information fields selectively and dynamically for effective and interpretable log representations. To establish robustness and transferability, we propose a general environment generalization technique for learning environment-invariant representations that generalize across different runtime environments. We evaluate the anomaly detection performance of RT-Log on large real-world datasets. Extensive experimental results demonstrate that RT-Log consistently outperforms state-of-the-art methods by a significant margin under different settings.
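As a rough illustration of what parsing a log message into a fixed number of information fields can look like, the following is a minimal sketch. It is not RT-Log's parser: the line layout, the field names (`timestamp`, `level`, `component`, `event`, `params`), and the rule that standalone numbers count as parameters are all assumptions made for this example.

```python
import re
from typing import NamedTuple, Optional


class LogFields(NamedTuple):
    """Hypothetical field set; the abstract does not enumerate the actual fields."""
    timestamp: str
    level: str
    component: str
    event: str     # message text with variable parts masked as <*>
    params: tuple  # the variable values that were masked out


# Assumed line layout: "<date> <time> <LEVEL> <component>: <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\S+ \S+) (?P<level>[A-Z]+) (?P<component>[\w.$]+): (?P<msg>.*)$"
)
# Assumption: standalone numbers are the only variable parts of a message.
PARAM_RE = re.compile(r"\b\d+\b")


def parse_line(line: str) -> Optional[LogFields]:
    """Split one log line into fields; return None if the layout does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    msg = m.group("msg")
    params = tuple(PARAM_RE.findall(msg))
    event = PARAM_RE.sub("<*>", msg)
    return LogFields(m.group("ts"), m.group("level"), m.group("component"), event, params)


if __name__ == "__main__":
    sample = "2024-01-01 12:00:00 INFO dfs.DataNode: Received block 42 of size 1024"
    print(parse_line(sample))
```

A detector that looks only at `event` would treat two lines with the same template as identical, even when `level`, `component`, or `params` differ. That discarded information is what the abstract argues can be critical to anomaly detection.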

Speed and Performance of Parserless and Unsupervised Anomaly Detection Methods on Software Logs

Natural Language Processing-based Model for Log Anomaly Detection

LogELECTRA: Self-supervised Anomaly Detection for Unstructured Logs

Log-based Anomaly Detection Without Log Parsing

LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs

RAPID: Training-free Retrieval-based Log Anomaly Detection with PLM considering Token-level information

Log-based Anomaly Detection of Enterprise Software: An Empirical Study

OneLog: Towards End-to-End Training in Software Log Anomaly Detection

A Taxonomy of Anomalies in Log Data

End-to-End AutoML for Unsupervised Log Anomaly Detection

An Anomaly Detection Approach of Part-of-Speech Log Sequence Via Population Based Training

FastLogAD: Log Anomaly Detection with Mask-Guided Pseudo Anomaly Generation and Discrimination

Log-based Anomaly Detection based on EVT Theory with feedback

LogCAE: an Approach for Log-based Anomaly Detection with Active Learning and Contrastive Learning

On the Effectiveness of Log Representation for Log-based Anomaly Detection

Reducing Events to Augment Log-based Anomaly Detection Models: An Empirical Study

AutoLog: A Log Sequence Synthesis Framework for Anomaly Detection

Recurrent Neural Network Language Models for Open Vocabulary Event-Level Cyber Anomaly Detection

TPLogAD: Unsupervised Log Anomaly Detection Based on Event Templates and Key Parameters

Robust and Transferable Log-based Anomaly Detection

EAD: effortless anomalies detection, a deep learning based approach for detecting outliers in English textual data