Abstract:Log-based anomaly detection has been widely studied in the literature as a way to increase the dependability of software-intensive systems. In reality, logs can be unstable due to changes made to the software during its evolution. This, in turn, degrades the performance of downstream log analysis activities, such as anomaly detection. The critical challenge in detecting anomalies on these unstable logs is the lack of information about the new logs, due to insufficient log data from new software versions. The application of Large Language Models (LLMs) to many software engineering tasks has revolutionized various domains. In this paper, we report on an experimental comparison of a fine-tuned LLM and alternative models for anomaly detection on unstable logs. The main motivation is that the pre-training of LLMs on vast datasets may enable a robust understanding of diverse patterns and contextual information, which can be leveraged to mitigate the data insufficiency issue in the context of software evolution. Our experimental results on the two-version dataset of LOGEVOL-Hadoop show that the fine-tuned LLM (GPT-3) fares slightly better than supervised baselines when evaluated on unstable logs. The difference between GPT-3 and other supervised approaches tends to become more significant as the degree of changes in log sequences increases. However, it is unclear whether the difference is practically significant in all cases. Lastly, our comparison of prompt engineering (with GPT-4) and fine-tuning reveals that the latter provides significantly superior performance on both stable and unstable logs, offering valuable insights into the effective utilization of LLMs in this domain.

LLMeLog: an Approach for Anomaly Detection Based on LLM-enriched Log Events

LogLLM: Log-based Anomaly Detection Using Large Language Models

Natural Language Processing-based Model for Log Anomaly Detection

Log Anomaly Detection method based on BERT model optimization

MLog: Mogrifier LSTM-based Log Anomaly Detection Approach Using Semantic Representation

Leveraging RAG-Enhanced Large Language Model for Semi-Supervised Log Anomaly Detection

Leveraging Large Language Models and BERT for Log Parsing and Anomaly Detection

BERT-Log: Anomaly Detection for System Logs Based on Pre-trained Language Model

Anomaly Detection Model for Log Based on LSTM Network and Variational Autoencoder

LogCAE: an Approach for Log-based Anomaly Detection with Active Learning and Contrastive Learning

An Anomaly Detection Approach of Part-of-Speech Log Sequence Via Population Based Training

AcLog: an Approach to Detecting Anomalies from System Logs with Active Learning

SemLog: A Semantics-based Approach for Anomaly Detection in Big Data System Logs

Log-based Anomaly Detection Without Log Parsing

MetaLog: Generalizable Cross-System Anomaly Detection from Logs with Meta-Learning.

LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs

Anomaly Detection on Unstable Logs with GPT Models

OMLog: Online Log Anomaly Detection for Evolving System with Meta-learning

AFALog: A General Augmentation Framework for Log-based Anomaly Detection with Active Learning

A LSTM-Based Anomaly Detection Model for Log Analysis

LogBERT: Log Anomaly Detection via BERT