Abstract:There is an increasing need for both governments and businesses to discover latent anomalous activities in unstructured publicly-available data, produced by professional agencies and the general public. Over the past two decades, consumers have begun to use smart devices to both take in and generate a large volume of open-source text-based data, providing the opportunity for latent anomaly analysis. However, real-time data acquisition, and the processing and interpretation of various types of unstructured data, remains a great challenge. Recent efforts have focused on artificial intelligence / machine learning (AI/ML) solutions to accelerate the labor-intensive linear collection, exploitation, and dissemination analysis cycle and enhance it with a data-driven rapid integration and correlation process of open-source data. This paper describes an Activity Based Intelligence framework for anomaly detection of open-source big data using AI/ML to perform semantic analysis. The proposed Anomaly Detection using Semantic Analysis Knowledge (ADUSAK) framework includes four layers: input layer, knowledge layer, reasoning layer, and graphical user interface (GUI)/output layer. The corresponding main technologies include: Information Extraction, Knowledge Graph (KG) construction, Semantic Reasoning, and Pattern Discovery. Finally, ADUSAK was verified by performing Emerging Events Detection, Fake News Detection, and Suspicious Network Analysis. The generalized ADUSAK framework can be easily extended to a wide range of applications by adjusting the data collection, modeling construction, and event alerting.

xSemAD: Explainable Semantic Anomaly Detection in Event Logs Using Sequence-to-Sequence Models

The Analysis of Online Event Streams: Predicting the Next Activity for Anomaly Detection

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

CausalConvLSTM: Semi-Supervised Log Anomaly Detection Through Sequence Modeling

An Anomaly Detection Approach of Part-of-Speech Log Sequence Via Population Based Training

Extracting Semantic Process Information from the Natural Language in Event Logs

SemLog: A Semantics-based Approach for Anomaly Detection in Big Data System Logs

Event log anomaly detection method based on auto-encoder and control flow

LogAnomaly: Unsupervised Detection of Sequential and Quantitative Anomalies in Unstructured Logs

A Semi-Supervised Approach for Abnormal Event Prediction on Large Operational Network Time-Series Data

Semi-supervised Log Pattern Detection and Exploration Using Event Concurrence and Contextual Information

Anomaly detection of unstructured big data via semantic analysis and dynamic knowledge graph construction

Prototypes as Explanation for Time Series Anomaly Detection

Semantic Anomaly Detection with Large Language Models

Anomaly Rule Detection in Sequence Data

Event-driven Weakly Supervised Video Anomaly Detection

A Semi-Supervised Learning Approach for Abnormal Event Prediction on Large Network Operation Time-Series Data

DABL: Detecting Semantic Anomalies in Business Processes Using Large Language Models

A Framework for Pattern Mining and Anomaly Detection in Multi-dimensional Time Series and Event Logs

AcME-AD: Accelerated Model Explanations for Anomaly Detection

From Explanation to Action: An End-to-End Human-in-the-loop Framework for Anomaly Reasoning and Management