Abstract:Recently, artificial intelligence (AI) techniques have been used to describe the characteristics of information, as they help in the process of data mining (DM) to analyze data and reveal rules and patterns. In DM, anomaly detection is an important area that helps discover hidden behavior within the data that is most vulnerable to attack. It also helps detect network intrusion. Algorithms such as hybrid K-mean array and sequential minimal optimization (SMO) rating can be used to improve the accuracy of the anomaly detection rate. This paper presents an anomaly detection model based on the machine learning (ML) technique. ML improves the detection rate, reduces the false-positive alarm rate, and is capable of enhancing the accuracy of intrusion classification. This study used a dataset known as network security-knowledge and data discovery (NSL-KDD) lab to evaluate a proposed hybrid ML technology. K-mean cluster and SMO were used for classification. In the study, the performance of the proposed anomaly detection was tested, and results showed that the use of K-mean and SMO enhances the rate of positive detection besides reducing the rate of false alarms and achieving a high accuracy at the same time. Moreover, the proposed algorithm outperformed recent and close work related to using similar variables and the environment by 14.48% and decreased false alarm probability (FAP) by (12%) in addition to giving a higher accuracy by 97.4%. These outcomes are attributed to the common algorithm providing an appropriate number of detectors to be generated with an acceptable accurate detection and a trivial false alarm probability (FAP). The proposed hybrid algorithm could be considered for anomaly detection in future data mining systems, where processing in real-time is highly likely to be reduced dramatically. The justification is that the hybrid algorithm can provide appropriate detectors numbers that can be generated with an acceptable detection accuracy and trivial FAP. Given to the low FAP, it is highly expected to reduce the time of the preprocessing and processing compared with the other algorithms.

KMDT: A Hybrid Cluster Approach for Anomaly Detection Using Big Data

New hybrid ensemble method for anomaly detection in data science

A Modified Hybrid Method Based on PSO, GA, and K-Means for Network Anomaly Detection

An Empirical Study on Anomaly Detection Using Density-based and Representative-based Clustering Algorithms

Detection of Cluster Anomalies With ML Techniques

Machine Learning-Based Anomaly Detection Using K-mean Array and Sequential Minimal Optimization

DeepHYDRA: Resource-Efficient Time-Series Anomaly Detection in Dynamically-Configured Systems

A Clustering Algorithm for Multi-Modal Heterogeneous Big Data With Abnormal Data

Anomaly Detection in Big Data

Extending Isolation Forest for Anomaly Detection in Big Data via K-Means

Developing Big Data Anomaly Dynamic and Static Detection Algorithms: AnomalyDSD Spark Package

A hybrid unsupervised clustering-based anomaly detection method

Multilevel Anomaly Detection for Mixed Data

Hybrid Intrusion Detection Method Based on CM-K-Means

A Hybrid Approach: Utilising Kmeans Clustering and Naive Bayes for IoT Anomaly Detection

A Data Mining Method Using Deep Learning for Anomaly Detection in Cloud Computing Environment

Hybrid Group Anomaly Detection for Sequence Data: Application to Trajectory Data Analytics

K-MDTSC: K-Multi-Dimensional Time-Series Clustering Algorithm

Hybrid Weighted K-Means Clustering and Artificial Neural Network for an Anomaly-Based Network Intrusion Detection System

Anomaly Detection for Industrial Big Data

Parallel power load abnormalities detection using fast density peak clustering with a hybrid canopy-K-means algorithm