Abstract: Attacks on network systems are becoming more common, and attack methods are increasingly sophisticated. Intrusion prevention technology has emerged as a natural result of the development of computer and network technology, and in recent years it has become a new focus of network security research. To protect confidential information on computer networks, the authors propose a semisupervised clustering intrusion detection algorithm. The paper first gives an overview of machine learning and then explains the theory of cluster analysis. Simulation experiments were carried out on 10,000 records, applying both the K-means clustering algorithm and the proposed semisupervised clustering algorithm to intrusion detection data. Different K values were selected, and three datasets were drawn from "kddcup.newtestdata_10_percent_corrected"; each was tested separately, and the average was taken as the test result. The simulation results show that the semisupervised clustering algorithm achieves a higher detection rate than the K-means clustering algorithm, and the false alarm rate is also reported as improved. The semisupervised algorithm therefore enhances the stability of the system and improves on the performance of the K-means algorithm to a certain extent. As K increases, the false alarm rate also increases; the detection rate peaks at K = 20, where it reaches 91.76% with a false alarm rate of 8.54%.
The detection rate of the authors' algorithm is significantly higher than that of the other two algorithms. Its false positive rate is slightly higher than that of K-means but lower than that of the other algorithm, which supports the superior performance of the proposed algorithm.
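The two reported metrics, and the averaging over three test sets, can be illustrated with a minimal sketch. It assumes the commonly used definitions, which the abstract does not spell out: detection rate is the fraction of attack records flagged as attacks, and false alarm rate is the fraction of normal records flagged as attacks. The function names `detection_metrics` and `average_metrics` and the toy labels are hypothetical and are not taken from the paper.

```python
def detection_metrics(y_true, y_pred):
    """Return (detection_rate, false_alarm_rate) for binary labels.

    Assumed encoding: 1 = attack, 0 = normal.
    """
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    attacks = sum(1 for t in y_true if t == 1)
    normals = sum(1 for t in y_true if t == 0)
    # Attack records that the detector flagged as attacks.
    detected = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    # Normal records that the detector wrongly flagged as attacks.
    false_alarms = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)

    detection_rate = detected / attacks if attacks else 0.0
    false_alarm_rate = false_alarms / normals if normals else 0.0
    return detection_rate, false_alarm_rate


def average_metrics(runs):
    """Average (detection_rate, false_alarm_rate) over several test sets.

    `runs` is a non-empty list of (y_true, y_pred) pairs, one per test set.
    """
    results = [detection_metrics(y_true, y_pred) for y_true, y_pred in runs]
    n = len(results)
    mean_dr = sum(dr for dr, _ in results) / n
    mean_far = sum(far for _, far in results) / n
    return mean_dr, mean_far


if __name__ == "__main__":
    # Toy labels for three hypothetical test sets (not data from the paper).
    runs = [
        ([1, 1, 1, 1, 0, 0, 0, 0], [1, 1, 1, 0, 1, 0, 0, 0]),
        ([1, 1, 0, 0], [1, 1, 0, 0]),
        ([1, 1, 0, 0], [1, 0, 1, 0]),
    ]
    print(average_metrics(runs))
```

Averaging per-test-set rates, as done here, weights each test set equally regardless of size; pooling all records before computing the rates would be a different design choice.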
