A Knowledge Graph Completion Algorithm Based on the Fusion of Neighborhood Features and vBiLSTM Encoding for Network Security

Wenbo Zhang, Mengxuan Wang, Guangjie Han, Yongxin Feng, Xiaobo Tan
DOI: https://doi.org/10.3390/electronics13091661
IF: 2.9
2024-04-26
Electronics
Abstract: Knowledge graphs in the field of network security can integrate diverse, heterogeneous, and fragmented network security data, further explore the relationships between data, and provide support for deep analysis. Currently, the security information in network security knowledge graphs is sparse. The limited information provided by traditional text encoding models leads to insufficient reasoning ability, greatly restricting the development of this field. Starting from text encoding, this paper first addresses the inadequate capabilities of traditional models with the help of a deep learning model: it designs a vBiLSTM model, a combination of word2vec and BiLSTM, to process network security texts. The word vector model retains the semantic information in entities and extracts key features; the processed data are then fed into the BiLSTM network, which extracts higher-level features that better capture and express deeper meanings. This design significantly enhances the understanding and expression of complex semantics in long sentences. The final feature vectors are then input into the KGC-N model, which combines them with graph structure information to fuse forward and reverse neighborhood features and then uses a Transformer decoder to decode predictions and complete missing information in the network security knowledge graph. Comparison with other models on evaluation metrics such as MR and MRR demonstrates that the proposed method effectively improves performance on completion tasks and strengthens the comprehension of complex relations, thereby enhancing accuracy and efficiency when completing knowledge graphs.
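The abstract names MR (mean rank) and MRR (mean reciprocal rank) as evaluation metrics without defining them. The following is a minimal sketch of how these two metrics are commonly computed for link prediction; it is not the paper's evaluation code. The function names, the candidate labels, and the tie-handling rule (ties resolved in favor of the target) are assumptions made for illustration.

```python
def rank_of_target(scores, target):
    """Return the 1-based rank of `target` among candidates sorted by descending score.

    `scores` maps each candidate entity to a model score (hypothetical values).
    Assumption: ties are resolved optimistically, i.e. only strictly higher
    scores push the target down the ranking.
    """
    target_score = scores[target]
    return 1 + sum(1 for s in scores.values() if s > target_score)


def mean_rank(ranks):
    """MR: average rank of the correct entity (lower is better)."""
    return sum(ranks) / len(ranks)


def mean_reciprocal_rank(ranks):
    """MRR: average of 1/rank of the correct entity (higher is better, at most 1)."""
    return sum(1.0 / r for r in ranks) / len(ranks)


# Hypothetical scores for one test triple whose correct tail is "vuln_y".
example_scores = {"vuln_x": 0.9, "vuln_y": 0.5, "vuln_z": 0.1}
example_rank = rank_of_target(example_scores, "vuln_y")

# Hypothetical ranks collected over three test triples.
example_ranks = [1, 2, 4]
example_mr = mean_rank(example_ranks)
example_mrr = mean_reciprocal_rank(example_ranks)
```

MR is sensitive to a few badly ranked triples, whereas MRR is dominated by the top of the ranking, which is why the two are usually reported together.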
Engineering, Electrical & Electronic; Computer Science, Information Systems; Physics, Applied
What problem does this paper attempt to address?
The paper attempts to address information sparsity in the cybersecurity knowledge graph. Specifically, it identifies three problems:

1. **Limitations of traditional text encoding models**: Existing text encoding models handle the complex semantics of the cybersecurity field poorly, which limits the reasoning capabilities of the knowledge graph.
2. **Sparsity of the cybersecurity knowledge graph**: The knowledge graph in the field of cybersecurity is sparse because the field is highly specialized and its data are fragmented, which restricts its use in advanced applications such as question-answering systems.
3. **Insufficiency of existing knowledge graph completion techniques**: Although existing knowledge graph completion techniques (such as semantic matching models based on tensor/matrix decomposition, neural network models, and translation models) can enrich the knowledge graph, their applicability in the field of cybersecurity is limited.

To address these issues, the paper proposes a cybersecurity knowledge graph completion algorithm (KGC-N) based on neighborhood feature fusion and vBiLSTM encoding, which aims to improve the completeness and accuracy of the knowledge graph through two components:

- **vBiLSTM model**: Combines word2vec and BiLSTM to extract word-level features from entity descriptions and then higher-level features from long sentences, enhancing the understanding and expression of complex semantics.
- **KGC-N model**: Uses graph structure information to fuse features from the forward and reverse neighborhoods and predicts missing information with a Transformer decoder, with the aim of mitigating the information sparsity problem in the cybersecurity knowledge graph.

Through these methods, the paper aims to improve performance on the knowledge graph completion task, enhance the understanding of complex relationships, and thereby improve the accuracy and efficiency of knowledge graph completion.
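To make the idea of fusing forward and reverse neighborhood features concrete, here is a minimal sketch in plain Python. It is not the KGC-N implementation: the summary above does not specify the fusion operator, so mean aggregation followed by concatenation is an assumption, relation types are ignored, and the entity names, triples, and embedding values are hypothetical.

```python
def mean_vector(vectors, dim):
    """Element-wise mean of a list of vectors; a zero vector if the list is empty."""
    if not vectors:
        return [0.0] * dim
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def fuse_neighborhoods(entity, triples, embeddings):
    """Concatenate an entity's own vector with its forward and reverse neighborhood means.

    Forward neighbors are tails of triples in which `entity` is the head;
    reverse neighbors are heads of triples in which `entity` is the tail.
    Assumption: mean pooling plus concatenation stands in for the paper's
    fusion step, which is not described in this summary.
    """
    dim = len(embeddings[entity])
    forward = [embeddings[t] for h, _, t in triples if h == entity]
    reverse = [embeddings[h] for h, _, t in triples if t == entity]
    return embeddings[entity] + mean_vector(forward, dim) + mean_vector(reverse, dim)


# Hypothetical toy graph: (head, relation, tail) triples and 2-d entity vectors.
toy_triples = [
    ("malware_x", "exploits", "vuln_y"),
    ("vuln_y", "affects", "product_z"),
]
toy_embeddings = {
    "malware_x": [1.0, 0.0],
    "vuln_y": [0.0, 1.0],
    "product_z": [1.0, 1.0],
}

fused_vuln = fuse_neighborhoods("vuln_y", toy_triples, toy_embeddings)
fused_malware = fuse_neighborhoods("malware_x", toy_triples, toy_embeddings)
```

In this sketch the entity vectors would come from the text encoder (the vBiLSTM output in the paper's pipeline), and the fused vector would be passed on to the decoder. Keeping the forward and reverse parts separate preserves the direction of each relation, which a single undirected average would lose.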