Improving Graph Convolutional Network with Learnable Edge Weights and Edge-Node Co-Embedding for Graph Anomaly Detection

Xiao Tan,Jianfeng Yang,Zhengang Zhao,Jinsheng Xiao,Chengwang Li
DOI: https://doi.org/10.3390/s24082591
IF: 3.9
2024-04-19
Sensors
Abstract:The era of Industry 4.0 is gradually transforming our society into a data-driven one, which can help us uncover valuable information from accumulated data, thereby improving the level of social governance. The detection of anomalies, is crucial for maintaining societal trust and fairness, yet it poses significant challenges due to the ubiquity of anomalies and the difficulty in identifying them accurately. This paper aims to enhance the performance of the current Graph Convolutional Network (GCN)-based Graph Anomaly Detection (GAD) algorithm on datasets with extremely low proportions of anomalous labels. This goal is achieved through modifying the GCN network structure and conducting feature extraction, thus fully utilizing three types of information in the graph: node label information, node feature information, and edge information. Firstly, we theoretically demonstrate the relationship between label propagation and feature convolution, indicating that the Label Propagation Algorithm (LPA) can serve as a regularization penalty term for GCN, aiding in training and enabling learnable edge weights, providing a basis for incorporating node label information into GCN networks. Secondly, we introduce a method to aggregate node and edge features, thereby incorporating edge information into GCN networks. Finally, we design different GCN trainable weights for node features and co-embedding features. This design allows different features to be projected into different spaces, greatly enhancing model expressiveness. Experimental results on the DGraph dataset demonstrate superior AUC performance compared to baseline models, highlighting the feasibility and efficacy of the proposed approach in addressing GAD tasks in the scene with extremely low proportions of anomalous data.
engineering, electrical & electronic,chemistry, analytical,instruments & instrumentation
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the performance of graph anomaly detection (GAD) algorithms based on graph convolutional networks (GCN) in datasets with an extremely low proportion of abnormal labels. Specifically, by modifying the GCN network structure and performing feature extraction, the paper makes full use of three kinds of information in the graph: node label information, node feature information, and edge information, to enhance the performance of the model in dealing with datasets with an extremely low proportion of abnormal data. ### Main contributions of the paper 1. **Theoretical analysis and introduction of learnable edge weights**: - The author theoretically analyzes the relationship between the feature convolution process of GCN and the label propagation process of LPA (Label Propagation Algorithm), and proves that using LPA as an auxiliary regularization term in the GCN loss function can train edge weights, thereby improving the performance of GCN. - Based on this theoretical analysis, the paper introduces learnable edge weights and uses LPA to train them, modifies the convolution process of GCN, and makes full use of node label information. 2. **Utilization of heterogeneous edge information**: - Considering the heterogeneity of edges, the paper analyzes the impact of edge information on the GAD task. First, it embeds the direction and attribute information of edges to generate edge features, and then aggregates these edge features with node features to generate co - embedded features. This method performs feature extraction and makes full use of the information on the edges. 3. **Independent training of node features and co - embedded features**: - Based on the co - embedded features, different GCN trainable weights are designed for node features and co - embedded features respectively, and they are projected into different feature spaces, which greatly enhances the generalization ability of the model. ### Experimental results The paper conducted experiments on the publicly available DGraph dataset, selected MLP, Node2Vec, LPA, GCN, GAT, GATv2, GraphSAGE, and UniMP as baseline models, and used AUC (Area Under the ROC Curve) as an evaluation metric. The experimental results show that the proposed method outperforms the baseline models in AUC performance, especially in datasets with an extremely low proportion of abnormal nodes. ### Conclusion The paper proposes a unified model that can effectively address three challenges in graphs: the utilization of node labels and features, the utilization of edge information, and the problem of an extremely low proportion of abnormal data. By introducing learnable edge weights, utilizing edge information, and designing independent GCN weights, this model performs well in graph anomaly detection tasks and provides valuable insights for applications dealing with sparse data.