Robust Graph Representation Learning for Local Corruption Recovery

Bingxin Zhou,Yuanhong Jiang,Yu Guang Wang,Jingwei Liang,Junbin Gao,Shirui Pan,Xiaoqun Zhang
2023-08-11
Abstract:The performance of graph representation learning is affected by the quality of graph input. While existing research usually pursues a globally smoothed graph embedding, we believe the rarely observed anomalies are as well harmful to an accurate prediction. This work establishes a graph learning scheme that automatically detects (locally) corrupted feature attributes and recovers robust embedding for prediction tasks. The detection operation leverages a graph autoencoder, which does not make any assumptions about the distribution of the local corruptions. It pinpoints the positions of the anomalous node attributes in an unbiased mask matrix, where robust estimations are recovered with sparsity promoting regularizer. The optimizer approaches a new embedding that is sparse in the framelet domain and conditionally close to input observations. Extensive experiments are provided to validate our proposed model can recover a robust graph representation from black-box poisoning and achieve excellent performance.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to recover a robust graph representation to improve the accuracy of prediction tasks when the feature attributes of the input graph are locally corrupted (i.e., there are local anomalies or noise) in graph representation learning. Specifically, the paper focuses on how to automatically detect these corrupted feature attributes and recover a robust embedding representation when there are a small number of hidden local corruptions in the input data in graph neural networks (GNNs). These problems are crucial for improving the performance of graph representation learning because even a small number of local corruptions can have a negative impact on the model's prediction ability. ### Main contributions of the paper 1. **Proposed a "detect - recover" strategy**: The paper proposed a model named MAGnet. By combining three key components: mask matrix generation, local optimization based on the alternating direction method of multipliers (ADMM), and graph representation learning, it protects graph representation learning from the influence of a small number of hidden corruptions in the input node features. 2. **Automatically detect suspicious attributes**: MAGnet uses an unsupervised encoder module to first detect suspicious feature attributes and generate a mask matrix to mark these positions. This process does not require prior knowledge of the distribution of abnormal attributes. 3. **Robust reconstruction based on sparse regularization**: Guide the robust reconstruction of the initial input through the constructed mask matrix, and use sparse - promoting regularization techniques to ensure that the focus is mainly on the key (abnormal) points when recovering the signal. 4. **Multiscale frame transformation**: In order to ensure that the expression can reconstruct irregular attributes, preserve low - energy patterns and eliminate global noise, the regularization operation is carried out on the multiscale frame transformation coefficients. 5. **Experimental verification**: Through extensive experiments, it is verified that the proposed model can recover a robust graph representation from black - box poisoning attacks and achieves excellent performance on multiple benchmark datasets. ### Key technologies of the solution - **Mask matrix generation**: Detect local corruptions in the graph through the graph auto - encoder (GAE) and generate a mask matrix. - **Local robust optimization**: Use the ADMM algorithm for local optimization to recover the damaged feature representation. - **Graph representation learning**: Continuously smooth the hidden feature representation through the graph convolution layer for the prediction tasks of GNNs. ### Experimental results The paper verifies the effectiveness of MAGnet through a variety of experiments, including comparison experiments with existing denoising methods, analysis of the influence of different parameter selections on the model performance, and visual display of the effects of the two learning modules (mask matrix generation and ADMM optimization). The experimental results show that MAGnet can significantly improve the robustness and prediction performance of graph representation learning when dealing with locally corrupted graph data. ### Conclusion The paper proposes a new framework, MAGnet, aiming to solve the problem of locally corrupted feature attributes in graph representation learning. By automatically detecting and recovering the damaged features, MAGnet can provide more accurate and robust graph representations in the presence of local corruptions, thereby improving the performance of prediction tasks.