Rethinking Reconstruction-based Graph-Level Anomaly Detection: Limitations and a Simple Remedy

Sunwoo Kim, Soo Yong Lee, Fanchen Bu, Shinhwan Kang, Kyungho Kim, Jaemin Yoo, Kijung Shin
2024-10-27
Abstract: Graph autoencoders (Graph-AEs) learn representations of given graphs by aiming to accurately reconstruct them. A notable application of Graph-AEs is graph-level anomaly detection (GLAD), whose objective is to identify graphs with anomalous topological structures and/or node features compared to the majority of the graph population. Graph-AEs for GLAD regard graphs with a high mean reconstruction error (i.e., the mean of the errors from all node pairs and/or nodes) as anomalies. That is, these methods rest on the assumption that they reconstruct graphs with characteristics similar to the majority more accurately. We, however, report non-trivial counter-examples, a phenomenon we call reconstruction flip, and highlight the limitations of the existing Graph-AE-based GLAD methods. Specifically, we empirically and theoretically investigate when this assumption holds and when it fails. Through our analyses, we further argue that, while the reconstruction errors for a given graph are effective features for GLAD, leveraging multifaceted summaries of the reconstruction errors, beyond just the mean, can further strengthen the features. Thus, we propose a novel and simple GLAD method, named MUSE. The key innovation of MUSE involves taking multifaceted summaries of reconstruction errors as graph features for GLAD. This surprisingly simple method obtains SOTA performance in GLAD, performing best overall among 14 methods across 10 datasets.
Machine Learning, Social and Information Networks
What problem does this paper attempt to address?
This paper addresses limitations of graph-level anomaly detection (GLAD) methods based on graph autoencoders (Graph-AEs). Existing methods assume that a Graph-AE reconstructs graphs similar to the training graphs more accurately than dissimilar ones. Through empirical and theoretical analysis, the authors show that this assumption does not always hold, and they call the failure case "reconstruction flip".

#### Summary of main problems:

1. **Reconstruction flip phenomenon**: Some graphs that differ from the training graphs nevertheless have lower reconstruction errors. Existing Graph-AE-based methods may therefore fail to detect anomalous graphs.
2. **Limitations of existing methods**: Many existing GLAD methods rely on the mean reconstruction error to identify anomalous graphs. This criterion can fail in some cases, leaving anomalous graphs undetected.

To address these problems, the authors propose a new GLAD method, MUSE (Multifaceted Summarization of Reconstruction Errors). MUSE uses multifaceted summaries of the reconstruction errors as graph features, which strengthens the feature representation and improves anomaly detection.

### Specific contributions of the paper:

1. **Analysis and implications**:
   - The authors reveal the reconstruction flip phenomenon through empirical and theoretical analysis and explain the conditions under which it occurs.
   - They highlight the limitations of relying solely on the mean reconstruction error and propose multifaceted summaries of the reconstruction errors as more effective features.
2. **Effective method**:
   - They propose MUSE, which represents each graph by multifaceted summaries of its reconstruction errors.
   - MUSE achieves state-of-the-art (SOTA) performance, performing best overall among 14 methods across 10 datasets.
3. **Extensive experimental verification**:
   - They evaluate MUSE in extensive experiments, covering its performance on different datasets and its robustness to training set contamination.
   - The results show that MUSE effectively distinguishes anomalous graphs from normal ones and that its key components are crucial to its performance.

### Conclusion:

By analyzing the limitations of existing GLAD methods in depth, the paper proposes MUSE, which addresses the problems that Graph-AEs can encounter in anomaly detection, and verifies its effectiveness through extensive experiments.
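
The idea of summarizing reconstruction errors with more than the mean can be illustrated with a minimal sketch. The choice of summaries here (mean and standard deviation), the function name `summarize_errors`, and the error values are assumptions made for illustration only; the text above does not specify which summaries MUSE uses or how the errors are computed.

```python
from statistics import fmean, pstdev


def summarize_errors(errors):
    """Summarize one graph's per-element reconstruction errors.

    Returns (mean, standard deviation). Using these two statistics is an
    assumption for this sketch, not a statement of the paper's exact design.
    """
    return (fmean(errors), pstdev(errors))


# Hypothetical per-node reconstruction errors for two graphs.
graph_a = [0.2, 0.2, 0.2, 0.2]  # errors spread evenly across nodes
graph_b = [0.0, 0.0, 0.4, 0.4]  # errors concentrated on half of the nodes

feat_a = summarize_errors(graph_a)
feat_b = summarize_errors(graph_b)

# Both graphs have the same mean error, so a mean-only score cannot
# tell them apart, while the second summary differs between them.
print("graph_a features:", feat_a)
print("graph_b features:", feat_b)
```

In this toy case the two graphs share the same mean error but differ in how the errors are spread, so a detector fed both summaries has a feature that a mean-only score lacks. In a full pipeline, such per-graph feature vectors would be passed to a downstream anomaly detector, which this sketch omits.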