Data Augmentation for Supervised Graph Outlier Detection via Latent Diffusion Models

Kay Liu,Hengrui Zhang,Ziqing Hu,Fangxin Wang,Philip S. Yu
2024-11-23
Abstract:A fundamental challenge confronting supervised graph outlier detection algorithms is the prevalent problem of class imbalance, where the scarcity of outlier instances compared to normal instances often results in suboptimal performance. Recently, generative models, especially diffusion models, have demonstrated their efficacy in synthesizing high-fidelity images. Despite their extraordinary generation quality, their potential in data augmentation for supervised graph outlier detection remains largely underexplored. To bridge this gap, we introduce GODM, a novel data augmentation for mitigating class imbalance in supervised Graph Outlier detection via latent Diffusion Models. Extensive experiments conducted on multiple datasets substantiate the effectiveness and efficiency of GODM. The case study further demonstrated the generation quality of our synthetic data. To foster accessibility and reproducibility, we encapsulate GODM into a plug-and-play package and release it at PyPI: <a class="link-external link-https" href="https://pypi.org/project/godm/" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Social and Information Networks
What problem does this paper attempt to address?