Homophily-Driven Sanitation View for Robust Graph Contrastive Learning

Yulin Zhu, Xing Ai, Yevgeniy Vorobeychik, Kai Zhou
2023-07-24
Abstract: We investigate adversarial robustness of unsupervised Graph Contrastive Learning (GCL) against structural attacks. First, we provide a comprehensive empirical and theoretical analysis of existing attacks, revealing how and why they downgrade the performance of GCL. Inspired by our analytic results, we present a robust GCL framework that integrates a homophily-driven sanitation view, which can be learned jointly with contrastive learning. A key challenge this poses, however, is the non-differentiable nature of the sanitation objective. To address this challenge, we propose a series of techniques to enable gradient-based end-to-end robust GCL. Moreover, we develop a fully unsupervised hyperparameter tuning method which, unlike prior approaches, does not require knowledge of node labels. We conduct extensive experiments to evaluate the performance of our proposed model, GCHS (Graph Contrastive Learning with Homophily-driven Sanitation View), against two state-of-the-art structural attacks on GCL. Our results demonstrate that GCHS consistently outperforms all state-of-the-art baselines in terms of the quality of generated node embeddings as well as performance on two important downstream tasks.
Machine Learning, Social and Information Networks
What problem does this paper attempt to address?
### Problems Addressed by the Paper

The paper primarily explores the robustness of unsupervised Graph Contrastive Learning (GCL) under structural attacks. Specifically:

1. **Problem Background**:
   - Graph representation learning is widely used in various fields.
   - In practice, due to the scarcity of labeled data, unsupervised graph learning methods have gained significant attention.
   - Among them, Graph Contrastive Learning (GCL) has shown excellent performance in various downstream tasks as an efficient unsupervised method.
   - However, existing research indicates that GCL is susceptible to structural attacks, leading to a decline in the quality of node embeddings.
2. **Research Objectives**:
   - Design a robust GCL framework that maintains good performance in the face of structural attacks.
   - Existing defense methods for GNN models mostly rely on node label information, while GCL is a completely unsupervised method, making these methods inapplicable.
3. **Main Contributions**:
   - Propose a homophily-driven sanitation view to enhance the robustness of unsupervised graph contrastive learning.
   - Develop a series of techniques to achieve end-to-end gradient training, addressing non-differentiable issues.
   - Introduce a pseudo normalized cut loss that adjusts key hyperparameters without requiring node labels.
   - Experimental results show that the proposed GCHS model performs excellently under different attack settings, significantly outperforming existing baseline methods.

### Summary

The paper aims to improve the robustness of unsupervised graph contrastive learning against structural attacks by introducing a homophily-driven sanitation view and developing a series of techniques to achieve this goal. Experimental results demonstrate the effectiveness of the proposed method.
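
The "pseudo normalized cut loss" named among the contributions is not spelled out in this summary. As a rough illustration only, the sketch below computes the standard normalized cut of a graph partition, which needs cluster assignments but no ground-truth labels. The function name `normalized_cut`, the edge-list input, and the idea that the assignments come from clustering the learned embeddings are assumptions made for this example, not the paper's formulation.

```python
from collections import defaultdict


def normalized_cut(edges, labels):
    """Standard normalized cut of a partition of an undirected, unweighted graph.

    edges:  iterable of (u, v) node pairs, each undirected edge listed once.
    labels: mapping node -> cluster id. Here these stand in for hypothetical
            pseudo-labels (e.g. obtained by clustering node embeddings).
    Returns the sum over clusters of cut(cluster, rest) / volume(cluster).
    """
    degree = defaultdict(float)  # per-node degree
    cut = defaultdict(float)     # per-cluster count of boundary edges
    for u, v in edges:
        degree[u] += 1.0
        degree[v] += 1.0
        if labels[u] != labels[v]:
            # A boundary edge leaves both of the clusters it touches.
            cut[labels[u]] += 1.0
            cut[labels[v]] += 1.0

    volume = defaultdict(float)  # per-cluster sum of node degrees
    for node, d in degree.items():
        volume[labels[node]] += d

    return sum(cut[k] / volume[k] for k in list(volume) if volume[k] > 0)


# Toy graph: two triangles joined by a single bridge edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]

# Assignment that follows the two triangles versus one that ignores structure.
aligned = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
scrambled = {0: 0, 1: 1, 2: 0, 3: 1, 4: 0, 5: 1}

score_aligned = normalized_cut(edges, aligned)
score_scrambled = normalized_cut(edges, scrambled)
```

A lower value means the clusters are better separated by the graph structure. On the toy graph the assignment that follows the two triangles scores lower than the scrambled one, which is the kind of label-free signal that could, in principle, be used to compare hyperparameter settings.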