GPS: Graph Contrastive Learning via Multi-scale Augmented Views from Adversarial Pooling

Wei Ju, Yiyang Gu, Zhengyang Mao, Ziyue Qiao, Yifang Qin, Xiao Luo, Hui Xiong, Ming Zhang
2024-01-29
Abstract: Self-supervised graph representation learning has recently shown considerable promise in a range of fields, including bioinformatics and social networks. A large number of graph contrastive learning approaches have shown promising performance for representation learning on graphs, which train models by maximizing agreement between original graphs and their augmented views (i.e., positive views). Unfortunately, these methods usually involve pre-defined augmentation strategies based on the knowledge of human experts. Moreover, these strategies may fail to generate challenging positive views to provide sufficient supervision signals. In this paper, we present a novel approach named Graph Pooling ContraSt (GPS) to address these issues. Motivated by the fact that graph pooling can adaptively coarsen the graph with the removal of redundancy, we rethink graph pooling and leverage it to automatically generate multi-scale positive views with varying emphasis on providing challenging positives and preserving semantics, i.e., strongly-augmented view and weakly-augmented view. Then, we incorporate both views into a joint contrastive learning framework with similarity learning and consistency learning, where our pooling module is adversarially trained with respect to the encoder for adversarial robustness. Experiments on twelve datasets on both graph classification and transfer learning tasks verify the superiority of the proposed method over its counterparts.
Machine Learning, Artificial Intelligence, Social and Information Networks
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

The paper addresses two key issues in self-supervised graph representation learning:

1. **Manually Designed Augmentation Strategies**: Existing graph contrastive learning methods often rely on manually designed augmentation strategies (such as node deletion and edge perturbation). Choosing among them requires expert knowledge, and a strategy that works on one dataset may not suit another. In an unfamiliar domain, finding an appropriate strategy takes extensive trial-and-error experimentation, which is inefficient.
2. **Difficulty in Generating Challenging Positive Samples**: Manually designed augmentation strategies may fail to generate sufficiently challenging positive samples and therefore fail to provide adequate supervision signals. If the augmented views are too similar to the original samples, training may lead to representation collapse.

To address these issues, the authors propose a new method called **GPS** (Graph Pooling ContraSt). It uses graph pooling to adaptively generate multi-scale positive views, removing redundant information at different levels so that the augmented views are challenging yet semantically meaningful. Specifically, GPS produces two kinds of views:

- **Weakly Augmented Views**: Focus on retaining semantic information.
- **Strongly Augmented Views**: Focus on removing redundant information to generate challenging positive samples.

GPS incorporates both views into a joint contrastive framework that combines similarity learning and consistency learning, and it trains the pooling module adversarially with respect to the encoder to improve robustness. Experiments on twelve datasets, covering graph classification and transfer learning, show that GPS outperforms existing baseline methods.
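To make the idea of pooling-based views more concrete, here is a minimal NumPy sketch, not the authors' implementation. It assumes score-based top-k node selection as a stand-in for the pooling module, a mean readout in place of a GNN encoder, and an InfoNCE-style (NT-Xent) loss for the similarity term. The function names, the scoring vector `w`, and the pooling ratios 0.8 and 0.4 are all hypothetical. The consistency term and the adversarial update of the pooling module are omitted because the summary does not specify them.

```python
import numpy as np

def topk_pool(x, adj, scores, ratio):
    """Keep the ceil(ratio * n) highest-scoring nodes and the edges among them."""
    n = x.shape[0]
    k = max(1, int(np.ceil(ratio * n)))
    keep = np.sort(np.argsort(-scores, kind="stable")[:k])
    return x[keep], adj[np.ix_(keep, keep)]

def readout(x):
    """Mean-pool node features into one graph-level vector (stand-in for an encoder)."""
    return x.mean(axis=0)

def nt_xent(z_a, z_b, temperature=0.5):
    """InfoNCE-style loss: row i of z_a is the positive for row i of z_b."""
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    logits = z_a @ z_b.T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))

rng = np.random.default_rng(0)
w = rng.normal(size=8)  # hypothetical fixed scoring vector; GPS learns its pooling instead

orig, weak, strong = [], [], []
for _ in range(4):  # a toy batch of four random graphs
    n = int(rng.integers(6, 12))
    x = rng.normal(size=(n, 8))
    adj = (rng.random((n, n)) < 0.3).astype(float)
    adj = np.maximum(adj, adj.T)  # make the graph undirected
    scores = x @ w
    x_weak, _ = topk_pool(x, adj, scores, ratio=0.8)    # mild coarsening: keeps semantics
    x_strong, _ = topk_pool(x, adj, scores, ratio=0.4)  # aggressive coarsening: harder positive
    orig.append(readout(x))
    weak.append(readout(x_weak))
    strong.append(readout(x_strong))

orig, weak, strong = map(np.stack, (orig, weak, strong))
# Similarity term only: each original graph should agree with both of its views.
loss = nt_xent(orig, weak) + nt_xent(orig, strong)
print(f"similarity loss on toy batch: {loss:.4f}")
```

In this sketch the only difference between the two views is the pooling ratio: the lower ratio discards more nodes, so its view sits further from the original graph and acts as the harder positive.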