Graph Contrastive Learning with Cohesive Subgraph Awareness

Yucheng Wu,Leye Wang,Xiao Han,Han-Jia Ye
DOI: https://doi.org/10.1145/3589334.3645470
2024-02-22
Abstract:Graph contrastive learning (GCL) has emerged as a state-of-the-art strategy for learning representations of diverse graphs including social and biomedical networks. GCL widely uses stochastic graph topology augmentation, such as uniform node dropping, to generate augmented graphs. However, such stochastic augmentations may severely damage the intrinsic properties of a graph and deteriorate the following representation learning process. We argue that incorporating an awareness of cohesive subgraphs during the graph augmentation and learning processes has the potential to enhance GCL performance. To this end, we propose a novel unified framework called CTAug, to seamlessly integrate cohesion awareness into various existing GCL mechanisms. In particular, CTAug comprises two specialized modules: topology augmentation enhancement and graph learning enhancement. The former module generates augmented graphs that carefully preserve cohesion properties, while the latter module bolsters the graph encoder's ability to discern subgraph patterns. Theoretical analysis shows that CTAug can strictly improve existing GCL mechanisms. Empirical experiments verify that CTAug can achieve state-of-the-art performance for graph representation learning, especially for graphs with high degrees. The code is available at
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper primarily addresses several key issues in Graph Contrastive Learning (GCL): 1. **Improvement of Graph Topology Augmentation Strategies**: - Existing graph topology augmentation methods (such as node deletion, edge removal, etc.) may severely disrupt the intrinsic properties of the graph, thereby affecting the effectiveness of subsequent representation learning. This paper proposes a new unified framework, CTAug, which introduces the concept of cohesive subgraphs to guide the graph augmentation process, thereby better preserving important structural features of the graph. 2. **Unified Framework Design**: - Most existing studies focus on designing specific GCL mechanisms without a general framework that can integrate graph properties into various GCL mechanisms. The CTAug framework aims to enhance existing GCL mechanisms through cohesive subgraph awareness, improving the performance of graph representation learning. 3. **More Expressive Network Models**: - Traditional Graph Neural Networks (GNNs) have limitations in capturing subgraph properties. To address this, the authors propose an Original-Graph-oriented Graph Substructure Network (O-GSN) to enhance the ability of GNNs to capture subgraph structural information. Through the above improvements, CTAug can significantly enhance the performance of existing methods in graph contrastive learning, especially in high-density graphs.