Abstract:In graph classification, the out-of-distribution (OOD) issue is attracting great attention. To address this issue, a prevailing idea is to learn stable features, on the assumption that they are substructures causally determining the label and that their relationship with the label is stable to the distributional uncertainty. In contrast, the complementary parts termed environmental features, fail to determine the label solely and hold varying relationships with the label, thus ascribed to the possible reason for the distribution shift. Existing generalization efforts mainly encourage the model's insensitivity to environmental features. While the sensitivity to stable features is promising to distinguish the crucial clues from the distributional uncertainty but largely unexplored. A paradigm of simultaneously exploring the sensitivity to stable features and insensitivity to environmental features is until-now lacking to achieve the generalizable graph classification, to the best of our knowledge. In this work, we conjecture that generalizable models should be sensitive to stable features and insensitive to environmental features. To this end, we propose a simple yet effective augmentation strategy for graph classification: Equivariant and Invariant Cross-Data Augmentation (EI-CDA). By employing equivariance, given a pair of input graphs, we first estimate their stable and environmental features via masks. Then we linearly mix the estimated stable features of two graphs and encourage the model predictions faithfully reflect their mixed semantics. Meanwhile, by using invariance, we swap the estimated environmental features of two graphs and keep the predictions invariant. This simple yet effective strategy endows the models with both sensitivity to stable features and insensitivity to environmental features. Extensive experiments show that EI-CDA significantly improves performance and outperforms leading baselines. Our codes are available at: https://github.com/yongduosui/EI-GNN.

Rationalizing Graph Neural Networks with Data Augmentation

Cooperative Classification and Rationalization for Graph Generalization

Federated Self-Explaining GNNs with Anti-shortcut Augmentations

Knowledge Distillation Improves Graph Structure Augmentation for Graph Neural Networks

Graph Data Augmentation for Node Classification

Discovering Invariant Rationales for Graph Neural Networks

Towards data augmentation in graph neural network: An overview and evaluation

Self-attentive Rationalization for Graph Contrastive Learning

Self-attentive Rationalization for Interpretable Graph Contrastive Learning

A Simple Data Augmentation for Graph Classification: A Perspective of Equivariance and Invariance

Data Augmentation in Graph Neural Networks: The Role of Generated Synthetic Graphs

Neural Axiom Network for Knowledge Graph Reasoning

Robust Optimization as Data Augmentation for Large-scale Graphs

Efficient Topology-aware Data Augmentation for High-Degree Graph Neural Networks

RAGraph: A General Retrieval-Augmented Graph Learning Framework

GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification

Reinforcement Learning Enhanced Explainer for Graph Neural Networks

Scalable Graph Neural Networks for Heterogeneous Graphs

Local Augmentation for Graph Neural Networks

Asymmetric augmented paradigm-based graph neural architecture search

Counterfactual Data Augmentation with Denoising Diffusion for Graph Anomaly Detection