Abstract:The increasing prevalence of large-scale graphs poses a significant challenge for graph neural network training, attributed to their substantial computational requirements. In response, graph condensation (GC) emerges as a promising data-centric solution aiming to substitute the large graph with a small yet informative condensed graph to facilitate data-efficient GNN training. However, existing GC methods suffer from intricate optimization processes, necessitating excessive computing resources. In this paper, we revisit existing GC optimization strategies and identify two pervasive issues: 1. various GC optimization strategies converge to class-level node feature matching between the original and condensed graphs, making the optimization target coarse-grained despite the complex computations; 2. to bridge the original and condensed graphs, existing GC methods rely on a Siamese graph network architecture that requires time-consuming bi-level optimization with iterative gradient computations. To overcome these issues, we propose a training-free GC framework termed Class-partitioned Graph Condensation (CGC), which refines the node feature matching from the class-to-class paradigm into a novel class-to-node paradigm. Remarkably, this refinement also simplifies the GC optimization as a class partition problem, which can be efficiently solved by any clustering methods. Moreover, CGC incorporates a pre-defined graph structure to enable a closed-form solution for condensed node features, eliminating the back-and-forth gradient descent in existing GC approaches without sacrificing accuracy. Extensive experiments demonstrate that CGC achieves state-of-the-art performance with a more efficient condensation process. For instance, compared with the seminal GC method (i.e., GCond), CGC condenses the largest Reddit graph within 10 seconds, achieving a 2,680X speedup and a 1.4% accuracy increase.

Heterogeneous Graph Condensation

TinyGraph: Joint Feature and Node Condensation for Graph Neural Networks

Disentangled Condensation for Large-scale Graphs

Graph Condensation for Inductive Node Representation Learning

Graph Condensation: A Survey

RobGC: Towards Robust Graph Condensation

Graph Condensation for Open-World Graph Learning

Simple Graph Condensation

Contrastive Graph Condensation: Advancing Data Versatility through Self-Supervised Learning

Graph Data Condensation via Self-expressive Graph Structure Reconstruction

Rethinking and Accelerating Graph Condensation: A Training-Free Approach with Class Partition

GCondenser: Benchmarking Graph Condensation

Structure-free Graph Condensation: From Large-scale Graphs to Condensed Graph-free Data

Federated Graph Condensation with Information Bottleneck Principles

GCondNet: A Novel Method for Improving Neural Networks on Small High-Dimensional Tabular Data

BG-HGNN: Toward Scalable and Efficient Heterogeneous Graph Neural Network

Fast Graph Condensation with Structure-based Neural Tangent Kernel

The Heterophilic Snowflake Hypothesis: Training and Empowering GNNs for Heterophilic Graphs

EXGC: Bridging Efficiency and Explainability in Graph Condensation

Exploiting Neighbor Effect: Conv-Agnostic GNN Framework for Graphs with Heterophily

Interpretable and Efficient Heterogeneous Graph Convolutional Network