Two Trades is not Baffled: Condensing Graph via Crafting Rational Gradient Matching

Tianle Zhang, Yuchen Zhang, Kun Wang, Kai Wang, Beining Yang, Kaipeng Zhang, Wenqi Shao, Ping Liu, Joey Tianyi Zhou, Yang You
2024-09-27
Abstract: Training on large-scale graphs has achieved remarkable results in graph representation learning, but its cost and storage have raised growing concerns. As one of the most promising directions, graph condensation methods address these issues by employing gradient matching, aiming to condense the full graph into a more concise yet information-rich synthetic set. Though encouraging, these strategies primarily emphasize matching the directions of the gradients, which leads to deviations in the training trajectories. Such deviations are further magnified by the differences between the condensation and evaluation phases, culminating in accumulated errors that detrimentally affect the performance of the condensed graphs. In light of this, we propose a novel graph condensation method named CrafTing RationaL trajectory (CTRL), which offers an optimized starting point closer to the original dataset's feature distribution and a more refined strategy for gradient matching. Theoretically, CTRL can effectively neutralize the impact of accumulated errors on the performance of condensed graphs. We provide extensive experiments on various graph datasets and downstream tasks to support the effectiveness of CTRL. Code is released at https://github.com/NUS-HPC-AI-Lab/CTRL.
Machine Learning
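The abstract's point that direction-only gradient matching can leave training trajectories misaligned can be illustrated with a small sketch. The code below is not taken from the CTRL repository. It assumes, for illustration only, that a "more refined" matching criterion mixes a direction term (cosine distance) with a magnitude-sensitive term (Euclidean distance). The function names and the weight `beta` are hypothetical.

```python
import math


def cosine_distance(g_real, g_syn):
    """Direction-only mismatch: 1 - cos(angle). Assumes non-zero gradients."""
    dot = sum(a * b for a, b in zip(g_real, g_syn))
    norm_real = math.sqrt(sum(a * a for a in g_real))
    norm_syn = math.sqrt(sum(b * b for b in g_syn))
    if norm_real == 0.0 or norm_syn == 0.0:
        raise ValueError("cosine distance is undefined for a zero gradient")
    return 1.0 - dot / (norm_real * norm_syn)


def euclidean_distance(g_real, g_syn):
    """Mismatch that is sensitive to both direction and magnitude."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(g_real, g_syn)))


def combined_distance(g_real, g_syn, beta=0.5):
    """Hypothetical weighted mix of the two terms; beta is an assumed knob."""
    return (beta * cosine_distance(g_real, g_syn)
            + (1.0 - beta) * euclidean_distance(g_real, g_syn))


# Two gradients with identical direction but a 10x difference in magnitude.
g_real = [3.0, 4.0]   # stand-in for a gradient computed on the original graph
g_syn = [0.3, 0.4]    # stand-in for a gradient computed on the condensed graph

direction_only = cosine_distance(g_real, g_syn)
magnitude_aware = euclidean_distance(g_real, g_syn)
mixed = combined_distance(g_real, g_syn, beta=0.5)

print(f"cosine distance:    {direction_only:.4f}")
print(f"euclidean distance: {magnitude_aware:.4f}")
print(f"combined (beta=.5): {mixed:.4f}")
```

In this toy case the cosine term reports essentially no mismatch even though the two gradients would produce very different update step sizes, while the Euclidean term does register the gap. That is the kind of deviation the abstract attributes to direction-focused matching.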