GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning

Yun Zhu,Yaoke Wang,Haizhou Shi,Zhenshuo Zhang,Dian Jiao,Siliang Tang
DOI: https://doi.org/10.1145/3589334.3645439
2024-01-01
Abstract:Graph-structured data is ubiquitous in the world which models complexrelationships between objects, enabling various Web applications. Dailyinfluxes of unlabeled graph data on the Web offer immense potential for theseapplications. Graph self-supervised algorithms have achieved significantsuccess in acquiring generic knowledge from abundant unlabeled graph data.These pre-trained models can be applied to various downstream Web applications,saving training time and improving downstream (target) performance. However,different graphs, even across seemingly similar domains, can differsignificantly in terms of attribute semantics, posing difficulties, if notinfeasibility, for transferring the pre-trained models to downstream tasks.Concretely speaking, for example, the additional task-specific node informationin downstream tasks (specificity) is usually deliberately omitted so that thepre-trained representation (transferability) can be leveraged. The trade-off assuch is termed as "transferability-specificity dilemma" in this work. Toaddress this challenge, we introduce an innovative deployment module coined asGraphControl, motivated by ControlNet, to realize better graph domain transferlearning. Specifically, by leveraging universal structural pre-trained modelsand GraphControl, we align the input space across various graphs andincorporate unique characteristics of target data as conditional inputs. Theseconditions will be progressively integrated into the model during fine-tuningor prompt tuning through ControlNet, facilitating personalized deployment.Extensive experiments show that our method significantly enhances theadaptability of pre-trained models on target attributed datasets, achieving1.4-3x performance gain. Furthermore, it outperforms training-from-scratchmethods on target data with a comparable margin and exhibits fasterconvergence.
What problem does this paper attempt to address?