Abstract:In recent years, graph pre-training has gained significant attention, focusing on acquiring transferable knowledge from unlabeled graph data to improve downstream performance. Despite these recent endeavors, the problem of negative transfer remains a major concern when utilizing graph pre-trained models to downstream tasks. Previous studies made great efforts on the issue of what to pre-train and how to pre-train by designing a variety of graph pre-training and fine-tuning strategies. However, there are cases where even the most advanced "pre-train and fine-tune" paradigms fail to yield distinct benefits. This paper introduces a generic framework W2PGNN to answer the crucial question of when to pre-train (i.e., in what situations could we take advantage of graph pre-training) before performing effortful pre-training or fine-tuning. We start from a new perspective to explore the complex generative mechanisms from the pre-training data to downstream data. In particular, W2PGNN first fits the pre-training data into graphon bases, each element of graphon basis (i.e., a graphon) identifies a fundamental transferable pattern shared by a collection of pre-training graphs. All convex combinations of graphon bases give rise to a generator space, from which graphs generated form the solution space for those downstream data that can benefit from pre-training. In this manner, the feasibility of pre-training can be quantified as the generation probability of the downstream data from any generator in the generator space. W2PGNN offers three broad applications: providing the application scope of graph pre-trained models, quantifying the feasibility of pre-training, and assistance in selecting pre-training data to enhance downstream performance. We provide a theoretically sound solution for the first application and extensive empirical justifications for the latter two applications.

Better with Less: Data-Active Pre-training of Graph Neural Networks

Better with Less: A Data-Active Perspective on Pre-Training Graph Neural Networks

NGAT: Attention in Breadth and Depth Exploration for Semi-Supervised Graph Representation Learning

Strategies for Pre-training Graph Neural Networks

Pre-Training Graph Neural Networks for Generic Structural Feature Extraction

When to Pre-Train Graph Neural Networks? from Data Generation Perspective!

GPT-GNN: Generative Pre-Training of Graph Neural Networks

An Adaptive Graph Pre-training Framework for Localized Collaborative Filtering

A Pure Transformer Pretraining Framework on Text-attributed Graphs

Scalable Multi-Source Pre-training for Graph Neural Networks

ULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual Prompt.

GraphControl: Adding Conditional Control to Universal Graph Pre-trained Models for Graph Domain Transfer Learning

Graph Pre-training for Reconnaissance Perception in Automated Penetration Testing

Graph Pre-Training Models Are Strong Anomaly Detectors

CPDG: A Contrastive Pre-Training Method for Dynamic Graph Neural Networks

One Model for One Graph: A New Perspective for Pretraining with Cross-domain Graphs

All in One and One for All: A Simple yet Effective Method towards Cross-domain Graph Pretraining

Hypergraph Pre-training with Graph Neural Networks

Graph Neural Network Training with Data Tiering