Test-Time Training on Graphs with Large Language Models (LLMs)

Jiaxin Zhang, Yiqi Wang, Xihong Yang, Siwei Wang, Yu Feng, Yu Shi, Ruicaho Ren, En Zhu, Xinwang Liu

2024-04-21

Abstract: Graph Neural Networks (GNNs) have demonstrated great success in various fields of multimedia. However, the distribution shift between the training and test data challenges the effectiveness of GNNs. To mitigate this challenge, Test-Time Training (TTT) has been proposed as a promising approach. Traditional TTT methods require a demanding unsupervised training strategy to capture information from the test data that benefits the main task. Inspired by the strong annotation ability of Large Language Models (LLMs) on Text-Attributed Graphs (TAGs), we propose to enhance test-time training on graphs with LLMs as annotators. In this paper, we design a novel Test-Time Training pipeline, LLMTTT, which conducts test-time adaptation under the annotations by LLMs on a carefully selected node set. Specifically, LLMTTT introduces a hybrid active node selection strategy that considers not only node diversity and representativeness, but also prediction signals from the pre-trained model. Given annotations from LLMs, a two-stage training strategy is designed to tailor the test-time model with the limited and noisy labels. A theoretical analysis supports the validity of our method, and extensive experiments demonstrate that the proposed LLMTTT can achieve a significant performance improvement compared to existing Out-of-Distribution (OOD) generalization methods.
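As a rough illustration of the hybrid active node selection idea, the sketch below greedily picks test nodes using three signals: prediction entropy from the pre-trained model, closeness to the other nodes in embedding space, and distance from the nodes already picked. Everything specific here is an assumption made for illustration and not the authors' method: the function names (`entropy`, `rescale`, `select_nodes`), the weights `w_unc`, `w_rep`, `w_div`, and the choice of entropy, mean distance, and nearest-selected distance as proxies for the prediction signal, representativeness, and diversity. The abstract does not state how LLMTTT combines these signals.

```python
import math


def entropy(p):
    """Shannon entropy of one predicted class distribution."""
    return -sum(q * math.log(q) for q in p if q > 0)


def rescale(values):
    """Min-max rescale a list to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def select_nodes(embeddings, probs, budget, w_unc=1.0, w_rep=1.0, w_div=1.0):
    """Greedily choose `budget` node indices to send to an annotator.

    embeddings: per-node feature vectors from the pre-trained model
    probs:      per-node predicted class distributions
    All scoring choices below are illustrative assumptions.
    """
    n = len(embeddings)
    budget = max(0, min(budget, n))

    # Prediction signal: higher entropy means the model is less certain.
    unc = rescale([entropy(p) for p in probs])

    # Representativeness proxy: a small mean distance to all nodes scores high.
    mean_dist = [
        sum(math.dist(embeddings[i], embeddings[j]) for j in range(n)) / n
        for i in range(n)
    ]
    rep = [1.0 - m for m in rescale(mean_dist)]

    chosen = []
    nearest = [math.inf] * n  # distance from each node to its nearest chosen node
    while len(chosen) < budget:
        # Diversity proxy: prefer nodes far from everything chosen so far.
        div = rescale(nearest) if chosen else [0.0] * n
        best = max(
            (i for i in range(n) if i not in chosen),
            key=lambda i: w_unc * unc[i] + w_rep * rep[i] + w_div * div[i],
        )
        chosen.append(best)
        for i in range(n):
            nearest[i] = min(nearest[i], math.dist(embeddings[i], embeddings[best]))
    return chosen
```

Setting two of the weights to zero isolates one signal, which makes it easy to check how each term influences which nodes would be annotated.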

Machine Learning, Artificial Intelligence

What problem does this paper attempt to address?

This paper addresses the performance degradation of Graph Neural Networks (GNNs) when the training and test data follow different distributions, i.e., the Out-of-Distribution (OOD) setting. Traditional Test-Time Training (TTT) methods rely on unsupervised strategies to adapt to the test data. Inspired by the annotation capability of Large Language Models (LLMs) on Text-Attributed Graphs (TAGs), the authors instead propose a new method called LLMTTT. LLMTTT performs test-time adaptation using LLM annotations on a carefully selected set of nodes, and adopts a hybrid active node selection strategy that considers node diversity, representativeness, and the prediction signal of the pre-trained model. Because the labels generated by LLMs are limited and noisy, a two-stage training strategy is designed to adapt the model.

The main contributions of this paper include:

1. Introducing a new pipeline, LLMTTT, that utilizes LLMs as annotators to obtain pseudo-labels for test-time training, alleviating the OOD problem on graph data.
2. Developing a hybrid active node selection strategy that considers not only node diversity and representativeness, but also the prediction signal of the pre-trained model.
3. Designing a two-stage training strategy to adapt to limited and noisy labeled samples.
4. Demonstrating, through experiments and theoretical analysis, the effectiveness of LLMTTT on various OOD graphs, with significant performance improvements over existing OOD generalization methods.
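To make the idea of two-stage training on limited and noisy labels more concrete, here is a minimal sketch under stated assumptions. The summary above does not say what the two stages are, so this is a hypothetical reading and not the authors' procedure: stage 1 fits on LLM-annotated nodes with each sample weighted by an assumed annotation confidence, and stage 2 adds confident model predictions on the remaining test nodes as pseudo-labels. The tiny softmax classifier stands in for a GNN, and `two_stage_adapt`, `sgd_epoch`, `predict`, and `threshold` are illustrative names.

```python
import math


def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def predict(W, x):
    """Class probabilities of a linear softmax model; last weight is the bias."""
    xb = list(x) + [1.0]
    return softmax([sum(w * v for w, v in zip(row, xb)) for row in W])


def sgd_epoch(W, samples, lr):
    """One SGD pass of weighted cross-entropy; samples are (x, label, weight)."""
    for x, y, weight in samples:
        p = predict(W, x)
        xb = list(x) + [1.0]
        for c, row in enumerate(W):
            grad = p[c] - (1.0 if c == y else 0.0)
            for j in range(len(row)):
                row[j] -= lr * weight * grad * xb[j]


def two_stage_adapt(W, annotated, unlabeled, epochs=50, lr=0.5, threshold=0.9):
    """Hypothetical two-stage adaptation; updates W in place.

    annotated: (features, llm_label, llm_confidence) for selected nodes
    unlabeled: feature vectors of the remaining test nodes
    """
    # Stage 1 (assumed): fit on the few LLM labels, trusting confident ones more.
    for _ in range(epochs):
        sgd_epoch(W, annotated, lr)

    # Stage 2 (assumed): self-train on test nodes the model is now sure about.
    pseudo = []
    for x in unlabeled:
        p = predict(W, x)
        conf = max(p)
        if conf >= threshold:
            pseudo.append((x, p.index(conf), conf))
    for _ in range(epochs):
        sgd_epoch(W, annotated + pseudo, lr)
    return W, len(pseudo)
```

Weighting by confidence is one simple way to limit the influence of noisy annotations; the threshold in stage 2 plays the same role for the model's own pseudo-labels.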

Test-Time Training for Graph Neural Networks

GraphTTA: Test Time Adaptation on Graph Neural Networks

Disentangled Representation Learning with Large Language Models for Text-Attributed Graphs

Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs

Large Language Model-based Augmentation for Imbalanced Node Classification on Text-Attributed Graphs

Distilling Large Language Models for Text-Attributed Graph Learning

GraphGPT: Graph Instruction Tuning for Large Language Models

Can GNN be Good Adapter for LLMs?

Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning

NT-LLM: A Novel Node Tokenizer for Integrating Graph Structure into Large Language Models

Leveraging Large Language Models for Node Generation in Few-Shot Learning on Text-Attributed Graphs

Exploring the Potential of Large Language Models (LLMs) in Learning on Graphs

A Fully Test-Time Training Framework for Semi-Supervised Node Classification on Out-of-Distribution Graphs

Enhance Graph Alignment for Large Language Models

Enhancing Graph Neural Networks with Limited Labeled Data by Actively Distilling Knowledge from Large Language Models

Efficient and effective training of language and graph neural network models

Efficient End-to-end Language Model Fine-tuning on Graphs

Empower Text-Attributed Graphs Learning with Large Language Models (LLMs)

Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning

Graph Learning in the Era of LLMs: A Survey from the Perspective of Data, Models, and Tasks