Graph Meets LLMs: Towards Large Graph Models

Ziwei Zhang,Haoyang Li,Zeyang Zhang,Yijian Qin,Xin Wang,Wenwu Zhu
DOI: https://doi.org/10.48550/arXiv.2308.14522
2023-11-11
Abstract:Large models have emerged as the most recent groundbreaking achievements in artificial intelligence, and particularly machine learning. However, when it comes to graphs, large models have not achieved the same level of success as in other fields, such as natural language processing and computer vision. In order to promote applying large models for graphs forward, we present a perspective paper to discuss the challenges and opportunities associated with developing large graph models. First, we discuss the desired characteristics of large graph models. Then, we present detailed discussions from three key perspectives: representation basis, graph data, and graph models. In each category, we provide a brief overview of recent advances and highlight the remaining challenges together with our visions. Finally, we discuss valuable applications of large graph models. We believe this perspective can encourage further investigations into large graph models, ultimately pushing us one step closer towards artificial general intelligence (AGI). We are the first to comprehensively study large graph models, to the best of our knowledge.
Machine Learning,Artificial Intelligence,Social and Information Networks
What problem does this paper attempt to address?
The problems that this paper attempts to solve are: the challenges and opportunities faced when applying large - scale models (such as large language models LLMs) on graph - structured data. Although large - scale models have achieved remarkable success in fields such as natural language processing (NLP) and computer vision (CV), their application on graph data is relatively lagging. The paper explores how to overcome these obstacles by constructing large - scale graph models and discusses the potential application value of large - scale graph models in different fields, aiming to promote the development of the graph learning field and ultimately contribute to the progress towards artificial general intelligence (AGI). Specifically, the paper explores the key issues of large - scale graph models from the following aspects: 1. **Desired Characteristics of Large - Scale Graph Models**: - **Graph Models with Scaling Laws**: Similar to large - scale language models, the performance of large - scale graph models continuously improves as the model size, data set size, and training computation increase. - **Graph Foundation Models**: They can handle different graph tasks across multiple domains and have the ability to understand graph - structure information and attributes. - **Graph Understanding and Processing Capabilities in Context**: They can understand and process new graph data sets and tasks at the test stage, requiring only a small number of samples and without the need for extensive modification of the model. - **Diverse Graph Reasoning Abilities**: They can handle tasks involving local and global graph properties, such as node degree, shortest path, centrality, etc. 2. **Graph Representation Foundation**: - **Graph Domains and Transferability**: It explores the differences and commonalities between different graph domains and how to find a unified representation foundation suitable for multiple graph domains. - **Alignment with Natural Language**: It studies how to enhance the interactivity and instruction - execution ability of graph models through the alignment of natural language and graph data. 3. **Graph Data**: - It emphasizes the importance of high - quality, large - scale graph data sets and points out that the current scale of graph data sets is far smaller than that of data sets in the NLP and CV fields. - It proposes principles for collecting more graph data, including domain diversity, type diversity, statistical diversity, task diversity, and modality diversity. 4. **Graph Models**: - **Backbone Architectures**: It discusses two main graph model architectures, graph neural networks (GNNs) and graph Transformers, and analyzes their advantages, disadvantages, and future development directions. - **Pre - training**: It emphasizes the importance of pre - training in large - scale graph models and introduces pre - training strategies such as contrastive learning and generation / prediction methods. - **Post - processing**: It explores how to enhance the downstream task adaptability of pre - trained models through techniques such as prompting, parameter - efficient fine - tuning, reinforcement learning, and model compression. Overall, the paper aims to comprehensively analyze the development status and future direction of large - scale graph models and provide theoretical and practical guidance for researchers.