Abstract: The need to analyze graphs is ubiquitous across fields ranging from social networks to biological research and recommendation systems. Enabling large language models (LLMs) to process graphs is therefore an important step toward more advanced general intelligence. However, current LLM benchmarks on graph analysis require models to reason directly over prompts describing graph topology, and are thus limited to small graphs with only a few dozen nodes. In contrast, human experts typically write programs based on popular libraries to solve such tasks, and can thus handle graphs of different scales. A question naturally arises: can LLMs analyze graphs like professionals? In this paper, we introduce ProGraph, a manually crafted benchmark containing 3 categories of graph tasks. The benchmark expects solutions based on programming rather than direct reasoning over raw inputs. Our findings reveal that the performance of current LLMs is unsatisfactory, with the best model achieving only 36% accuracy. To bridge this gap, we propose the LLM4Graph datasets, which include crawled documents and auto-generated code based on 6 widely used graph libraries. By augmenting closed-source LLMs with document retrieval and fine-tuning open-source ones on the code, we show 11-32% absolute improvements in accuracy. Our results underscore that the capabilities of LLMs in handling structured data are still under-explored, and show the effectiveness of LLM4Graph in enhancing LLMs' proficiency in graph analysis. The benchmark, datasets and enhanced open-source models are available at https://github.com/BUPT-GAMMA/ProGraph.
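To illustrate the contrast the abstract draws between reasoning over a textual graph description and writing a program, here is a minimal sketch in Python. It is not taken from ProGraph or LLM4Graph: the graph generator, the function names, and the choice of task (shortest-path length) are all assumptions made for illustration, and the code uses only the standard library so that it stays self-contained. A practitioner would more likely call a library routine such as `networkx.shortest_path` for the same job.

```python
from collections import deque


def build_ring_with_chords(n, step):
    """Hypothetical example graph (not from the benchmark): a ring of n
    nodes, plus an undirected chord from each node i to (i + step) % n."""
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in ((i + 1) % n, (i + step) % n):
            adj[i].add(j)
            adj[j].add(i)
    return adj


def shortest_path_length(adj, source, target):
    """Breadth-first search over an adjacency dict.

    Returns the number of edges on a shortest path from source to target,
    or None if target is unreachable.
    """
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        node, dist = queue.popleft()
        for neighbor in adj[node]:
            if neighbor == target:
                return dist + 1
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, dist + 1))
    return None


# A graph with 100,000 nodes: far too large to describe in a prompt,
# but routine for a short program.
graph = build_ring_with_chords(100_000, 1_000)
hops = shortest_path_length(graph, 0, 50_000)
print(hops)
```

The same program works unchanged whether the graph has a few dozen nodes or a hundred thousand, which is the scaling property the abstract attributes to programming-based solutions.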

Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

When Graph Data Meets Multimodal: A New Paradigm for Graph Understanding and Reasoning

GraphEval2000: Benchmarking and Improving Large Language Models on Graph Datasets

Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies

GraphInstruct: Empowering Large Language Models with Graph Understanding and Reasoning Capability

GPT4Graph: Can Large Language Models Understand Graph Structured Data? An Empirical Evaluation and Benchmarking

GraphWiz: An Instruction-Following Language Model for Graph Problems

GITA: Graph to Visual and Textual Integration for Vision-Language Graph Reasoning

InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference Alignment

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

GraphLLM: Boosting Graph Reasoning Ability of Large Language Model

Large Language Models on Graphs: A Comprehensive Survey

DriveLM: Driving with Graph Visual Question Answering

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models

Evaluating Large Language Models on Graphs: Performance Insights and Comparative Analysis

Joint Embeddings for Graph Instruction Tuning

CogCoM: Train Large Vision-Language Models Diving into Details through Chain of Manipulations

Visualization Literacy of Multimodal Large Language Models: A Comparative Study

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models