Abstract:Graphs are widely used for modeling relational data in real-world scenarios, such as social networks and urban computing. Existing LLM-based graph analysis approaches either integrate graph neural networks (GNNs) for specific machine learning tasks, limiting their transferability, or rely solely on LLMs' internal reasoning ability, resulting in suboptimal performance. To address these limitations, we take advantage of recent advances in LLM-based agents, which have shown capabilities of utilizing external knowledge or tools for problem solving. By simulating human problem-solving strategies such as analogy and collaboration, we propose a multi-agent system based on LLMs named GraphTeam, for graph analysis. GraphTeam consists of five LLM-based agents from three modules, and the agents with different specialities can collaborate with each other to address complex problems. Specifically, (1) input-output normalization module: the question agent extracts and refines four key arguments from the original question, facilitating the problem understanding, and the answer agent organizes the results to meet the output requirement; (2) external knowledge retrieval module: we first build a knowledge base consisting of relevant documentation and experience information, and then the search agent retrieves the most relevant entries for each question. (3) problem-solving module: given the retrieved information from search agent, the coding agent uses established algorithms via programming to generate solutions, and in case the coding agent does not work, the reasoning agent will directly compute the results without programming. Extensive experiments on six graph analysis benchmarks demonstrate that GraphTeam achieves state-of-the-art performance with an average 25.85% improvement over the best baseline in terms of accuracy. The code and data are available at <a class="link-external link-https" href="https://github.com/BUPT-GAMMA/GraphTeam" rel="external noopener nofollow">this https URL</a>.

AutoGraph: Enabling Visual Context Via Graph Alignment in Open Domain Multi-Modal Dialogue Generation

SKANet - Structured Knowledge-Aware Network for Visual Dialog.

Advancement in Graph Understanding: A Multimodal Benchmark and Fine-Tuning of Vision-Language Models

Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation

VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context

Scene Dynamics: Counterfactual Critic Multi-Agent Training for Scene Graph Generation.

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog.

Graphologue: Exploring Large Language Model Responses with Interactive Diagrams

Open Domain Dialogue Generation with Latent Images

GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration

From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models

ORD: Object Relationship Discovery for Visual Dialogue Generation

MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation

Multi-task learning with graph attention networks for multi-domain task-oriented dialogue systems

GRADE: Automatic Graph-Enhanced Coherence Metric for Evaluating Open-Domain Dialogue Systems

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

DriveLM: Driving with Graph Visual Question Answering

Structure-Aware Multimodal Sequential Learning for Visual Dialog

Exploring Graph Structure Comprehension Ability of Multimodal Large Language Models: Case Studies

Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation

Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models