Abstract:In this paper, we aim to evaluate multi-agent systems against complex dependencies, including spatial, causal, and temporal constraints. First, we construct a new benchmark, named VillagerBench, within the Minecraft environment.VillagerBench comprises diverse tasks crafted to test various aspects of multi-agent collaboration, from workload distribution to dynamic adaptation and synchronized task execution. Second, we introduce a Directed Acyclic Graph Multi-Agent Framework VillagerAgent to resolve complex inter-agent dependencies and enhance collaborative efficiency. This solution incorporates a task decomposer that creates a directed acyclic graph (DAG) for structured task management, an agent controller for task distribution, and a state manager for tracking environmental and agent data. Our empirical evaluation on VillagerBench demonstrates that VillagerAgent outperforms the existing AgentVerse model, reducing hallucinations and improving task decomposition efficacy. The results underscore VillagerAgent's potential in advancing multi-agent collaboration, offering a scalable and generalizable solution in dynamic environments. The source code is open-source on GitHub (<a class="link-external link-https" href="https://github.com/cnsdqd-dyb/VillagerAgent" rel="external noopener nofollow">this https URL</a>).

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve This paper aims to evaluate the capability of multi-agent systems in handling complex dependencies, including spatial, causal, and temporal constraints. Specifically, the paper constructs a new benchmark environment **VillagerBench** and designs various tasks within it to test different aspects of multi-agent collaboration, from workload distribution to dynamic adaptation and synchronized task execution. ### Main Contributions 1. **Construction of a New Benchmark Environment**: - **VillagerBench**: A multi-agent benchmark environment built within the Minecraft setting, containing various tasks to evaluate different aspects of multi-agent collaboration. 2. **Introduction of a New Multi-Agent Framework**: - **VillagerAgent**: A multi-agent framework based on Directed Acyclic Graphs (DAG) to address complex inter-agent dependencies and improve collaboration efficiency. This framework includes a task decomposer, agent controller, state manager, and basic agents. 3. **Empirical Evaluation**: - Empirical evaluations conducted on **VillagerBench** demonstrate that the **VillagerAgent** framework outperforms the existing **AgentVerse** model in reducing hallucination behaviors and improving task decomposition efficiency. ### Specific Tasks 1. **Construction Cooperation Task**: - Tests the agents' ability to understand task requirements and coordinate team workload, focusing on spatial dependencies in multi-agent collaboration. 2. **Farm-to-Table Cooking Task**: - Evaluates the agents' ability to adapt to environmental changes, aiming to address complex causal dependencies. 3. **Escape Room Challenge Task**: - Tests the agents' ability to execute tasks sequentially and in parallel, requiring reasoning about temporal dependencies and synchronized actions. ### Framework Components 1. **Task Decomposer**: - Generates a Directed Acyclic Graph (DAG) of tasks, managing the structure of subtasks. 2. **Agent Controller**: - Responsible for task allocation, assigning subtasks to basic agents based on the environment and agent states. 3. **State Manager**: - Maintains state information of the environment and agents. 4. **Basic Agents**: - Execute assigned subtasks and perform self-reflection. ### Experimental Results - **GPT-4-1106-preview** combined with the **VillagerAgent** framework showed the best task completion rate and resource utilization efficiency. - **Gemini-Pro** demonstrated higher efficiency in the escape room task. - **VillagerAgent** exhibited lower hallucination behaviors and failure rates compared to **AgentVerse** in the farm-to-table cooking task. ### Conclusion By constructing the **VillagerBench** and **VillagerAgent** frameworks, the paper makes significant progress in evaluating and improving multi-agent systems' handling of complex dependencies, providing a scalable and general solution for multi-agent collaboration in dynamic environments.

VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft

Mcu: A task-centric framework for open-ended agent evaluation in minecraft

MindAgent: Emergent Gaming Interaction

TeamCraft: A Benchmark for Multi-Modal Multi-Agent Systems in Minecraft

Odyssey: Empowering Minecraft Agents with Open-World Skills

Towards Evaluating Generalist Agents: An Automated Benchmark in Open World

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments Via Large Language Models with Text-based Knowledge and Memory

Heterogeneous multi-agent task allocation based on graph neural network ant colony optimization algorithms

TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation

Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agents

S-Agents: Self-organizing Agents in Open-ended Environments

Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents.

Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction

MineLand: Simulating Large-Scale Multi-Agent Interactions with Limited Multimodal Senses and Physical Needs

STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft

MineStudio: A Streamlined Package for Minecraft AI Agent Development

Learning Multi-Agent Communication from Graph Modeling Perspective