VillagerAgent: A Graph-Based Multi-Agent Framework for Coordinating Complex Task Dependencies in Minecraft

Yubo Dong, Xukun Zhu, Zhengzhe Pan, Linchao Zhu, Yi Yang
2024-06-09
Abstract: In this paper, we aim to evaluate multi-agent systems against complex dependencies, including spatial, causal, and temporal constraints. First, we construct a new benchmark, named VillagerBench, within the Minecraft environment. VillagerBench comprises diverse tasks crafted to test various aspects of multi-agent collaboration, from workload distribution to dynamic adaptation and synchronized task execution. Second, we introduce a Directed Acyclic Graph Multi-Agent Framework, VillagerAgent, to resolve complex inter-agent dependencies and enhance collaborative efficiency. This solution incorporates a task decomposer that creates a directed acyclic graph (DAG) for structured task management, an agent controller for task distribution, and a state manager for tracking environmental and agent data. Our empirical evaluation on VillagerBench demonstrates that VillagerAgent outperforms the existing AgentVerse model, reducing hallucinations and improving task decomposition efficacy. The results underscore VillagerAgent's potential in advancing multi-agent collaboration, offering a scalable and generalizable solution in dynamic environments. The source code is open-source on GitHub (https://github.com/cnsdqd-dyb/VillagerAgent).
Artificial Intelligence, Multiagent Systems
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper aims to evaluate how well multi-agent systems handle complex dependencies, including spatial, causal, and temporal constraints. Specifically, the paper constructs a new benchmark environment, **VillagerBench**, and designs tasks within it that test different aspects of multi-agent collaboration, from workload distribution to dynamic adaptation and synchronized task execution.

### Main Contributions

1. **Construction of a New Benchmark Environment**:
   - **VillagerBench**: A multi-agent benchmark environment built within Minecraft, containing various tasks to evaluate different aspects of multi-agent collaboration.
2. **Introduction of a New Multi-Agent Framework**:
   - **VillagerAgent**: A multi-agent framework based on Directed Acyclic Graphs (DAGs) that addresses complex inter-agent dependencies and improves collaboration efficiency. The framework includes a task decomposer, an agent controller, a state manager, and basic agents.
3. **Empirical Evaluation**:
   - Empirical evaluations on **VillagerBench** show that the **VillagerAgent** framework outperforms the existing **AgentVerse** model, reducing hallucinations and improving task decomposition efficiency.

### Specific Tasks

1. **Construction Cooperation Task**:
   - Tests the agents' ability to understand task requirements and coordinate team workload, focusing on spatial dependencies in multi-agent collaboration.
2. **Farm-to-Table Cooking Task**:
   - Evaluates the agents' ability to adapt to environmental changes, targeting complex causal dependencies.
3. **Escape Room Challenge Task**:
   - Tests the agents' ability to execute tasks sequentially and in parallel, requiring reasoning about temporal dependencies and synchronized actions.

### Framework Components

1. **Task Decomposer**:
   - Generates a Directed Acyclic Graph (DAG) of tasks and manages the structure of subtasks.
2. **Agent Controller**:
   - Handles task allocation, assigning subtasks to basic agents based on the environment and agent states.
3. **State Manager**:
   - Maintains state information about the environment and the agents.
4. **Basic Agents**:
   - Execute assigned subtasks and perform self-reflection.

### Experimental Results

- **GPT-4-1106-preview** combined with the **VillagerAgent** framework showed the best task completion rate and resource utilization efficiency.
- **Gemini-Pro** demonstrated higher efficiency in the escape room task.
- **VillagerAgent** exhibited fewer hallucinations and a lower failure rate than **AgentVerse** in the farm-to-table cooking task.

### Conclusion

By constructing the **VillagerBench** benchmark and the **VillagerAgent** framework, the paper makes progress in evaluating and improving how multi-agent systems handle complex dependencies, providing a scalable and general solution for multi-agent collaboration in dynamic environments.
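
### Illustrative Sketch

The Task Decomposer and Agent Controller roles described under Framework Components can be illustrated with a minimal Python sketch. It is not the VillagerAgent implementation: the subtask names, the `schedule` function, and the agent names are all hypothetical, and each assigned subtask is assumed to succeed immediately. In the paper, the DAG is produced by a task decomposer and allocation depends on environment and agent state; here a hand-written dependency map and a simple first-come assignment stand in for both. The sketch uses the standard-library `graphlib.TopologicalSorter` (Python 3.9+) to release only subtasks whose prerequisites are finished.

```python
from graphlib import TopologicalSorter

# Hypothetical cooking-style subtasks; each maps to the set of its prerequisites.
SUBTASKS = {
    "collect_wheat": set(),
    "collect_egg": set(),
    "craft_bread": {"collect_wheat"},
    "craft_cake": {"collect_wheat", "collect_egg"},
    "serve_meal": {"craft_bread", "craft_cake"},
}


def schedule(subtasks, agents):
    """Return a list of rounds; each round maps an agent name to one subtask.

    A subtask is only handed out once all of its prerequisites are done.
    Assumption: every assigned subtask succeeds within its round.
    """
    sorter = TopologicalSorter(subtasks)
    sorter.prepare()  # raises graphlib.CycleError if the graph is not a DAG

    pending = []  # subtasks that are ready but not yet assigned
    rounds = []
    while sorter.is_active():
        # Collect newly unblocked subtasks (sorted for a deterministic order).
        pending.extend(sorted(sorter.get_ready()))

        # Give each agent at most one ready subtask this round.
        assignment = dict(zip(agents, pending))
        pending = pending[len(assignment):]

        # Mark the assigned subtasks as finished, which unblocks their successors.
        for task in assignment.values():
            sorter.done(task)
        rounds.append(assignment)
    return rounds
```

Calling `schedule(SUBTASKS, ["Alice", "Bob"])` finishes this example graph in three rounds, because the two collection subtasks and the two crafting subtasks can each run in parallel. With a single agent the same graph takes five rounds, one per subtask.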