TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

John Chong Min Tan,Prince Saroj,Bharat Runwal,Hardik Maheshwari,Brian Lim Yi Sheng,Richard Cottrill,Alankrit Chona,Ambuj Kumar,Mehul Motani

2024-07-22

Abstract:TaskGen is an open-sourced agentic framework which uses an Agent to solve an arbitrary task by breaking them down into subtasks. Each subtask is mapped to an Equipped Function or another Agent to execute. In order to reduce verbosity (and hence token usage), TaskGen uses StrictJSON that ensures JSON output from the Large Language Model (LLM), along with additional features such as type checking and iterative error correction. Key to the philosophy of TaskGen is the management of information/memory on a need-to-know basis. We empirically evaluate TaskGen on various environments such as 40x40 dynamic maze navigation with changing obstacle locations (100% solve rate), TextWorld escape room solving with dense rewards and detailed goals (96% solve rate), web browsing (69% of actions successful), solving the MATH dataset (71% solve rate over 100 Level-5 problems), Retrieval Augmented Generation on NaturalQuestions dataset (F1 score of 47.03%)

Artificial Intelligence,Multiagent Systems

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to construct an intelligent agent framework capable of handling any task. Specifically, the paper proposes a new open - source agent framework named TaskGen, which achieves this goal by decomposing complex tasks into subtasks and assigning an equipped function or internal agent to each subtask for execution. The main features of TaskGen include: 1. **Reducing Redundancy**: Use the StrictJSON (Strict JSON) format to reduce output redundancy and ensure that the output is concise and easy to be used for downstream tasks. 2. **Shared Memory**: Share information on a need - based basis to improve efficiency and accuracy. The shared memory is divided into two parts: a list of completed subtasks and shared variables. 3. **Global Context**: Provide important information that changes dynamically, enabling the agent to adapt to environmental changes. 4. **Memory Bank**: Through Retrieval Augmented Generation (RAG) technology, extract relevant information from the memory bank according to the semantic similarity of tasks to enhance the capabilities of the agent. The paper evaluates the effectiveness of TaskGen through experiments in multiple environments, including dynamic maze navigation, TextWorld escape - room solving, web browsing, solving problems in the MATH dataset, and RAG - based natural question answering, demonstrating its strong performance in various tasks.

TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

Coding Reliable LLM-based Integrated Task and Knowledge Agents with GenieWorksheets

ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games

High-Level, Collaborative Task Planning Grammar and Execution for Heterogeneous Agents

TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation

Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset

HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model

Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents

MetaGPT: Meta Programming for A Multi-Agent Collaborative Framework

AgentKit: Structured LLM Reasoning with Dynamic Graphs

ToolGen: Unified Tool Retrieval and Calling via Generation

ConceptAgent: LLM-Driven Precondition Grounding and Tree Search for Robust Task Planning and Execution

GTA: A Benchmark for General Tool Agents

CPS-TaskForge: Generating Collaborative Problem Solving Environments for Diverse Communication Tasks

TaskGenie: Crowd-Powered Task Generation for Struggling Search

GraphAgent: Agentic Graph Language Assistant

Ghost in the Minecraft: Generally Capable Agents for Open-World Environments Via Large Language Models with Text-based Knowledge and Memory

SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks

STRATA: A Unified Framework for Task Assignments in Large Teams of Heterogeneous Agents

Functional Task Tree Generation from a Knowledge Graph to Solve Unseen Problems