TaskGen: A Task-Based, Memory-Infused Agentic Framework using StrictJSON

John Chong Min Tan,Prince Saroj,Bharat Runwal,Hardik Maheshwari,Brian Lim Yi Sheng,Richard Cottrill,Alankrit Chona,Ambuj Kumar,Mehul Motani
2024-07-22
Abstract:TaskGen is an open-sourced agentic framework which uses an Agent to solve an arbitrary task by breaking them down into subtasks. Each subtask is mapped to an Equipped Function or another Agent to execute. In order to reduce verbosity (and hence token usage), TaskGen uses StrictJSON that ensures JSON output from the Large Language Model (LLM), along with additional features such as type checking and iterative error correction. Key to the philosophy of TaskGen is the management of information/memory on a need-to-know basis. We empirically evaluate TaskGen on various environments such as 40x40 dynamic maze navigation with changing obstacle locations (100% solve rate), TextWorld escape room solving with dense rewards and detailed goals (96% solve rate), web browsing (69% of actions successful), solving the MATH dataset (71% solve rate over 100 Level-5 problems), Retrieval Augmented Generation on NaturalQuestions dataset (F1 score of 47.03%)
Artificial Intelligence,Multiagent Systems
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to construct an intelligent agent framework capable of handling any task. Specifically, the paper proposes a new open - source agent framework named TaskGen, which achieves this goal by decomposing complex tasks into subtasks and assigning an equipped function or internal agent to each subtask for execution. The main features of TaskGen include: 1. **Reducing Redundancy**: Use the StrictJSON (Strict JSON) format to reduce output redundancy and ensure that the output is concise and easy to be used for downstream tasks. 2. **Shared Memory**: Share information on a need - based basis to improve efficiency and accuracy. The shared memory is divided into two parts: a list of completed subtasks and shared variables. 3. **Global Context**: Provide important information that changes dynamically, enabling the agent to adapt to environmental changes. 4. **Memory Bank**: Through Retrieval Augmented Generation (RAG) technology, extract relevant information from the memory bank according to the semantic similarity of tasks to enhance the capabilities of the agent. The paper evaluates the effectiveness of TaskGen through experiments in multiple environments, including dynamic maze navigation, TextWorld escape - room solving, web browsing, solving problems in the MATH dataset, and RAG - based natural question answering, demonstrating its strong performance in various tasks.