AUTOACT: Automatic Agent Learning from Scratch via Self-Planning

Shuofei Qiao,Ningyu Zhang,Runnan Fang,Yujie Luo,Wangchunshu Zhou,Yuchen Eleanor Jiang,Chengfei Lv,Huajun Chen
2024-02-17
Abstract:Language agents have achieved considerable performance on various complex question-answering tasks. Despite the incessant exploration in this field, existing language agent systems still struggle with costly, non-reproducible data reliance and face the challenge of compelling a single model for multiple functions. To this end, we introduce AutoAct, an automatic agent learning framework that does not rely on large-scale annotated data and synthetic trajectories from closed-source models (e.g., GPT-4). Given limited data with a tool library, AutoAct first automatically synthesizes planning trajectories without any assistance from humans or strong closed-source models. Then, AutoAct leverages a division-of-labor strategy to automatically differentiate based on the target task information and synthesized trajectories, producing a sub-agent group to complete the task. We conduct comprehensive experiments with different LLMs, which demonstrates that AutoAct yields better or parallel performance compared to various strong baselines. Further analysis demonstrates the effectiveness of the division-of-labor strategy, with the trajectory quality generated by AutoAct significantly outperforming that of others. Code will be available at
Computation and Language,Artificial Intelligence,Human-Computer Interaction,Machine Learning,Multiagent Systems
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to address several key issues faced by language agents when performing complex tasks: 1. **High Data Dependency**: Existing language agent systems rely on large-scale annotated data and trajectories synthesized by closed-source models, leading to high costs and reproducibility issues. 2. **Multi-task Capability of a Single Model**: Current methods typically require a single model to possess multiple functions, increasing the model's burden and making it difficult to efficiently complete multiple tasks. 3. **Limitations in Planning Capability**: Existing methods often depend on closed-source models or large amounts of annotated data to endow language models with planning capabilities, limiting the application of open-source models. To address these issues, the authors introduce **AUTOACT**, an automatic agent learning framework. This framework does not rely on large-scale annotated data and trajectories synthesized by closed-source models. Instead, it enables agents to automatically synthesize planning trajectories and decompose tasks through Self-Planning and a Division-of-Labor Strategy, thereby improving the efficiency and accuracy of task completion.