AUTOACT: Automatic Agent Learning from Scratch via Self-Planning

Shuofei Qiao,Ningyu Zhang,Runnan Fang,Yujie Luo,Wangchunshu Zhou,Yuchen Eleanor Jiang,Chengfei Lv,Huajun Chen

2024-02-17

Abstract:Language agents have achieved considerable performance on various complex question-answering tasks. Despite the incessant exploration in this field, existing language agent systems still struggle with costly, non-reproducible data reliance and face the challenge of compelling a single model for multiple functions. To this end, we introduce AutoAct, an automatic agent learning framework that does not rely on large-scale annotated data and synthetic trajectories from closed-source models (e.g., GPT-4). Given limited data with a tool library, AutoAct first automatically synthesizes planning trajectories without any assistance from humans or strong closed-source models. Then, AutoAct leverages a division-of-labor strategy to automatically differentiate based on the target task information and synthesized trajectories, producing a sub-agent group to complete the task. We conduct comprehensive experiments with different LLMs, which demonstrates that AutoAct yields better or parallel performance compared to various strong baselines. Further analysis demonstrates the effectiveness of the division-of-labor strategy, with the trajectory quality generated by AutoAct significantly outperforming that of others. Code will be available at

Computation and Language,Artificial Intelligence,Human-Computer Interaction,Machine Learning,Multiagent Systems

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve The paper aims to address several key issues faced by language agents when performing complex tasks: 1. **High Data Dependency**: Existing language agent systems rely on large-scale annotated data and trajectories synthesized by closed-source models, leading to high costs and reproducibility issues. 2. **Multi-task Capability of a Single Model**: Current methods typically require a single model to possess multiple functions, increasing the model's burden and making it difficult to efficiently complete multiple tasks. 3. **Limitations in Planning Capability**: Existing methods often depend on closed-source models or large amounts of annotated data to endow language models with planning capabilities, limiting the application of open-source models. To address these issues, the authors introduce **AUTOACT**, an automatic agent learning framework. This framework does not rely on large-scale annotated data and trajectories synthesized by closed-source models. Instead, it enables agents to automatically synthesize planning trajectories and decompose tasks through Self-Planning and a Division-of-Labor Strategy, thereby improving the efficiency and accuracy of task completion.

AUTOACT: Automatic Agent Learning from Scratch via Self-Planning

AutoAct: Automatic Agent Learning from Scratch for QA Via Self-Planning

AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML

Agents: An Open-source Framework for Autonomous Language Agents

AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning

KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents

Towards Autonomous Agents: Adaptive-planning, Reasoning, and Acting in Language Models

ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy

Ask-before-Plan: Proactive Language Agents for Real-World Planning

PlanAgent: A Multi-modal Large Language Agent for Closed-loop Vehicle Motion Planning

A Language Agent for Autonomous Driving

AutoDroid-V2: Boosting SLM-based GUI Agents via Code Generation

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Empowering Large Language Model Agents through Action Learning

Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents

You Only Look at Screens: Multimodal Chain-of-Action Agents

DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production

Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

ProAgent: Building Proactive Cooperative Agents with Large Language Models