Abstract: This paper introduces Investigate-Consolidate-Exploit (ICE), a novel strategy for enhancing the adaptability and flexibility of AI agents through inter-task self-evolution. Unlike existing methods focused on intra-task learning, ICE promotes the transfer of knowledge between tasks for genuine self-evolution, similar to human experience learning. The strategy dynamically investigates planning and execution trajectories, consolidates them into simplified workflows and pipelines, and exploits them for improved task execution. Our experiments on the XAgent framework demonstrate ICE's effectiveness, reducing API calls by as much as 80% and significantly decreasing the demand for the model's capability. Specifically, when combined with GPT-3.5, ICE's performance matches that of raw GPT-4 across various agent tasks. We argue that this self-evolution approach represents a paradigm shift in agent design, contributing to a more robust AI community and ecosystem, and moving a step closer to full autonomy.

What problem does this paper attempt to address?

This paper aims to improve the adaptability and flexibility of AI agents on new tasks through inter-task self-evolution. Existing methods mainly focus on intra-task learning. The paper instead proposes a new strategy, Investigate-Consolidate-Exploit (ICE), which promotes knowledge transfer between different tasks and thereby achieves genuine self-evolution, similar to how humans learn from experience. ICE dynamically investigates planning and execution trajectories, consolidates them into simplified workflows and pipelines, and exploits them to improve task execution efficiency.

Specifically, the paper makes the following points:

1. **Cross-task learning**: Unlike existing methods, ICE emphasizes cross-task knowledge transfer, so that an agent not only performs well on a single task but also applies its experience to new tasks, improving overall performance.
2. **Dynamic investigation**: ICE identifies experiences worth learning from by dynamically tracking the planning and execution of each task. This includes monitoring the generation of the initial plan, any plan revisions, and the status of each goal or sub-goal.
3. **Experience consolidation**: ICE standardizes successful plans and execution trajectories into workflows and pipelines that are easy to reuse. This involves removing failed goals or sub-goals and converting the successful goals into a linear structure for easy learning and reference.
4. **Experience exploitation**: ICE improves the planning and execution efficiency of new tasks by retrieving and exploiting previously consolidated experiences. This includes retrieving plans for similar goals during initial plan generation and plan revision, and directly applying relevant pipelines when executing new tasks.

Through these strategies, the paper demonstrates the effectiveness of ICE, especially in reducing the number of API calls, lowering the requirements on model capability, and increasing the task completion rate. Experimental results show that ICE significantly improves the agent's task execution efficiency and the soundness of its plans, and that a weaker model (such as GPT-3.5) using ICE can achieve performance comparable to that of a stronger model (such as GPT-4).
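
The consolidation and exploitation steps described above can be illustrated with a minimal Python sketch. It is not the paper's or XAgent's implementation: the `Goal` class, the `consolidate` and `retrieve` functions, the status strings, and the 0.5 similarity threshold are all hypothetical names and values chosen for illustration, and plain string similarity from the standard library stands in for whatever retrieval mechanism the authors actually use.

```python
from dataclasses import dataclass, field
from difflib import SequenceMatcher
from typing import Optional


@dataclass
class Goal:
    """Hypothetical planning-tree node; `status` is what investigation records."""
    name: str
    status: str = "pending"  # assumed values: "success", "failed", "pending"
    children: list["Goal"] = field(default_factory=list)


def consolidate(root: Goal) -> list[str]:
    """Drop failed goals and flatten the successful leaf goals into a linear workflow."""
    if root.status == "failed":
        return []  # a failed goal is pruned together with its sub-goals
    if not root.children:
        return [root.name] if root.status == "success" else []
    workflow: list[str] = []
    for child in root.children:
        workflow.extend(consolidate(child))
    return workflow


def retrieve(task: str, memory: dict[str, list[str]],
             threshold: float = 0.5) -> Optional[list[str]]:
    """Return the stored workflow of the most similar past task, or None."""
    best, best_score = None, threshold
    for past_task, workflow in memory.items():
        # String similarity is an assumption made to keep the sketch dependency-free.
        score = SequenceMatcher(None, task.lower(), past_task.lower()).ratio()
        if score >= best_score:
            best, best_score = workflow, score
    return best


# Investigate: a finished task whose goal statuses were tracked during execution.
plan = Goal("write report", "success", [
    Goal("search web", "success"),
    Goal("scrape paywalled site", "failed"),
    Goal("summarize sources", "success"),
])

# Consolidate: store the pruned, linearized workflow under the task description.
memory = {"write report on solar power": consolidate(plan)}

# Exploit: a similar new task reuses the stored workflow instead of replanning.
reused = retrieve("write report on wind power", memory)
print(reused)
```

In this sketch the failed sub-goal is removed during consolidation, so the reused workflow contains only the two successful steps. An unrelated task description falls below the threshold and returns `None`, in which case an agent would fall back to planning from scratch.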

Investigate-Consolidate-Exploit: A General Strategy for Inter-Task Agent Self-Evolution

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Imagine, Initialize, and Explore: An Effective Exploration Method in Multi-Agent Reinforcement Learning

In-Context Exploiter for Extensive-Form Games

Autonomous Agents in Software Development: A Vision Paper

Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement

Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models

Interesting Object, Curious Agent: Learning Task-Agnostic Exploration

Evolving Agents: Interactive Simulation of Dynamic and Diverse Human Personalities

In-context Exploration-Exploitation for Reinforcement Learning

InterAct: Exploring the Potentials of ChatGPT as a Cooperative Agent

Self-Motivated Multi-Agent Exploration

Symbolic Learning Enables Self-Evolving Agents

Influence-Based Multi-Agent Exploration

Progressively Efficient Learning

Long Term Memory: The Foundation of AI Self-Evolution

Theory of Collective Intelligence Evolution and Its Applications in Intelligent Robots

MobileExperts: A Dynamic Tool-Enabled Agent Team in Mobile Devices

AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization