SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Ruihan Yang,Jiangjie Chen,Yikai Zhang,Siyu Yuan,Aili Chen,Kyle Richardson,Yanghua Xiao,Deqing Yang
2024-06-07
Abstract:Language agents powered by large language models (LLMs) are increasingly valuable as decision-making tools in domains such as gaming and programming. However, these agents often face challenges in achieving high-level goals without detailed instructions and in adapting to environments where feedback is delayed. In this paper, we present SelfGoal, a novel automatic approach designed to enhance agents' capabilities to achieve high-level goals with limited human prior and environmental feedback. The core concept of SelfGoal involves adaptively breaking down a high-level goal into a tree structure of more practical subgoals during the interaction with environments while identifying the most useful subgoals and progressively updating this structure. Experimental results demonstrate that SelfGoal significantly enhances the performance of language agents across various tasks, including competitive, cooperative, and deferred feedback environments. Project page: <a class="link-external link-https" href="https://selfgoal-agent.github.io" rel="external noopener nofollow">this https URL</a>.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: how to enable autonomous language agents to consistently achieve high - level goals without frequent retraining. Specifically, existing large - language models (LLMs), as decision - making tools, are increasingly valuable in fields such as gaming and programming, but these agents have difficulty achieving high - level goals without detailed instructions and have limited adaptability in environments with feedback delays. The paper proposes a new method named SELFGOAL, which aims to enhance the capabilities of agents by adaptively decomposing high - level goals into more practical sub - goal tree structures, while identifying the most useful sub - goals and gradually updating this structure, thereby significantly improving the performance of language agents in a variety of tasks, including tasks in competitive, cooperative, and delayed - feedback environments.