Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

Zhixuan Chu, Yan Wang, Feng Zhu, Lu Yu, Longfei Li, Jinjie Gu
2024-02-06
Abstract: The advent of large language models (LLMs) such as ChatGPT, PaLM, and GPT-4 has catalyzed remarkable advances in natural language processing, demonstrating human-like language fluency and reasoning capacities. This position paper introduces the concept of Professional Agents (PAgents), an application framework harnessing LLM capabilities to create autonomous agents with controllable, specialized, interactive, and professional-level competencies. We posit that PAgents can reshape professional services through continuously developed expertise. Our proposed PAgents framework entails a tri-layered architecture for genesis, evolution, and synergy: a base tool layer, a middle agent layer, and a top synergy layer. This paper aims to spur discourse on promising real-world applications of LLMs. We argue the increasing sophistication and integration of PAgents could lead to AI systems exhibiting professional mastery over complex domains, serving critical needs, and potentially achieving artificial general intelligence.
Computation and Language
What problem does this paper attempt to address?
The paper explores how large language models (LLMs) can be used to construct professional-level autonomous agents (Professional Agents, abbreviated as PAgents), with artificial general intelligence (AGI) as a possible long-term outcome. Specifically, it proposes a three-layer architecture for creating, evolving, and coordinating these agents:

1. **Basic Tools Layer**: Provides the technical toolset the agents build on, including basic tools, development resources, knowledge bases, search capabilities, and AI systems. It forms the infrastructure for professional agents and supports their continuous improvement.
2. **Intermediate Agent Layer**: Contains a set of independent, controllable, and interactive professional agents, each with a specific professional role and the key capabilities that role requires.
3. **Top Collaboration Layer**: Integrates the professional agents into a single network that handles complex tasks collaboratively, much as a team of human professionals would.

The paper also details the four core components of a PAgent: the role module (defining the agent's professional identity and capabilities), the perception module (handling multimodal data inputs), the brain module (responsible for higher-level functions such as cognition, memory, planning, and reasoning), and the action module (translating decisions into concrete outcomes).

Additionally, the paper proposes a sequence of steps for constructing and evolving PAgents: defining the professional role, developing the perception module, establishing the brain module, constructing the action module, integrating and testing the modules, deploying and monitoring the agent, and continuously learning and adapting.

The ultimate goal is for PAgents to match or even surpass human experts in various professional fields, thereby driving major changes in the professional services sector.
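To make the layering concrete, the following is a minimal Python sketch of how the three layers and four modules described above might fit together. It is an illustration under stated assumptions, not the paper's implementation: every name here (`Role`, `ProfessionalAgent`, `SynergyLayer`, `BASE_TOOLS`, `dispatch`) is hypothetical, the "brain" is a keyword match standing in for an LLM call, and perception handles text only rather than multimodal input.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Base tool layer (assumed shape): plain callables standing in for
# search, knowledge bases, and other shared resources.
BASE_TOOLS: Dict[str, Callable[[str], str]] = {
    "search": lambda q: f"searched for '{q}'",
    "summarize": lambda q: f"summary of '{q}'",
}


@dataclass
class Role:
    """Role module: professional identity and declared capabilities."""
    title: str
    capabilities: List[str]


@dataclass
class ProfessionalAgent:
    """Middle-layer agent wiring role, perception, brain, and action."""
    role: Role
    tools: Dict[str, Callable[[str], str]]  # drawn from the base tool layer
    memory: List[str] = field(default_factory=list)

    def perceive(self, raw: str) -> str:
        # Perception module: normalize the input (text only in this sketch).
        return raw.strip().lower()

    def think(self, observation: str) -> str:
        # Brain module: remember the observation, then pick the first tool
        # whose name appears in it. A real agent would query an LLM here.
        self.memory.append(observation)
        for name in self.tools:
            if name in observation:
                return name
        return ""

    def act(self, tool_name: str, observation: str) -> str:
        # Action module: turn the decision into a concrete result.
        if not tool_name:
            return f"{self.role.title}: no suitable tool"
        return f"{self.role.title}: {self.tools[tool_name](observation)}"

    def handle(self, raw: str) -> str:
        observation = self.perceive(raw)
        return self.act(self.think(observation), observation)


class SynergyLayer:
    """Top layer: routes a task to every agent with a matching capability."""

    def __init__(self, agents: List[ProfessionalAgent]) -> None:
        self.agents = agents

    def dispatch(self, capability: str, task: str) -> List[str]:
        return [
            agent.handle(task)
            for agent in self.agents
            if capability in agent.role.capabilities
        ]
```

A team could then be assembled by giving each agent a role and a subset of `BASE_TOOLS`, and calling `SynergyLayer.dispatch` with the capability a task requires. Routing on a declared capability is only one possible coordination scheme; the paper's collaboration layer is described at a higher level and does not prescribe this mechanism.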