Abstract:The advent of large language models (LLMs) such as ChatGPT, PaLM, and GPT-4 has catalyzed remarkable advances in natural language processing, demonstrating human-like language fluency and reasoning capacities. This position paper introduces the concept of Professional Agents (PAgents), an application framework harnessing LLM capabilities to create autonomous agents with controllable, specialized, interactive, and professional-level competencies. We posit that PAgents can reshape professional services through continuously developed expertise. Our proposed PAgents framework entails a tri-layered architecture for genesis, evolution, and synergy: a base tool layer, a middle agent layer, and a top synergy layer. This paper aims to spur discourse on promising real-world applications of LLMs. We argue the increasing sophistication and integration of PAgents could lead to AI systems exhibiting professional mastery over complex domains, serving critical needs, and potentially achieving artificial general intelligence.

What problem does this paper attempt to address?

The paper aims to explore how to construct professional-level autonomous intelligent agents (Professional Agents, abbreviated as PAgents) through large language models (LLMs) to achieve artificial general intelligence (AGI). Specifically, the paper proposes a three-layer architectural framework to generate, evolve, and collaborate these professional agents: 1. **Basic Tools Layer**: This layer provides a wide range of technical toolsets, including basic tools, development resources, knowledge bases, search capabilities, and AI systems, laying the foundation for the infrastructure of professional agents and supporting their continuous improvement. 2. **Intermediate Agent Layer**: This layer contains a series of independent, controllable, and interactive professional agents, each with specific professional roles and the key capabilities required. 3. **Top Collaboration Layer**: This layer integrates various professional agents into a comprehensive network, collaboratively handling complex tasks, similar to the cooperation model of human professional teams. The paper also details the four core components of PAgents: the role module (defining the professional identity and capabilities of the agent), the perception module (handling multimodal data inputs), the brain module (responsible for advanced functions such as cognition, memory, planning, and reasoning), and the action module (translating decisions into concrete outcomes). Additionally, the paper proposes a series of steps for constructing and evolving PAgents, including defining professional roles, developing perception modules, establishing brain modules, constructing action modules, integration testing, deployment monitoring, and continuous learning and adaptation. The ultimate goal is for PAgents to demonstrate capabilities similar to or even surpassing human experts in various professional fields, thereby driving revolutionary changes in the professional services sector.

Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

Autonomous Agents in Software Development: A Vision Paper

Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects

The Rise and Potential of Large Language Model Based Agents: A Survey

Enhancing Pipeline-Based Conversational Agents with Large Language Models

Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents

TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage

Agents: An Open-source Framework for Autonomous Language Agents

Exploring Autonomous Agents through the Lens of Large Language Models: A Review

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Transforming Agency. On the mode of existence of Large Language Models

TrainerAgent: Customizable and Efficient Model Training Through LLM-Powered Multi-Agent System.

Artificial Agency and Large Language Models

STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

ProAgent: Building Proactive Cooperative Agents with Large Language Models

TPTU: Task Planning and Tool Usage of Large Language Model-based AI Agents

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

ProAgent: From Robotic Process Automation to Agentic Process Automation

ExpeL: LLM Agents Are Experiential Learners