ProAgent: From Robotic Process Automation to Agentic Process Automation

Yining Ye, Xin Cong, Shizuo Tian, Jiannan Cao, Hao Wang, Yujia Qin, Yaxi Lu, Heyang Yu, Huadong Wang, Yankai Lin, Zhiyuan Liu, Maosong Sun
2023-11-23
Abstract: From ancient water wheels to robotic process automation (RPA), automation technology has evolved throughout history to liberate human beings from arduous tasks. Yet RPA struggles with tasks that need human-like intelligence, especially the elaborate design involved in workflow construction and the dynamic decision-making involved in workflow execution. As Large Language Models (LLMs) have exhibited emergent human-like intelligence, this paper introduces Agentic Process Automation (APA), a groundbreaking automation paradigm that uses LLM-based agents for advanced automation by offloading the human labor associated with construction and execution to agents. We then instantiate ProAgent, an LLM-based agent designed to craft workflows from human instructions and make intricate decisions by coordinating specialized agents. Empirical experiments detail its workflow construction and execution procedures, showcasing the feasibility of APA and unveiling the possibility of a new paradigm of automation driven by agents. Our code is public at https://github.com/OpenBMB/ProAgent.
Robotics, Artificial Intelligence, Computation and Language
What problem does this paper attempt to address?
The paper addresses the limitations of current Robotic Process Automation (RPA) technology on tasks that require human-like intelligence, particularly the design of complex workflows and the dynamic decision-making needed while executing them. It proposes a new automation paradigm, Agentic Process Automation (APA), in which agents based on Large Language Models (LLMs) construct and execute complex workflows automatically. Specifically, APA addresses the issues of existing RPA technology in two ways:

1. **Agentic Workflow Construction**: Upon receiving human instructions or requirements, an LLM-based agent constructs the corresponding workflow, so no manual effort is needed for the complex design.
2. **Agentic Workflow Execution**: When dynamic decision-making is required during workflow execution, the agent intervenes to handle it. This covers complex data processing in the data flow as well as complex conditional branches and loops in the control flow.

To validate the feasibility of the APA concept, the researchers built ProAgent, a concrete LLM-based agent. ProAgent generates the corresponding workflow upon receiving human instructions and handles the dynamic decision-making parts during workflow execution. The paper also introduces a language for describing workflows, the Agentic Workflow Description Language, which is based on JSON structure and Python code so that LLMs can easily understand and generate workflows. Additionally, the paper proposes two types of agents: DataAgent, which handles complex data flows, and ControlAgent, which acts as a conditional expression that dynamically determines the subsequent branch.

Finally, proof-of-concept experiments demonstrated ProAgent's ability to construct workflows from human instructions and to use agents for the dynamic decision-making parts during execution. These experiments were based on the open-source workflow platform n8n and showcased how ProAgent works in commercial scenarios, such as automatically sending messages to Slack or drafting and sending emails to the appropriate managers, depending on the type of business line (customer-facing or enterprise-facing).
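
To make the roles of the two agent types more concrete, here is a minimal Python sketch of a workflow in the spirit of the description above: a JSON object describes an action's inputs and outputs, and a Python function carries the control flow. Everything in it is an assumption made for illustration, not the paper's actual schema or API: the field names in `ACTION_SPEC`, the functions `control_agent`, `data_agent`, `send_slack`, and `send_email`, and the mapping of customer-facing records to Slack and enterprise-facing records to email. The agents are plain rule-based stand-ins where a real system would call an LLM, and the send functions only return a record instead of contacting Slack or a mail server.

```python
import json
from typing import Dict

# Hypothetical JSON description of one workflow action.
# The field names are illustrative assumptions, not the paper's schema.
ACTION_SPEC = json.loads("""
{
  "name": "route_business_line",
  "inputs": {"business_line": "string", "summary": "string"},
  "outputs": {"channel": "string", "message": "string"}
}
""")


def control_agent(task: str, data: Dict[str, str]) -> str:
    """Stand-in for a ControlAgent: returns the label of the branch to take.

    A real agent would pass `task` and `data` to an LLM; here a keyword
    check plays that role so the sketch runs offline.
    """
    if "customer" in data["business_line"].lower():
        return "customer"
    return "enterprise"


def data_agent(task: str, data: Dict[str, str]) -> str:
    """Stand-in for a DataAgent: turns input data into free-form text.

    A real agent would ask an LLM to write the draft.
    """
    return f"Dear manager, please review the following: {data['summary']}"


def send_slack(text: str) -> Dict[str, str]:
    # Placeholder: returns a record instead of calling the Slack API.
    return {"channel": "slack", "message": text}


def send_email(text: str) -> Dict[str, str]:
    # Placeholder: returns a record instead of sending an email.
    return {"channel": "email", "message": text}


def workflow(record: Dict[str, str]) -> Dict[str, str]:
    """Control flow in Python; the branch condition is delegated to an agent."""
    branch = control_agent("Decide which business line this record belongs to.", record)
    if branch == "customer":
        return send_slack(record["summary"])
    draft = data_agent("Draft an email to the responsible manager.", record)
    return send_email(draft)


if __name__ == "__main__":
    print(workflow({"business_line": "customer-facing", "summary": "Q3 report ready"}))
    print(workflow({"business_line": "enterprise-facing", "summary": "Q3 report ready"}))
```

The design point the sketch tries to capture is that the fixed parts of the process stay ordinary code, while only the branch condition and the text generation are handed to agents.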