Abstract:From ancient water wheels to robotic process automation (RPA), automation technology has evolved throughout history to liberate human beings from arduous tasks. Yet, RPA struggles with tasks needing human-like intelligence, especially in elaborate design of workflow construction and dynamic decision-making in workflow execution. As Large Language Models (LLMs) have emerged human-like intelligence, this paper introduces Agentic Process Automation (APA), a groundbreaking automation paradigm using LLM-based agents for advanced automation by offloading the human labor to agents associated with construction and execution. We then instantiate ProAgent, an LLM-based agent designed to craft workflows from human instructions and make intricate decisions by coordinating specialized agents. Empirical experiments are conducted to detail its construction and execution procedure of workflow, showcasing the feasibility of APA, unveiling the possibility of a new paradigm of automation driven by agents. Our code is public at <a class="link-external link-https" href="https://github.com/OpenBMB/ProAgent" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

The paper aims to address the limitations of current Robotic Process Automation (RPA) technology in handling tasks that require human intelligence, particularly in the dynamic decision-making issues within complex workflow design and execution. The paper proposes a new automation paradigm—Agentic Process Automation (APA), which leverages Large Language Models (LLMs) to create workflows capable of automatically constructing and executing complex tasks. Specifically, APA addresses the issues of existing RPA technology through the following two aspects: 1. **Agentic Workflow Construction**: Upon receiving human instructions or requirements, an LLM-based agent can construct the corresponding workflow without the need for manual intervention in complex design. 2. **Agentic Workflow Execution**: During the execution of the workflow, when dynamic decision-making is required, the agent intervenes to handle these decisions, including complex data processing in the data flow and complex conditional branches and loops in the control flow. To validate the feasibility of the APA concept, the researchers developed an instantiated LLM-based agent—PROAGENT. PROAGENT can generate the corresponding workflow upon receiving human instructions and handle the dynamic decision-making parts during workflow execution. The paper also introduces a language for describing workflows—Agentic Workflow Description Language, which is based on JSON structure and Python code, making it easy for LLM to understand and generate workflows. Additionally, the paper proposes two types of agents: DataAgent, responsible for handling complex data flows, and ControlAgent, which acts as a conditional expression to dynamically determine subsequent branches. Finally, proof-of-concept experiments demonstrated PROAGENT's ability to construct workflows based on human instructions and utilize agents to handle dynamic decision-making parts during execution. These experiments were based on an open-source workflow platform n8n, showcasing how PROAGENT works in commercial scenarios, such as automatically sending messages to Slack or drafting and sending emails to the appropriate managers based on different types of business lines (customer-facing or enterprise-facing).

ProAgent: From Robotic Process Automation to Agentic Process Automation

ProAgent: Building Proactive Cooperative Agents with Large Language Models

Agent S: An Open Agentic Framework that Uses Computers Like a Human

AutoAct: Automatic Agent Learning from Scratch for QA Via Self-Planning

Proactive Agent: Shifting LLM Agents from Reactive Responses to Active Assistance

AutoFlow: Automated Workflow Generation for Large Language Model Agents

A Multi-AI Agent System for Autonomous Optimization of Agentic AI Solutions via Iterative Refinement and LLM-Driven Feedback Loops

BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration

Professional Agents -- Evolving Large Language Models into Autonomous Experts with Human-Level Competencies

A multi-agent-driven robotic AI chemist enabling autonomous chemical research on demand

AppAgent v2: Advanced Agent for Flexible Mobile Interactions

Intelligent Agents with LLM-based Process Automation

AssistantX: An LLM-Powered Proactive Assistant in Collaborative Human-Populated Environment

Agents: An Open-source Framework for Autonomous Language Agents

CAAP: Context-Aware Action Planning Prompting to Solve Computer Tasks with Front-End UI Only

Automated Design of Agentic Systems

Turn Every Application into an Agent: Towards Efficient Human-Agent-Computer Interaction with API-First LLM-Based Agents

GraphAgent: Agentic Graph Language Assistant

OpenWebAgent: An Open Toolkit to Enable Web Agents on Large Language Models

Enhancing AI Systems with Agentic Workflows Patterns in Large Language Model