search
HomeTechnology peripheralsAIProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

  • Project address: https://github.com/OpenBMB/ProAgent
  • Paper address: https://github.com/OpenBMB/ProAgent/blob/main/paper/paper.pdf

In the development of human technology Throughout history, automation has been the main driving force, helping humans to free themselves from complex, dangerous, and tedious labor environments. From waterwheel irrigation in the early agricultural era to steam engines in the industrial era, humans have been constantly pursuing more advanced automation technologies to liberate themselves from heavy work

With the information age With the arrival of , software, as the basis for information processing, storage and communication, has become an inseparable part of human production and life, thus catalyzing the formation of Robotic Process Automation (RPA) technology. It coordinates multiple software into a solidified workflow (Workflow) through manually compiled rules, and interacts with software to achieve efficient execution by simulating human interaction.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

In this diagram we compare Robotic Process Automation (RPA) with Agent Process Automation (APA)

RPA (Robotic Process Automation) uses software robots or "BOTs" to simulate and perform repetitive and regular tasks to free up human resources and improve work efficiency. The application range of RPA is very wide. Many enterprises (including banks, insurance companies, manufacturing, retail and other industries) usually use RPA robots to automate routine and tedious tasks, such as data entry, data extraction, and data processing. By automating tasks, RPA can significantly reduce error rates and be able to perform tasks 24*7, thereby improving business reliability and responsiveness

According to market research, the RPA market is growing rapidly and achieving great success. Gartner predicts that global RPA market revenue will reach US$3.3 billion by 2023, with a growth rate of 17.5%. This shows that enterprises have a very high demand and recognition for RPA

However, RPA can only replace simple, mechanical human work, and some complex processes still rely on manual labor:

  1. Writing RPA workflow itself requires heavy human labor and is costly.
  2. Complex tasks are very flexible and usually involve dynamic decision-making, which is difficult to solidify into rules for expression.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

Figure 2 Comparison of efficiency and intelligence between RPA and APA

Fortunately, The recent emergence of large language model agent technology (Large Language Model based Agents, LLM-based Agents) in the field of AI may create new possibilities for automation technology. Is it possible to introduce the flexibility of Agent technology into the RPA field to further reduce human participation?

The team's research explores the new automation paradigm "Agentic Process Automation" (APA) in the era of large-model agents. Compared with traditional RPA, in the APA paradigm, the Agent can autonomously complete the workflow construction according to human needs. At the same time, it can identify the parts of human needs that require dynamic decision-making, automatically orchestrate them into the workflow, and execute the workflow when the workflow is executed. This part actively takes over the execution of the workflow to complete corresponding complex decisions.

In order to explore the possibilities of APA, this research work implemented an automated agent ProAgent, which can receive human instructions and build workflows by generating code while also being in the workflow DataAgent and ControlAgent are introduced to implement complex data processing and logical control in workflow. ProAgent's research demonstrates the feasibility of APA in the era of large-model agents, and also reveals new possibilities for automation technology in the era of LLM.

Method introduction

In RPA, the workflow is a graph structure composed of a series of tool calls: nodes represent atomic tool calls (such as Gmail, Twitter, Google Sheets), while edges represent the logical sequence of execution (connection, branch, loop). A workflow usually contains all prior knowledge of a task or a type of task, including problem solving paths and exception handling logic. Therefore, writing fixed workflows is often very stable, thorough and efficient

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

Figure 3 Example of agent workflow description language

In ProAgent, since LLM itself is pre-trained in the code data , and learned strong coding capabilities, this research is based on the code-based Agentic Workflow Description Language. This language uses JSON to organize and manage data in the workflow, and uses Python syntax to implement logical control of the workflow. Jumps, loops, etc. in the control flow are directly represented through Python syntax, while the tools in the workflow are The call is encapsulated as a Python Function. So for ProAgent, workflow building tasks are transformed into code generation tasks. When receiving human instructions, ProAgent writes the corresponding Agentic Workflow Description Language, thereby realizing automated workflow construction.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

Figure 4 Example of agent workflow description language combining DataAgent and ControlAgent

Complex reality Tasks usually involve dynamic decision-making, and simple Python-style logic control rules and JSON-style data organization are ineffective when facing flexible needs. At this time, agents need to be introduced. Therefore, this research work further defines two Agent operations:

1. DataAgent: For a complex data processing requirement, natural language will be used to describe the processing when building the workflow. The task will then initialize a DataAgent when executed, which will autonomously process and complete the data processing task based on the natural language description.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

2. ControlAgent: For logical control rules that are difficult to express with rules, use natural language to describe the control logic when building the workflow, and then A ControlAgent will be initialized at runtime, which will autonomously select the branch that needs to be executed later in the workflow based on the natural language description.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

ProAgent uses ReACT mode to build workflow step by step, which contains four workflow construction steps:

  1. Action_Define: Decide what tools to add to the workflow.
  2. Action Implement: Convert the input/output parameters of the tool into a JSON structure, and encapsulate the call of the tool into a Python function.
  3. Workflow Implement: Define a mainWorkflow function to organize the logic control and data processing of the entire workflow.
  4. Task Submit: When ProAgent completes building the workflow, this operation identifies the end of the build process.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

The example shows Figure 5 of the ProAgent workflow building process

In addition, In order to optimize the effect of ProAgent, several optimization techniques are introduced:

  1. 1.Testing-on-Constructing: During the construction process, ProAgent will modify the workflow once Test the workflow to ensure its correctness.
  2. Function Calling: All operations of workflow construction are encapsulated into GPT-4 Functions, thereby improving control over the workflow construction process.
  3. Chain-of-Thought: When ProAgent writes workflow code, it needs to give comments and a writing plan for each function to improve the performance of ProAgent workflow construction. .

The workflow execution process is based on the Python interpreter. When a workflow is given, the corresponding mainWorkflow function is used as the entry point for execution, thus starting the entire execution process. The execution process follows the execution rules of Python code, that is, it is executed line by line in order. Once the mainWorkflow function returns, execution of the workflow has completed successfully

Feasibility Verification

In order to verify the feasibility of Agentic Process Automation, this research uses OpenAI GPT-4 as the basic model and an open source RPA platform n8n as The carrier implements the above-mentioned ProAgent. At the same time, we designed a task that requires both flexibility and efficiency: this is a typical business scenario, which requires extracting profit data of various business lines from Google Sheets, and determining subsequent actions based on whether the business is 2B or 2C. Once the line of business is determined to be 2C, a message is sent to the Slack channel. For business lines in 2B, an email is sent to the respective manager, which includes an assessment of the business line and a brief profitability overview.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

Figure 6 Task Instruction Display

The content that needs to be rewritten is: For this task , First of all, it is a repetitive task, and the same process should be adopted for multiple product lines. Secondly, it is very difficult to distinguish whether a business line is 2C or 2B, and it requires dynamic decision-making by the Agent to determine the subsequent workflow. Finally, writing the evaluation email of the business line requires a certain amount of intelligence, so the intervention of the Agent is required

In the ProAgent generation, for this task, a program containing four atomic operations was written. Workflow for a DataAgent and a ControlAgent. The overall process is roughly as shown in the figure below:

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universitiesFigure 7 ProAgent workflow construction process display

It can be seen that ProAgent automatically The way of writing code automatically completes the workflow construction process without manual intervention. When it is necessary to determine whether the business line is 2B or 2C, ProAgent introduces ControlAgent to make the judgment. The Prompt of ControlAgent is set to "Decide Whether the business line is toC or toB". When the business line is 2B, ProAgent also introduces a DataAgent, whose task is set to "Write an email of the business line of profit, together with your suggestion", thus using the intelligence of the agent to write based on the actual situation of different business lines. mail.

After the workflow is written and solidified, the workflow will automatically branch to different logic according to different data for efficient data processing.

ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities

Figure 8 ProAgent workflow execution process display

When processing 2C business line data, ControlAgent You can determine the type of the current business line based on the business line description and choose to use the Slack tool for communication. When processing 2B business line data, DataAgent can compose an email and send it to the corresponding manager's mailbox

Summary

This study proposes A new automation paradigm - Agentic Process Automation is developed, suitable for the era of large models. Compared with traditional Robotic Process Automation technology, Agentic Process Automation can automate the construction of workflows and realize the automation of dynamic decisions during workflow execution. The research also further developed ProAgent and experimentally demonstrated the feasibility and potential of large-model agents in automation. I believe that in the future, large model agent technology will help humans achieve a higher level of automation and liberate themselves from heavy labor

Currently, the research team has conducted many studies in the direction of large model agents, including:

  • XAgent: a super powerful model agent application framework that can dismantle complex tasks on its own. and execute efficiently.
  • Project address: https://github.com/OpenBMB/XAgent
  • ChatDev: a multi-agent collaborative development framework that allows multiple Agents with different roles collaborate to automatically develop software applications.
  • Project address: https://github.com/OpenBMB/ChatDev
  • AgentVerse: A general platform for large model-driven agents, recruiting A variety of agent experts work together to help users solve complex tasks.
  • Project address: https://github.com/OpenBMB/AgentVerse

The above is the detailed content of ProAgent: Intelligent agents led by OpenAI liberate manpower, released by Tsinghua University and other universities. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
解读CRISP-ML(Q):机器学习生命周期流程解读CRISP-ML(Q):机器学习生命周期流程Apr 08, 2023 pm 01:21 PM

译者 | 布加迪审校 | 孙淑娟目前,没有用于构建和管理机器学习(ML)应用程序的标准实践。机器学习项目组织得不好,缺乏可重复性,而且从长远来看容易彻底失败。因此,我们需要一套流程来帮助自己在整个机器学习生命周期中保持质量、可持续性、稳健性和成本管理。图1. 机器学习开发生命周期流程使用质量保证方法开发机器学习应用程序的跨行业标准流程(CRISP-ML(Q))是CRISP-DM的升级版,以确保机器学习产品的质量。CRISP-ML(Q)有六个单独的阶段:1. 业务和数据理解2. 数据准备3. 模型

人工智能的环境成本和承诺人工智能的环境成本和承诺Apr 08, 2023 pm 04:31 PM

人工智能(AI)在流行文化和政治分析中经常以两种极端的形式出现。它要么代表着人类智慧与科技实力相结合的未来主义乌托邦的关键,要么是迈向反乌托邦式机器崛起的第一步。学者、企业家、甚至活动家在应用人工智能应对气候变化时都采用了同样的二元思维。科技行业对人工智能在创建一个新的技术乌托邦中所扮演的角色的单一关注,掩盖了人工智能可能加剧环境退化的方式,通常是直接伤害边缘人群的方式。为了在应对气候变化的过程中充分利用人工智能技术,同时承认其大量消耗能源,引领人工智能潮流的科技公司需要探索人工智能对环境影响的

找不到中文语音预训练模型?中文版 Wav2vec 2.0和HuBERT来了找不到中文语音预训练模型?中文版 Wav2vec 2.0和HuBERT来了Apr 08, 2023 pm 06:21 PM

Wav2vec 2.0 [1],HuBERT [2] 和 WavLM [3] 等语音预训练模型,通过在多达上万小时的无标注语音数据(如 Libri-light )上的自监督学习,显著提升了自动语音识别(Automatic Speech Recognition, ASR),语音合成(Text-to-speech, TTS)和语音转换(Voice Conversation,VC)等语音下游任务的性能。然而这些模型都没有公开的中文版本,不便于应用在中文语音研究场景。 WenetSpeech [4] 是

条形统计图用什么呈现数据条形统计图用什么呈现数据Jan 20, 2021 pm 03:31 PM

条形统计图用“直条”呈现数据。条形统计图是用一个单位长度表示一定的数量,根据数量的多少画成长短不同的直条,然后把这些直条按一定的顺序排列起来;从条形统计图中很容易看出各种数量的多少。条形统计图分为:单式条形统计图和复式条形统计图,前者只表示1个项目的数据,后者可以同时表示多个项目的数据。

自动驾驶车道线检测分类的虚拟-真实域适应方法自动驾驶车道线检测分类的虚拟-真实域适应方法Apr 08, 2023 pm 02:31 PM

arXiv论文“Sim-to-Real Domain Adaptation for Lane Detection and Classification in Autonomous Driving“,2022年5月,加拿大滑铁卢大学的工作。虽然自主驾驶的监督检测和分类框架需要大型标注数据集,但光照真实模拟环境生成的合成数据推动的无监督域适应(UDA,Unsupervised Domain Adaptation)方法则是低成本、耗时更少的解决方案。本文提出对抗性鉴别和生成(adversarial d

数据通信中的信道传输速率单位是bps,它表示什么数据通信中的信道传输速率单位是bps,它表示什么Jan 18, 2021 pm 02:58 PM

数据通信中的信道传输速率单位是bps,它表示“位/秒”或“比特/秒”,即数据传输速率在数值上等于每秒钟传输构成数据代码的二进制比特数,也称“比特率”。比特率表示单位时间内传送比特的数目,用于衡量数字信息的传送速度;根据每帧图像存储时所占的比特数和传输比特率,可以计算数字图像信息传输的速度。

聊一聊Python 实现数据的序列化操作聊一聊Python 实现数据的序列化操作Apr 12, 2023 am 09:31 AM

​在日常开发中,对数据进行序列化和反序列化是常见的数据操作,Python提供了两个模块方便开发者实现数据的序列化操作,即 json 模块和 pickle 模块。这两个模块主要区别如下:json 是一个文本序列化格式,而 pickle 是一个二进制序列化格式;json 是我们可以直观阅读的,而 pickle 不可以;json 是可互操作的,在 Python 系统之外广泛使用,而 pickle 则是 Python 专用的;默认情况下,json 只能表示 Python 内置类型的子集,不能表示自定义的

数据分析方法有哪几种数据分析方法有哪几种Dec 15, 2020 am 09:48 AM

数据分析方法有4种,分别是:1、趋势分析,趋势分析一般用于核心指标的长期跟踪;2、象限分析,可依据数据的不同,将各个比较主体划分到四个象限中;3、对比分析,分为横向对比和纵向对比;4、交叉分析,主要作用就是从多个维度细分数据。

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools