search
HomeTechnology peripheralsAIAgents divide work and collaborate like people, and can also exchange information through 'group chat'

Intelligent agents must also have a "specification manual"!

A study called MetaGPT, by clearly dividing the roles of agents and requiring multiple agents to adopt a unified and standardized "communication format" in collaboration, allows The performance of the agent is greatly improved.

Currently, this research has garnered 33.6k stars on GitHub, and was included as an Oral paper at the top deep learning conference ICLR 2024.

Agents divide work and collaborate like people, and can also exchange information through group chat

In general, MetaGPT imitates human division of labor and collaboration, and encodes the standard operating procedures of various tasks into the "standards" of intelligent agents. Manual", agents with different roles are responsible for different professional tasks.

For example, the product manager role can use network search tools, while the engineer role can execute code:

Agents divide work and collaborate like people, and can also exchange information through group chat

In this way, multi-agent collaboration completes the task.

The researchers set up a "message sharing group" for the agents, and the agents can freely view relevant messages sent by other agents.

After testing, using this method, MetaGPT achieved 85.9% and 87.7% new SOTA respectively on the public data sets HumanEval and MBPP for code completion tasks.

Currently this work has been open sourced and has attracted the attention of many netizens across the Internet:

Agents divide work and collaborate like people, and can also exchange information through group chat

What does MetaGPT look like?

This research was jointly proposed by the DeepWisdom team and scholars from KAUST AI Center, Xiamen University, CUHK(SZ), Nanjing University, UPenn, UCB and many other universities and institutions.

Agents divide work and collaborate like people, and can also exchange information through group chat

# As the capabilities of large models continue to improve, there is growing interest in academia and industry in using large model-based agents to solve various tasks.

It is worth noting that research on using multiple agents to collaborate to solve problems in specific fields is still in its early stages. Existing research mainly focuses on enhancing task understanding and reasoning decision-making capabilities through role-playing mechanisms and communication topology settings. Despite some progress, these methods still rely on direct dialogue forms and lack standard specifications and constraints on agent behavior.

Some recent work has also pointed out that multi-agent systems based on dialogue may face problems such as information inconsistency, ambiguity, and possible invalid repetition and infinite loops.

In contrast, standard operating procedures (SOPs) in human workflows not only clearly define the division of labor and topology of participating roles, but also establish standard specifications for the role's output results .

Research shows that clearly defined SOPs can improve the consistency and accuracy of task execution and ensure that the end result meets specified quality standards. Therefore, to solve the challenges in multi-agent collaboration, researchers designed a large model-based agent meta-programming framework MetaGPT.

MetaGPT requires agents to participate in collaboration as experts and generate structured output as required, such as high-quality requirements documents, architectural design diagrams, and flow charts.

The structured output is a higher-level thinking chain(Chain-of-Thought) for a single agent, and a context with clear semantics and clear goals for downstream roles(Context).

In the framework of MetaGPT, researchers aligned the concepts of SOPs to role specialization, communication protocol design, and iterative executable feedback design.

Role Specialization

Through clearly defined roles, complex work can be broken down into smaller, more specific tasks.

As shown in the figure below, different professional roles are initialized with different goals and constraints, as well as different professional skills. For example, the product manager role can use web search tools, while the engineer role can execute code. At the same time, each character follows the ReAct behavior pattern by default.

Agents divide work and collaborate like people, and can also exchange information through group chat

#Role specialization enables each agent to focus on specific tasks within its domain, thereby improving the output quality of large models.

For software development, through the flow of roles, this division of labor more skillfully completes the alignment from natural language to programming language. The character ablation experiment in the paper further proves the effect of this part.

Communication Protocol Design

In practical applications, although natural language has rich semantics, due to its unstructured characteristics, the information is often distorted or even distorted during the message transmission process. Loss of important content.

To solve this problem, the author constrained the agent to participate in collaboration with structured output (including documents and charts) to improve the clarity and completeness of the information. To verify this design, the authors designed a variety of software development tasks to emphasize the criticality of structured output in collaboration through the executability of the generated code and productivity indicators.

Agents divide work and collaborate like people, and can also exchange information through group chat

In order to improve communication efficiency during multi-agent collaboration, MetaGPT introduces a publish-subscribe mechanism based on message sharing(Publish-Subscribe Mechanism).

As shown in the figure above, the shared message pool allows messages to be exchanged directly, and any agent can transparently access messages from other agents without asking and waiting for a response. The subscription mechanism makes the agent more inclined to receive information related to its own tasks and avoid being distracted by irrelevant details. At the same time, each agent can directly retrieve the required information from the shared message pool to form self-memory.

Executable feedback

The intelligent agent self-optimizes and actively updates based on environmental feedback, which is a manifestation of the intelligent agent's autonomous consciousness.

In terms of software development tasks, MetaGPT has designed an executable feedback mechanism for engineers’ agents to automatically optimize code quality.

Specifically, engineers write and execute corresponding unit test cases, and make decisions and self-prompts recursively through the observed execution results to achieve automatic debugging. This iterative process of design-test-feedback continues until the unit test passes or the maximum number of retries is reached.

Multiple benchmarks test new SOTA

In terms of code generation capabilities, the researchers used two public benchmark data sets: HumanEval and MBPP, and reported the Pass@1 indicator.

In addition, they also collected the SoftwareDev data set covering 70 typical software development tasks (such as mini-games, data visualization, image processing, etc.) , and made multiple agents open source Comparison of frameworks, statistical analysis and qualitative description of the executability and production efficiency of multiple software development tasks.

As shown in the figure below, MetaGPT outperforms previous methods in both the HumanEval and MBPP benchmarks, reaching 85.9% and 87.7% respectively. Compared with the results of GPT-4, MetaGPT has a relative improvement of 28.2% on the HumanEval data set, and adding an executable feedback mechanism has improved 4.2% and 5.4% on HumanEval and MBPP respectively.

Agents divide work and collaborate like people, and can also exchange information through group chat

On the challenging SoftwareDev dataset, MetaGPT achieves an executability score of 3.75, very close to 4, while requiring a shorter running time(503 seconds);The number of generated lines of code increased by 2.24 times compared to the baseline framework, while the number of tokens consumed per unit line of code dropped by 50%.

These results highlight the efficiency improvements brought by SOPs during multi-agent collaboration.

Agents divide work and collaborate like people, and can also exchange information through group chat

MetaGPT’s high executability and relatively short running time in software development tasks demonstrate its practicality and efficiency in real-world applications.

Agents divide work and collaborate like people, and can also exchange information through group chat

Focusing on the field of software development, researchers provide a qualitative comparison of the capabilities of different agent frameworks.

They found that MetaGPT not only has the ability to generate files in multiple modes, but is also the only open source framework among many current frameworks that completely covers the software development process in the real world.

Agents divide work and collaborate like people, and can also exchange information through group chat

In general, MetaGPT is a novel multi-agent framework that combines meta-programming ideas and embeds SOPs to enhance the capabilities of large models in multi-agent collaboration.

Through role specialization, workflow management and flexible messaging mechanism, it becomes a multi-agent framework with high versatility and portability.

Combined with the iterative feedback mechanism, MetaGPT has achieved SOTA performance on multiple benchmark tests.

SOPs combined with human social practice inspire future research and exploration of multi-agent societies, and can also be regarded as an early attempt to regulate multi-agent frameworks based on large models.

Paper link: https://arxiv.org/abs/2308.00352
Code link: https://github.com/geekan/MetaGPT

The above is the detailed content of Agents divide work and collaborate like people, and can also exchange information through 'group chat'. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Sam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailSam's Club Bets On AI To Eliminate Receipt Checks And Enhance RetailApr 22, 2025 am 11:29 AM

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's AI Omniverse Expands At GTC 2025Nvidia's AI Omniverse Expands At GTC 2025Apr 22, 2025 am 11:28 AM

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Exploring the Capabilities of Google's Gemma 2 ModelsExploring the Capabilities of Google's Gemma 2 ModelsApr 22, 2025 am 11:26 AM

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

The Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaThe Next Wave of GenAI: Perspectives with Dr. Kirk Borne - Analytics VidhyaApr 22, 2025 am 11:21 AM

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

AI For Runners And Athletes: We're Making Excellent ProgressAI For Runners And Athletes: We're Making Excellent ProgressApr 22, 2025 am 11:12 AM

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Jamie Engstrom On Technology, Talent And Transformation At CaterpillarJamie Engstrom On Technology, Talent And Transformation At CaterpillarApr 22, 2025 am 11:10 AM

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

New Google Photos Update Makes Any Photo Pop With Ultra HDR QualityNew Google Photos Update Makes Any Photo Pop With Ultra HDR QualityApr 22, 2025 am 11:09 AM

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,

What are the TCL Commands in SQL? - Analytics VidhyaWhat are the TCL Commands in SQL? - Analytics VidhyaApr 22, 2025 am 11:07 AM

Introduction Transaction Control Language (TCL) commands are essential in SQL for managing changes made by Data Manipulation Language (DML) statements. These commands allow database administrators and users to control transaction processes, thereby

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Atom editor mac version download

Atom editor mac version download

The most popular open source editor