


Sutu Network News On September 5, the AI model community ModelScope (ModelScope) led by Alibaba Cloud launched a new model, launching ModelScope-Agent, an AI Agent development framework adapted to open source large language models. With the help of ModelScope-Agent, all developers can build their own agent applications based on open source LLM, maximizing imagination and creativity. The code for ModelScope-Agent is now open source.
AI Agent (AI agent, AI agent) can be said to be the most popular research direction in the field of large models. It refers to an agent based on LLM that can use tools to autonomously complete specific tasks. As we all know, LLM at the current stage has capability boundaries and is mainly good at processing text tasks. AI Agent collaborates with LLM and other models, software and other external tools to handle various complex tasks in the real world, such as connecting to video generation models and independently producing videos; connecting to external software to help humans write emails, book tickets, etc. Shopping and more.
AI Agent applications are becoming an important track for large-scale model entrepreneurship, and the AI Agent development framework has also emerged. However, most of the existing AI Agent development frameworks in the industry are based on closed-source LLM. In order to fully unleash the productivity of open source LLM, the Moda community has developed an AI Agent development framework ModelScope-Agent that is adapted to open source LLM. In short, ModelScope-Agent is a pilot (pilot) rather than a copilot (co-pilot).
Previously, Moda has used this framework to do "proofing" in the community and built ModeScopeGPT, which can interact with users through natural language, accept user instructions, and call many AIs in the community through the "central model" Tongyi Qianwen Model API can autonomously complete tasks assigned by humans. ModeScopeGPT has been called more than 250,000 times.
(ModelScope-Agent system architecture diagram)
The workflow of AI Agent generally includes task understanding, planning and execution. Among them, LLM is responsible for task planning, tool invocation and reply generation. It is the intelligent center of the entire process, just like the human brain. ModelScope-Agent allows developers to freely choose the "brain" of the agent and adapt to various open source models such as Baichuan and Tongyi Qianwen. Mainstream LLM can be "plug and play". If developers need to specifically train LLM to enhance their tool calling capabilities, they can refer to the open source training data, training methods, optimization plans, and evaluation methods of ModelScope-Agent.
ModelScope-Agent development framework also includes memory control, tool usage and other modules. The memory control module supports knowledge retrieval and prompt (prompt word) management, and the tool usage module supports tool library, tool retrieval, and tool customization. In order to enhance the ability to call open source large model tools, the Moda community also developed a self-developed MSAgent-Bench tool calling data set containing 600,000 samples.
Most AI Agent development frameworks require manual addition of tools, while ModelScope-Agent creates an API tool retrieval engine by calling the open source text vector model on the Magic Community, allowing the AI Agent to automatically retrieve relevant tools according to user instructions. If developers want to add external tools/APIs, they only need to register the tools on ModelScope-Agent and call them directly.
Open source is becoming the choice of many large head model players. In the future, ModelScope-Agent will adapt to more new open source LLMs, and more applications developed based on ModelScope-Agent will be launched, such as personal assistant Agent, Story Agent, Multi-Agent, etc. MoDa encourages developers to use ModelScope-Agent to explore applications in industrial manufacturing, game development, intelligent interconnection and other industries.
MoDa community stated that as the largest and most active AI model open source community in China, MoDa will continue to promote the development of large model technology and applications through open source. Previously, Alibaba Cloud had stated that it would take promoting the prosperity of China's large model ecosystem as its primary goal and build Moda into China's largest free market for large models. (Gardenia Orange)
The above is the detailed content of Moda Community launches an AI Agent development framework, allowing everyone to 'build' intelligent agent applications based on open source LLM. For more information, please follow other related articles on the PHP Chinese website!

Vibe coding is reshaping the world of software development by letting us create applications using natural language instead of endless lines of code. Inspired by visionaries like Andrej Karpathy, this innovative approach lets dev

DALL-E 3: A Generative AI Image Creation Tool Generative AI is revolutionizing content creation, and DALL-E 3, OpenAI's latest image generation model, is at the forefront. Released in October 2023, it builds upon its predecessors, DALL-E and DALL-E 2

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

The $500 billion Stargate AI project, backed by tech giants like OpenAI, SoftBank, Oracle, and Nvidia, and supported by the U.S. government, aims to solidify American AI leadership. This ambitious undertaking promises a future shaped by AI advanceme

Google's Veo 2 and OpenAI's Sora: Which AI video generator reigns supreme? Both platforms generate impressive AI videos, but their strengths lie in different areas. This comparison, using various prompts, reveals which tool best suits your needs. T

Google DeepMind's GenCast: A Revolutionary AI for Weather Forecasting Weather forecasting has undergone a dramatic transformation, moving from rudimentary observations to sophisticated AI-powered predictions. Google DeepMind's GenCast, a groundbreak

The article discusses AI models surpassing ChatGPT, like LaMDA, LLaMA, and Grok, highlighting their advantages in accuracy, understanding, and industry impact.(159 characters)


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 English version
Recommended: Win version, supports code prompts!

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

WebStorm Mac version
Useful JavaScript development tools

SublimeText3 Linux new version
SublimeText3 Linux latest version

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.
