search
HomeTechnology peripheralsAIUnleash excellent programming resources, giant models and agents will trigger more powerful forces

Just as Rhysford's wand created the legend of extraordinary magicians such as Dumbledore in the past, traditional large-scale language models with huge potential, after pre-training/fine-tuning of code corpus, have mastered more beyond Original execution ability.

Specifically, the advanced version of the large model has been improved in terms of writing code, stronger reasoning, independent reference to execution interfaces, independent improvement, etc., which will provide As an AI agent, it brings benefits in all aspects when performing downstream tasks.

Recently, a research team from the University of Illinois at Urbana-Champaign (UIUC) published an important review.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

Paper link: https://arxiv.org/abs/2401.00812

This review explores the code (Code) How to give large language models (LLMs) and their intelligent agents (Intelligent Agents) based on them powerful capabilities. Unleash excellent programming resources, giant models and agents will trigger more powerful forces
Among them, code specifically refers to a formal language that is machine-executable and human-readable, such as a programming language, a predefined function set, etc. Similar to how we guide LLMs to understand/generate traditional natural language, making LLMs proficient in code only requires applying the same language modeling training objectives to code data.

Different from traditional language models, today’s commonly used LLMs, such as Llama2 and GPT4, have not only significantly improved in size, but they have also undergone development independent of typical natural language corpora. code corpus training. Code has standardized syntax, logical consistency, abstraction and modularity, and can transform high-level goals into executable steps, making it an ideal medium to connect humans and computers.

As shown in Figure 2, in this review, the researchers compiled relevant work and analyzed in detail the various advantages of incorporating code into LLMs training data.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

Specifically, the researchers observed that unique properties of code contribute to:

1. Enhance the code writing capabilities, reasoning capabilities, and structured information processing capabilities of LLMs so that they can be applied to more complex natural language tasks;
2. Guide LLMs to generate structured and accurate Intermediate steps, these steps can be connected to the external execution end through function calls;
3. Use the compilation and execution environment of the code to provide diverse feedback for independent improvement of the model.

In addition, the researchers also deeply investigated the optimization items of these LLMs given by the code, how to strengthen them as the decision-making center of the Intelligent Agent, understand instructions, decompose goals, and plan and a set of abilities to perform actions and improve from feedback.

As shown in Figure 3, in the first part, the researchers found that the pre-training of LLMs on code has expanded the task scope of LLMs to natural language. outside. These models can support a variety of applications, including code generation for mathematical theories, general programming tasks, and data retrieval. Code needs to produce a logically coherent, ordered sequence of steps, which is essential for effective execution. Additionally, the executability of each step in the code allows step-by-step verification of the logic. Exploiting and embedding these code attributes in pre-training improves the Chain of Thought (CoT) performance of LLMs in many traditional natural language downstream tasks, validating their improvement in complex reasoning skills. At the same time, by implicitly learning the structured format of code, codeLLMs perform better on common-sense structured reasoning tasks, such as those related to markup languages, HTML, and diagram understanding.
Unleash excellent programming resources, giant models and agents will trigger more powerful forces
As shown in Figure 4, connecting LLMs with other functional ends (that is, extending LLMs capabilities through external tools and execution modules) helps LLMs to more accurately and reliably Perform tasks.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

In the second part, as shown in Table 1, researchers observed a general trend: LLMs establish connections with other functional endpoints by generating programming languages ​​or leveraging predefined functions. This "code-centric paradigm" differs from the rigid approach of strictly hardcoding tool calls in the inference mechanism of LLMs, which allows LLMs to dynamically generate tokens that call execution modules, with adjustable parameters.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

This paradigm provides a simple and clear way for LLMs to interact with other functional ends, enhancing the flexibility and scalability of their applications. sex. More importantly, it also allows LLMs to interact with numerous functional endpoints covering multiple modalities and domains. By expanding the number and variety of functional terminals accessible to LLMs, LLMs are able to handle more complex tasks.

As shown in Figure 5, embedding LLMs into the code execution environment can achieve automated feedback and independent model improvement. LLMs perform beyond the range of their training parameters, in part because they are able to accommodate feedback. However, feedback must be chosen carefully as noisy cue input may impede the performance of LLMs on downstream tasks. Furthermore, since human resources are expensive, feedback needs to be collected automatically while maintaining authenticity. In the third part, the researchers found that embedding LLMs into the code execution environment can yield feedback that meets all of these criteria.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

First of all, since code execution is deterministic, obtaining feedback from the results of executing code can directly and faithfully reflect the tasks performed by LLM. Additionally, code interpreters provide LLMs with a way to automatically query internal feedback, eliminating the need for expensive human annotations when leveraging LLMs to debug or optimize erroneous code. The Code compilation and execution environment also allows LLMs to incorporate diverse and comprehensive external feedback forms, such as simple generation of binary correct and error evaluations, slightly more complex natural language explanations of execution results, and various rankings with feedback values. methods, they all make the methods of improving performance highly customizable.

By analyzing various ways in which code training data integration enhances the capabilities of LLMs, researchers further discovered that the advantage of code empowering LLMs lies in the key development of Intelligent Agent. LLM application areas are particularly obvious.

Figure 6 shows the standard workflow of an intelligent assistant. The researchers observed that the improvements brought about by code training in LLMs also affected the actual steps they performed as intelligent assistants.

Unleash excellent programming resources, giant models and agents will trigger more powerful forces

These steps include: (1) enhancing IA’s decision-making capabilities in environmental awareness and planning, (2) implementing actions in modular action primitives and efficient organization of memory to optimize policy execution, and (3) optimize performance through feedback automatically derived from the code execution environment.

In summary, in this review, researchers analyze and clarify how code gives LLMs powerful capabilities, and how code assists LLMs in working as decision-making centers for Intelligent Agents .

Through a comprehensive literature review, the researchers observed that after code training, LLMs improved their programming skills and reasoning capabilities, and gained implementation and cross-modal and domain expertise. Flexible connection capabilities for multiple function terminals, as well as enhanced ability to interact with evaluation modules integrated in the code execution environment and achieve automatic self-improvement.

In addition, the improved capabilities of LLMs brought by code training help them perform as Intelligent Agents in downstream applications, reflected in specific tasks such as decision-making, execution, and self-improvement. Steps. In addition to reviewing previous research, the researchers also proposed several challenges in the field as guiding elements for potential future directions.

Please refer to the original article for more details!

The above is the detailed content of Unleash excellent programming resources, giant models and agents will trigger more powerful forces. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:机器之心. If there is any infringement, please contact admin@php.cn delete
[Ghibli-style images with AI] Introducing how to create free images with ChatGPT and copyright[Ghibli-style images with AI] Introducing how to create free images with ChatGPT and copyrightMay 13, 2025 am 01:57 AM

The latest model GPT-4o released by OpenAI not only can generate text, but also has image generation functions, which has attracted widespread attention. The most eye-catching feature is the generation of "Ghibli-style illustrations". Simply upload the photo to ChatGPT and give simple instructions to generate a dreamy image like a work in Studio Ghibli. This article will explain in detail the actual operation process, the effect experience, as well as the errors and copyright issues that need to be paid attention to. For details of the latest model "o3" released by OpenAI, please click here⬇️ Detailed explanation of OpenAI o3 (ChatGPT o3): Features, pricing system and o4-mini introduction Please click here for the English version of Ghibli-style article⬇️ Create Ji with ChatGPT

Explaining examples of use and implementation of ChatGPT in local governments! Also introduces banned local governmentsExplaining examples of use and implementation of ChatGPT in local governments! Also introduces banned local governmentsMay 13, 2025 am 01:53 AM

As a new communication method, the use and introduction of ChatGPT in local governments is attracting attention. While this trend is progressing in a wide range of areas, some local governments have declined to use ChatGPT. In this article, we will introduce examples of ChatGPT implementation in local governments. We will explore how we are achieving quality and efficiency improvements in local government services through a variety of reform examples, including supporting document creation and dialogue with citizens. Not only local government officials who aim to reduce staff workload and improve convenience for citizens, but also all interested in advanced use cases.

What is the Fukatsu-style prompt in ChatGPT? A thorough explanation with example sentences!What is the Fukatsu-style prompt in ChatGPT? A thorough explanation with example sentences!May 13, 2025 am 01:52 AM

Have you heard of a framework called the "Fukatsu Prompt System"? Language models such as ChatGPT are extremely excellent, but appropriate prompts are essential to maximize their potential. Fukatsu prompts are one of the most popular prompt techniques designed to improve output accuracy. This article explains the principles and characteristics of Fukatsu-style prompts, including specific usage methods and examples. Furthermore, we have introduced other well-known prompt templates and useful techniques for prompt design, so based on these, we will introduce C.

What is ChatGPT Search? Explains the main functions, usage, and fee structure!What is ChatGPT Search? Explains the main functions, usage, and fee structure!May 13, 2025 am 01:51 AM

ChatGPT Search: Get the latest information efficiently with an innovative AI search engine! In this article, we will thoroughly explain the new ChatGPT feature "ChatGPT Search," provided by OpenAI. Let's take a closer look at the features, usage, and how this tool can help you improve your information collection efficiency with reliable answers based on real-time web information and intuitive ease of use. ChatGPT Search provides a conversational interactive search experience that answers user questions in a comfortable, hidden environment that hides advertisements

An easy-to-understand explanation of how to create a composition in ChatGPT and prompts!An easy-to-understand explanation of how to create a composition in ChatGPT and prompts!May 13, 2025 am 01:50 AM

In a modern society with information explosion, it is not easy to create compelling articles. How to use creativity to write articles that attract readers within a limited time and energy requires superb skills and rich experience. At this time, as a revolutionary writing aid, ChatGPT attracted much attention. ChatGPT uses huge data to train language generation models to generate natural, smooth and refined articles. This article will introduce how to effectively use ChatGPT and efficiently create high-quality articles. We will gradually explain the writing process of using ChatGPT, and combine specific cases to elaborate on its advantages and disadvantages, applicable scenarios, and safe use precautions. ChatGPT will be a writer to overcome various obstacles,

How to create diagrams using ChatGPT! Illustrated loading and plugins are also explainedHow to create diagrams using ChatGPT! Illustrated loading and plugins are also explainedMay 13, 2025 am 01:49 AM

An efficient guide to creating charts using AI Visual materials are essential to effectively conveying information, but creating it takes a lot of time and effort. However, the chart creation process is changing dramatically due to the rise of AI technologies such as ChatGPT and DALL-E 3. This article provides detailed explanations on efficient and attractive diagram creation methods using these cutting-edge tools. It covers everything from ideas to completion, and includes a wealth of information useful for creating diagrams, from specific steps, tips, plugins and APIs that can be used, and how to use the image generation AI "DALL-E 3."

An easy-to-understand explanation of ChatGPT Plus' pricing structure and payment methods!An easy-to-understand explanation of ChatGPT Plus' pricing structure and payment methods!May 13, 2025 am 01:48 AM

Unlock ChatGPT Plus: Fees, Payment Methods and Upgrade Guide ChatGPT, a world-renowned generative AI, has been widely used in daily life and business fields. Although ChatGPT is basically free, the paid version of ChatGPT Plus provides a variety of value-added services, such as plug-ins, image recognition, etc., which significantly improves work efficiency. This article will explain in detail the charging standards, payment methods and upgrade processes of ChatGPT Plus. For details of OpenAI's latest image generation technology "GPT-4o image generation" please click: Detailed explanation of GPT-4o image generation: usage methods, prompt word examples, commercial applications and differences from other AIs Table of contents ChatGPT Plus Fees Ch

Explaining how to create a design using ChatGPT! We also introduce examples of use and promptsExplaining how to create a design using ChatGPT! We also introduce examples of use and promptsMay 13, 2025 am 01:47 AM

How to use ChatGPT to streamline your design work and increase creativity This article will explain in detail how to create a design using ChatGPT. We will introduce examples of using ChatGPT in various design fields, such as ideas, text generation, and web design. We will also introduce points that will help you improve the efficiency and quality of a variety of creative work, such as graphic design, illustration, and logo design. Please take a look at how AI can greatly expand your design possibilities. table of contents ChatGPT: A powerful tool for design creation

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

VSCode Windows 64-bit Download

VSCode Windows 64-bit Download

A free and powerful IDE editor launched by Microsoft