Robin Li: The bar for benchmarking Wenxin Yiyan against ChatGPT is very high, and Baidu is the first of the world's major companies to do it

On the afternoon of March 16, Baidu held a press conference at its Beijing headquarters centered on Wenxin Yiyan, its new-generation large language model and generative AI product. Robin Li, founder, chairman and CEO of Baidu, and Wang Haifeng, Baidu's chief technology officer, attended and demonstrated Wenxin Yiyan's overall capabilities in five usage scenarios: literary creation, business copywriting, mathematical calculation, Chinese-language understanding, and multimodal generation.

Judging from the on-site demonstration, Wenxin Yiyan can understand human intent to a certain extent, and the accuracy, logic, and fluency of its answers are gradually approaching human levels. However, Robin Li also said repeatedly that this type of large language model is still far from mature and has a lot of room for improvement, and that it will evolve rapidly in the future.

Baidu also announced an invitation-only testing plan for Wenxin Yiyan. Starting March 16, the first batch of users can try the product on the Wenxin Yiyan official website using an invitation code, and access will be opened to more users later. In addition, Baidu Smart Cloud will soon open Wenxin Yiyan API services to enterprise customers. Reservations officially open on March 16: search for "Baidu Smart Cloud" to reach the official website and apply to join the Wenxin Yiyan cloud service test.

Currently, large language models and generative AI represent a new technological paradigm and an opportunity that no company can afford to miss. Baidu positions Wenxin Yiyan as a foundational AI enablement platform that will support the intelligent transformation of industries such as finance, energy, media, and government affairs. Robin Li said: "Baidu hopes to work with everyone to advance artificial intelligence technology, so that everyone can use the most advanced productivity tools and benefit from them."

At the press conference, Robin Li showed Wenxin Yiyan's performance in five usage scenarios: literary creation, business copywriting, mathematical calculation, Chinese-language understanding, and multimodal generation.

In the literary creation scenario, Wenxin Yiyan summarized the core content of the well-known science fiction novel "The Three-Body Problem" in response to conversational questions and proposed five possible angles for continuing the story, demonstrating combined abilities in conversational Q&A, summarization and analysis, and content creation.

In addition, Wenxin Yiyan accurately answered factual questions about the author of "The Three-Body Problem" and the cast of the TV series. Generative AI often "makes things up" when answering factual questions; Wenxin Yiyan continues Baidu's knowledge-enhanced large model approach, which greatly improves accuracy on factual questions.

In the business copywriting scenario, Wenxin Yiyan successfully completed three creative tasks: naming a company, writing a slogan, and writing a press release.

Across these three consecutive creative tasks, Wenxin Yiyan accurately understood human intent and expressed itself clearly. This is the "emergent intelligence" that arises from data at massive scale. The training data of the Wenxin Yiyan large model includes trillions of web pages, billions of search and image records, tens of billions of daily voice calls, and a knowledge graph of 550 billion facts.

Wenxin Yiyan also has a degree of reasoning ability and can learn relatively complex tasks such as mathematical deduction and logical reasoning. Faced with classic problems used to train human logical thinking, such as "chickens and rabbits in the same cage", Wenxin Yiyan can understand the question, form the correct approach, and then work through the calculation step by step, like a student, to reach the correct answer.
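
For readers unfamiliar with the puzzle, the step-by-step reasoning it calls for can be sketched in a few lines of Python. The article does not give the numbers used in the demonstration, so the figures below (35 heads, 94 feet) are an assumed textbook example, and the function name is hypothetical.

```python
def solve_chickens_and_rabbits(heads: int, feet: int) -> tuple[int, int]:
    """Return (chickens, rabbits) for a cage with the given heads and feet."""
    # Step 1: every animal has one head, so chickens + rabbits = heads.
    # Step 2: if all animals were chickens there would be 2 * heads feet.
    # Step 3: each rabbit adds 2 extra feet, so the surplus feet give the rabbits.
    extra_feet = feet - 2 * heads
    if extra_feet < 0 or extra_feet % 2 != 0:
        raise ValueError("no whole-number solution for these counts")
    rabbits = extra_feet // 2
    chickens = heads - rabbits
    if chickens < 0:
        raise ValueError("no non-negative solution for these counts")
    return chickens, rabbits


if __name__ == "__main__":
    # Assumed example figures, not taken from the press conference.
    print(solve_chickens_and_rabbits(35, 94))  # (23, 12)
```

With these assumed figures the surplus is 94 - 70 = 24 feet, which means 12 rabbits and therefore 23 chickens.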

Literary creation, business copywriting, and mathematical calculation are strengths common to large language models. Beyond these, Wenxin Yiyan also shows stronger Chinese-language understanding and multimodal generation capabilities.

During the on-site demonstration, Wenxin Yiyan correctly explained the meaning of the idiom "Luoyang paper is expensive" and the economic theory behind it, and also composed an acrostic poem from the four characters of the idiom.

In terms of multimodal generation, Robin Li demonstrated Wenxin Yiyan's ability to generate text, images, audio, and video. Wenxin Yiyan can even generate speech in dialects such as Sichuanese. Because of its high cost, video generation is not currently open to all users and will be rolled out gradually.

"Multimodality is a clear development trend in generative AI," Robin Li said. "In the future, as Baidu's unified multimodal large model capabilities grow, Wenxin Yiyan's multimodal generation capabilities will also continue to improve."

Judging from its performance, Wenxin Yiyan can understand human intent to a certain extent, and the accuracy, logic, and fluency of its answers are gradually approaching human levels. Overall, however, this type of large language model is far from mature and relies on real user feedback to improve through gradual iteration.

Wang Haifeng said that Wenxin Yiyan is a new-generation knowledge-enhanced large language model developed on the basis of the ERNIE and PLATO model series. Its key technologies include supervised fine-tuning, reinforcement learning from human feedback, prompting, knowledge enhancement, retrieval enhancement, and dialogue enhancement. The first three are techniques common to this class of large language models; they had already been applied in ERNIE and PLATO and were further strengthened and refined in Wenxin Yiyan. The last three are new innovations built on Baidu's existing technical strengths, and they are the foundation on which Wenxin Yiyan will keep getting stronger.

Robin Li emphasized: "Wenxin Yiyan will create a flywheel among real user feedback, developer calls, and model iteration, and its performance will improve rapidly. After three days apart, you will look at it with new eyes." He said that Baidu is currently the first of the world's major companies to build a product benchmarked against ChatGPT, and noted: "No company can build a large language model like this in a few months. Deep learning and natural language processing require years of persistence and accumulation, and there are no shortcuts."

It can be said that Wenxin Yiyan is the continuation of many years of effort at Baidu. As humanity enters the era of artificial intelligence, the IT technology stack has changed fundamentally, from three layers in the past to four: chip, framework, model, and application. Baidu is one of the few AI companies in the world with a full-stack presence across all four layers, from its high-end Kunlun chips, to the PaddlePaddle deep learning framework, to the Wenxin pre-trained large models, to applications such as search, intelligent cloud, autonomous driving, and Xiaodu, with industry-leading self-developed technology at every layer.

Robin Li believes that the advantage of Baidu's full-stack AI layout is that it enables end-to-end optimization across the four-layer technology stack, greatly improving efficiency. In particular, the framework layer and the model layer have strong synergy, which helps build more efficient models and significantly reduces costs. Training and inference for very large models place great demands on the deep learning framework. For example, to support efficient distributed training of models with hundreds of billions of parameters, Baidu's PaddlePaddle developed a dedicated 4D hybrid parallelism technology.

Since Baidu officially announced Wenxin Yiyan in February, more than 650 companies have announced that they are joining the Wenxin Yiyan ecosystem.

Robin Li predicts that large language models will bring three major industry opportunities.

The first category is a new type of cloud computing company whose mainstream business model shifts from IaaS (infrastructure as a service) to MaaS (model as a service). Wenxin Yiyan will fundamentally change the rules of the game in the cloud computing industry. In the past, enterprises chose cloud vendors mainly on basic cloud services such as computing power and storage. In the future, the choice will depend more on the quality of the framework and the model, and on how well the four layers of model, framework, chip, and application work together.

Wenxin Yiyan will provide external services through Baidu Smart Cloud to help enterprises build their own models and applications. Key sectors such as agriculture, industry, finance, education, healthcare, transportation, and energy will greatly improve efficiency as a result, and new industrial space will quickly form in every sector, helping to realize Digital China. Robin Li said that Baidu Smart Cloud will hold a press conference in the near future centered on Wenxin Yiyan's cloud services and application products, which include both public cloud services and private deployment.

The second category is companies that fine-tune industry models. They form the middle layer between general-purpose large models and enterprises: drawing on their industry insight, they use the capabilities of general-purpose large models to provide solutions to industry customers. In this area, Baidu's Wenxin large models have already released more than 10 industry models in fields such as electric power, finance, and media.

The third category is companies that develop applications on top of large foundation models, that is, application service providers. Robin Li asserted that for most entrepreneurs and companies, the real opportunity is not to build foundation models like ChatGPT and Wenxin Yiyan from scratch, which is unrealistic and uneconomical. The real opportunity may instead lie in being first to develop important application services on top of general-purpose large language models. Many star startups have already emerged in scenarios such as text generation, image generation, audio generation, video generation, digital humans, and 3D, and they may become the new giants of the future.

"We believe that artificial intelligence will completely change every industry we have today. The long-term value of AI and its disruptive impact on all walks of life have only just begun. In the future, more killer applications and phenomenal products will emerge, and more milestone events will occur," Robin Li said. (one orange)

Statement
This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to request deletion.