Baidu founder Robin Li: We are about to enter an AI-native era

"Finance" new media. Text: Wang Jingya. Editor: Gao Suying.

"We are about to enter an AI-native era, an era in which humans and machines interact through prompts, and the future will be generated by all of us together," Baidu founder, chairman and CEO Robin Li said on October 17 at the 2023 Baidu World Conference.

He announced on the spot that Baidu's Wenxin large model had been officially upgraded to version 4.0. According to Li, the new version significantly improves on its predecessor in the four core capabilities of understanding, generation, logic and memory, and its overall level is not inferior to GPT-4. It is Baidu's strongest Wenxin large model to date and a comprehensive upgrade of the foundation model.

Robin Li demonstrated the characteristics and application scenarios of Wenxin Yiyan's four capabilities: understanding, generation, logic and memory. He believes these capabilities did not exist in earlier eras, and that they therefore open up unlimited space for innovation.

Specifically, in terms of understanding, AI has progressed from "artificial stupidity" that could not follow human speech to a system that can understand almost everything said to it, sometimes grasping what users mean better than their friends and colleagues do. In terms of generation, working from a single image and a few keywords supplied by Robin Li, Wenxin Yiyan produced one advertising video, five pieces of advertising copy and one poster in just three minutes. Building on this capability, Baidu has launched Qingduo, an AIGC marketing creative platform.

In terms of logic, the Wenxin large model's ability is most evident in scenarios such as solving mathematical problems and summarizing knowledge points. Robin Li said that, beyond problem solving, logical capability is also required for route planning in smart maps, complex tasks handled by smart assistants, and traffic light control in smart transportation systems. On memory, he pointed out that whether the AI remembers what the user has said, and whether the content it generates stays consistent from beginning to end, is an important indicator of how intelligent a large model is. Multi-turn dialogue is where memory is put to the test.

It should not be overlooked that the four capabilities of a large model do not exist in isolation; they complement one another in specific scenarios. In Robin Li's view, understanding, generation, logic and memory are the foundation on which all AI-native applications depend. Creating advertising copy, for example, requires understanding the creative theme, working out the creative logic, and using memory to stay consistent. Solving problems likewise draws on all four capabilities together.

It is worth mentioning that, whatever the industry, the ultimate goal of large model technology is still to serve people, and practical application is the key to the development of AI. "AI-native applications are applications developed based on the understanding, generation, logic and memory capabilities of large models," Robin Li said. He believes that without a rich set of AI-native applications built on top of it, a foundation model has no value.

Robin Li demonstrated more than 10 AI-native application cases, based on rebuilding Baidu Search, Ruliu, Maps, Netdisk and Wenku with Wenxin Yiyan, in the hope of inspiring developers to work together on more amazing AI-native applications. In his view, "China has rich application scenarios, and Chinese users are willing to embrace new technologies. With advanced basic large models, we can build a prosperous AI ecosystem and jointly create a new round of economic growth."

When developing AI-native applications, the basic capabilities of the underlying large model are crucial. Robin Li said that APIs are the main way for AI-native applications to call foundation models. Currently, 42 mainstream large models are hosted on the Qianfan Large Model Platform, covering nearly 500 scenarios across various industries.

It is worth noting that the reconstruction driven by large models will affect not only online applications but also offline work and life. A large number of AI-native applications will continue to emerge, promoting the deep integration of digital technology and the real economy. Large model technology has already been applied in manufacturing, energy, electric power, the chemical industry, transportation and other real-economy sectors, and is becoming an important driving force for new industrialization.

Robin Li believes that a new world and a new future will be generated through the prompts of every enterprise, every developer and every user. Future AI-native applications will have to be multimodal, and will reconstruct the physical world as well as the information world.

Statement
This article is reproduced from 搜狐 (Sohu). In case of infringement, please contact admin@php.cn to have it removed.