search
HomeTechnology peripheralsAIWill Baidu's ERNIE 4.5 & X1 Replace GPT-4.5 and DeepSeek-R1?

China has done it again with its AI models and this time the blow is bigger and better! Baidu – a Chinese AI company, recently released two large language models (LLMs) – ERNIE 4.5 & X1. Claiming to perform better than OpenAI’s latest & greatest model to date – GPT-4.5, these models are more cost-efficient than DeepSeek-R1! The models seem too good to be true – offering high quality at a fraction of the price. In this blog, we’ll explore the ERNIE 4.5 & X1 models, evaluate their benchmark results, and see how they perform in real-world applications. So, let’s begin.

Table of Contents

  • What are ERNIE 4.5 & X1?
    • ERNIE 4.5
    • ERNIE X1
  • How to Access ERNIE 4.5 & X1?
  • ERNIE 4.5 & X1 Performance Check
    • Task 1: Reasoning Image Analysis
    • Task 2: Document Analysis Summarization
    • Task 3: Audio Analysis
    • Task 4: Creativity Image Generation
  • Baidu’s ERNIE 4.5 & X1: Pricing
  • ERNIE 4.5 & X1: Standard Benchmark Results
  • Future Impact
  • Conclusion
  • Frequently Asked Questions

What are ERNIE 4.5 & X1?

ERNIE 4.5 & X1 are the two latest multimodal LLMs developed by the leading Chinese tech company Baidu, specializing in internet services, artificial intelligence, and autonomous driving. It is best known for its dominant search engine in China and advancements in AI-driven innovations. Baidu launched its first LLM, ERNIE 3.0 Titan, back in December 2021. After that, it has released a few more models, while working simultaneously to build more robust LLMs. The result of all the research and continuous efforts is ERNIE 4.5 & X1.

ERNIE 4.5

ERNIE 4.5 is a multimodal foundation model capable of understanding and integrating various data types, including text, images, audio, and video. This diverse modeling approach enhances its ability to understand and generate different kinds of content.

Here are some of the key features of ERNIE 4.5:

  • ERNIE 4.5 shows comprehensive improvements in understanding, generation, reasoning, and memory over its predecessor, ERNIE 4.0.
  • It shows great abilities in hallucination prevention, logical reasoning, and coding, making it adept at handling complex tasks with higher accuracy. ​
  • The model even performs better than OpenAI’s GPT-4.5 in multiple benchmarks, while it only costs 1% of what it costs to use GPT-4.5!

ERNIE X1

ERNIE X1 is designed as a deep-thinking reasoning model with multimodal capabilities. It’s a first of its kind deep thinking model released by Baidu. Here are some of its key features:

  • ERNIE X1 excels in understanding context, planning its thought process, reflecting on its response, and evolving over time.
  • It is capable of autonomously utilizing various tools for tasks such as advanced search, image understanding, and complex calculations.
  • The model delivers performance on par with DeepSeek-R1 but at half the price, offering a cost-effective solution for enterprises seeking advanced AI capabilities.

How to Access ERNIE 4.5 & X1?

You can access ERNIE 4.5 & X1 either through their AI Chatbot – ERNIE Bot, or through APIs.

Access via Bot:

  • Head to https://yiyan.baidu.com.
  • Create your account by adding your details and get started.

Both the models are freely accessible to individual users on Baidu’s ERNIE Bot platform. However, the registration for ERNIE Bot is currently limited to Chinese nationals.

Access via API:

  • Head to Baidu AI Cloud’s MaaS platform, Qianfan
  • Create your account on the platform to get started.

Currently, the platform can’t be accessed by all users. Also, only ERNIE 4.5 is available via API, while ERNIE X1 will soon be made available on the platform.

ERNIE 4.5 & X1 Performance Check

In this section, we’ll find out how these models perform at tasks involving multimedia, reasoning, document analysis, and more. Since the model interface only supports Chinese language, and the account creation is limited to Chinese nationals, we will look at some examples of how people are using the two models, and the outputs they have received. We will be covering some of the most common use cases of ERNIE 4.5 & X1 we have found online, including:

  1. Reasoning with Image Analysis
  2. Document Analysis and Summarization
  3. Audio Analysis
  4. Creativity and Image Generation

Task 1: Reasoning Image Analysis

In this task, the model was asked to solve a mathematical problem which was given to it in the form of an image.

Model used: ERNIE 4.5

Output:

Just like most other multimodal LLMs, ERNIE 4.5 quickly analyses the video and solves the problem in the image. It takes all the questions in the image one by one, and finally summarizes them all. The speed and accuracy of its performance make it a useful tool for students, educators, researchers, and professionals who require fast and accurate problem-solving.

Task 2: Document Analysis Summarization

Here, the model was given a document and it had to summarize the information on a particular topic from that document.

Model used: ERNIE 4.5

Output:

The model allows you to upload multiple files of various types, all at once. It is capable of processing files of different types, including docs, PDFs, PPTs, Excel sheets, and more. From the uploaded files, you can select the one (or more) that you wish to query the chatbot about and the model quickly summarizes the topic. Its quick processing of multiple files can be very useful for tasks like research analysis, legal document review, financial data extraction, and corporate reporting.

Task 3: Audio Analysis

For this task, the model had to analyze the given audio and find its source.

Model used: ERNIE 4.5

Output:

Audio analysis is a feature that none of the popular AI chatbots have incorporated within their interface, making ERNIE 4.5, the first of its kind. The model quickly analyzes the clip, determines its source, and then even goes on to describe the significance of the clip. Its quick analysis and the detailed description, make it a valuable tool for tasks like real-time transcription, voice-based search, deepfake detection, and sentiment analysis across media, customer service, education, and law enforcement.

Task 4: Creativity Image Generation

For this task, the model had to analyze a room and suggest possible decorations that can enhance its overall appeal. It then had to generate an updated image of the room.

Model used: ERNIE X1

Output:

The model quickly processes the image. It then suggests the possible improvements to the room’s decor to enhance the overall appeal. Finally, it generates the image of the room with all the suggested enhancements. This feature is a great addition for tasks like interior designing, home renovation planning, real estate staging, and virtual decor visualization.

Note: We have taken the examples from this post on X.

Baidu’s ERNIE 4.5 & X1: Pricing

Both ERNIE 4.5 & X1 have all the features, and even more, compared to the top models by OpenAI, DeepSeek, Grok, Claude, etc. Here is a pricing breakdown of the two models:

Model Input Price(per million tokens) Output Price(per million tokens) Availability
ERNIE 4.5 $0.55 $2.20 Available
ERNIE X1 $0.28 $1.10 Not yet available

Compared to other top models, ERNIE 4.5 & X1 are significantly cheaper, making them a valuable asset in the advancement of generative AI.

Will Baidu's ERNIE 4.5 & X1 Replace GPT-4.5 and DeepSeek-R1?

ERNIE 4.5 & X1: Standard Benchmark Results

We have already seen the features, capabilities, and the pricing of the latest ERNIE models. Now let’s look at some performance numbers of these models against top models like GPT-4.5, GPT-4o, DeepSeek-R1, and more.

The graph below compares ERNIE 4.5 and GPT-4o across multiple benchmarks that test multimodal AI performance.

Will Baidu's ERNIE 4.5 & X1 Replace GPT-4.5 and DeepSeek-R1?

The graph shows that:

  • ERNIE 4.5 outperforms GPT-4o in most multimodal tasks.
  • The average score for ERNIE 4.5 is 77.77, which is higher than GPT-4o’s 73.92.
  • ERNIE 4.5 has a significant edge in MathVista and DocVQA, showing better math reasoning and document-based question-answering skills.
  • Both models perform similarly in OCRBench and MMMU, but ERNIE 4.5 still has a slight advantage.

The next graph compares ERNIE 4.5, DeepSeek V3 – Chat, GPT-4o, and GPT-4.5 across multiple benchmarks for text-based reasoning and problem-solving.

Will Baidu's ERNIE 4.5 & X1 Replace GPT-4.5 and DeepSeek-R1?

Here are some key takeaways from the graph:

  • ERNIE 4.5 leads the pack with an average score of 79.6, narrowly surpassing DeepSeek V3 – Chat at 79.14.
  • It performs well across general knowledge, reasoning, and programming benchmarks such as MMLU-Pro, GSM8K, and HumanEval .
  • GPT-4o and DeepSeek V3 also demonstrate strong results, with DeepSeek V3 performing competitively in Chinese benchmarks like CMMLU.
  • ERNIE 4.5 excels in GSM8K (math) and C-Eval (general reasoning), although DeepSeek V3 is very close in performance.

Future Impact

The race to be the top LLM is heating up and Baidu’s ERNIE 4.5 & X1 introduce serious competition for OpenAI, DeepSeek, Anthropic, and Meta. With Chinese AI labs delivering models that rival or surpass Western AI at a fraction of the cost, companies will be forced to innovate faster and lower their costs to stay competitive.

All these advancements will finally lead to:

  • Faster AI advancements across all major AI research centers.
  • More affordable AI for businesses and developers.
  • A new era of multimodal AI applications, expanding beyond traditional text-based AI.

Conclusion

Baidu’s ERNIE 4.5 & X1 models are not just another set of AI models – they are industry disruptors. Their superior multimodal and reasoning capabilities, low pricing, and deep integration into China’s digital ecosystem, signal a power shift in the global AI market.

If this trend continues, we would see a larger scale AI democratisation and outreach across various industries. This would also push many western companies to release cheaper models. Not only would this add to competitiveness in the market, but would also ensure that the users get the most value for their money.

Frequently Asked Questions

Q1. What are ERNIE 4.5 & X1?

A. ERNIE 4.5 & X1 are the latest large language models (LLMs) developed by Baidu, designed to rival top AI models like OpenAI’s GPT-4.5 and DeepSeek-R1. ERNIE 4.5 is a multimodal foundation model, while ERNIE X1 is a deep-thinking reasoning model with advanced capabilities.

Q2. How is Baidu’s ERNIE 4.5 different from ERNIE X1?

A. ERNIE 4.5 is optimized for multimodal understanding, capable of processing text, images, audio, and video with high accuracy. ERNIE X1, on the other hand, is designed for deep-thinking reasoning, excelling in context understanding, planning, and problem-solving with self-reflection.

Q3. How do ERNIE 4.5 & X1 compare to OpenAI’s GPT-4.5?

A. Baidu ERNIE 4.5 outperforms GPT-4.5 in multiple benchmarks, particularly in reasoning, multimodal understanding, and hallucination prevention, while costing only 1% of GPT-4.5’s price. ERNIE X1 delivers DeepSeek-R1 level performance at half the cost, making them highly competitive AI solutions.

Q4. What are the pricing details for ERNIE 4.5 & X1?

A. ERNIE 4.5: Input cost $0.55 per 1M tokens, output cost $2.20 per 1M tokens.
ERNIE X1: Input cost $0.28 per 1M tokens, output cost $1.10 per 1M tokens.
The ERNIE X1 model is not yet available via API but will be soon.

Q5. How can I access ERNIE 4.5 & X1?

A. You can access these models through:
1. ERNIE Bot (AI Chatbot) at yiyan.baidu.com (Only available for Chinese users).
2. Baidu AI Cloud’s MaaS platform, Qianfan, for API access (currently only ERNIE 4.5 is available).

The above is the detailed content of Will Baidu's ERNIE 4.5 & X1 Replace GPT-4.5 and DeepSeek-R1?. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
ServiceNow Challenges Traditional CRM At Knowledge 2025 ConferenceServiceNow Challenges Traditional CRM At Knowledge 2025 ConferenceMay 16, 2025 am 03:45 AM

The Evolution of CRM in a Connected MarketplaceUnderstanding the evolving CRM landscape is essential. In today's interconnected market, customers leverage digital platforms and social media to exchange experiences and impact buying decisions. This in

[AI Video] An easy-to-understand explanation of how to summarise YouTube and prompts in ChatGPT![AI Video] An easy-to-understand explanation of how to summarise YouTube and prompts in ChatGPT!May 16, 2025 am 03:37 AM

AI is essential for efficient information gathering. In this article, we will explain three ways to summarise YouTube videos using ChatGPT. It also introduces the advantages and disadvantages of ChatGPT summary, as well as recommended free AI tools, and covers practical techniques for making effective use of video content. Dramatically improve the efficiency of information collection and analysis with the latest technology. Click here for more information about OpenAI's latest AI agent, OpenAI Deep Research ⬇️ summary In this article, we will introduce you to YouTube using ChatGPT.

What is OpenAI o3 (ChatGPT o3)? Explaining how to use it, fees, and restrictions!What is OpenAI o3 (ChatGPT o3)? Explaining how to use it, fees, and restrictions!May 16, 2025 am 03:21 AM

OpenAI has released a remarkable new generation of AI models: OpenAI o3 (Osri) and o4-mini (Off Mini), which has attracted global attention. Among them, o3 is known as the smartest and most efficient inference model for OpenAI to date, and is expected to take AI capabilities to a new level. This article will provide an in-depth interpretation of OpenAI o3, covering its amazing features, usage methods, pricing system, access methods, and differences from previous models. In addition, we will introduce in detail the once highly anticipated successor of the "o3-mini", which achieves high-speed, cost-effective operation. We will explore the powerful deep thinking ability of O3 and the o4-mini

Explaining how to create a graduation thesis with ChatGPT! Also introduce points and points to noteExplaining how to create a graduation thesis with ChatGPT! Also introduce points and points to noteMay 16, 2025 am 03:07 AM

ChatGPT: A powerful ally in writing graduation thesis, but don't forget to be ethics and responsibility! ChatGPT is a powerful tool to streamline and improve the quality of your graduation thesis. However, it is essential to use it in compliance with academic ethics, with always keeping in mind that it is the ultimate responsibility of the author himself. In this article, we will explain in seven steps how to create a graduation thesis using ChatGPT. From theme selection to final proofreading, learn how to effectively utilize ChatGPT and aim to create a fulfilling paper. table of contents A step to prepare graduation thesis using ChatGPT

Make your email creation more efficient with ChatGPT! Explaining examples of prompts and points to be careful aboutMake your email creation more efficient with ChatGPT! Explaining examples of prompts and points to be careful aboutMay 16, 2025 am 02:48 AM

Efficient writing of business emails: Use ChatGPT to improve efficiency Business email is an indispensable tool in business communication, but writing is time-consuming and labor-intensive. In particular, business emails require strict language and formatting and need to be carefully considered. This article will introduce how to use the latest AI technologies to write high-quality emails efficiently. We will explain how to use the conversational AI service ChatGPT developed by OpenAI, as well as email writing tips, precautions and common tools. Helps you write business emails smoothly and greatly improve work efficiency. We also provide the AI-enabled marketing tool "AI Marketer". Reservations are now accepted. Interested friends please click the link below to view details. ▼Service details and application▼ AI Marketing Tool

How Powerful Nations Are Using Visas To Win The Global AI Talent RaceHow Powerful Nations Are Using Visas To Win The Global AI Talent RaceMay 16, 2025 am 02:13 AM

The globe's leading nations are fiercely competing for a shrinking group of elite AI researchers. They are employing accelerated visa procedures and fast-tracked citizenship to draw in the top international talent. This international race is turning

Do I need a phone number to register for ChatGPT? We also explain what to do if you can't registerDo I need a phone number to register for ChatGPT? We also explain what to do if you can't registerMay 16, 2025 am 01:24 AM

No mobile number is required for ChatGPT registration? This article will explain in detail the latest changes in the ChatGPT registration process, including the advantages of no longer mandatory mobile phone numbers, as well as scenarios where mobile phone number authentication is still required in special circumstances such as API usage and multi-account creation. In addition, we will also discuss the security of mobile phone number registration and provide solutions to common errors during the registration process. ChatGPT registration: Mobile phone number is no longer required In the past, registering for ChatGPT required mobile phone number verification. But an update in December 2023 canceled the requirement. Now, you can easily register for ChatGPT by simply having an email address or Google, Microsoft, or Apple account. It should be noted that although it is not necessary

Top Ten Uses Of AI Puts Therapy And Companionship At The #1 SpotTop Ten Uses Of AI Puts Therapy And Companionship At The #1 SpotMay 16, 2025 am 12:43 AM

Let's delve into the fascinating world of AI and its top uses as outlined in the latest analysis.This exploration of a groundbreaking AI development is a continuation of my ongoing Forbes column, where I delve into the latest advancements in AI, incl

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Roblox: Bubble Gum Simulator Infinity - How To Get And Use Royal Keys
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Nordhold: Fusion System, Explained
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
Mandragora: Whispers Of The Witch Tree - How To Unlock The Grappling Hook
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Clair Obscur: Expedition 33 - How To Get Perfect Chroma Catalysts
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

MinGW - Minimalist GNU for Windows

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools