Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators-It Industry-php.cn

Home

Technology peripherals

It Industry

Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Apr 23, 2024 pm 01:37 PM

Wenxinyiyanapi call

In March 2024, in the "SuperBench Large Model Comprehensive Capability Evaluation Report" recently released by the Basic Model Research Center of Tsinghua University, the report comprehensively evaluated 14 influential models at home and abroad.

In this report, the outstanding performance of Wenian 4.0 has attracted widespread attention. Its overall performance is close to the top international models, and it is gradually narrowing the gap with the world's leading models, showing that it has become the leading domestic model.

Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators

In the evaluation of human alignment ability, Text 4.0 showed outstanding strength and ranked first in the country without any doubt. At the same time, in the evaluation of Chinese reasoning and Chinese language ability, Text 4.0 is also the best. Compared with other models, its advantages are very obvious. Especially in the evaluation of Chinese understanding, the score of Text 4.0 is 0.41 points higher than the second-placed GLM-4, showing its profound skills in Chinese processing.

In the evaluation of mathematical capabilities for semantic understanding, Text 4.0 and Claude-3 models tied for first place in the world, while the well-known GPT-4 series models followed closely behind, ranking fourth and fifth. The scores of other models are mostly concentrated around 55 points, and there is a significant gap between the leading groups.

Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators

#In the evaluation of reading comprehension ability, Wenxin 4.0 also shines. It not only surpassed GPT-4 Turbo and Claude-3, but also surpassed GLM-4 and achieved the highest score.

In the security evaluation that enterprises are most concerned about, Text GPT 4.0 also showed excellent performance. It reached a high score of 89.1 points, surpassing the world-class GPT-4 series models and Claude-3. ranked first, while Claude-3 only ranked fourth in this review.

The report also mentioned that since Wenxinyiyan made its public debut on March 16 last year, it has achieved a breakthrough in the number of users in a short period of time, and currently has more than 200 million users. At the same time, the number of daily API calls is also extremely active, exceeding 200 million times.

The above is the detailed content of Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:ITBear科技资讯. If there is any infringement, please contact admin@php.cn delete

Serverless Image Processing Pipeline with AWS ECS and LambdaApr 18, 2025 am 08:28 AM

This tutorial guides you through building a serverless image processing pipeline using AWS services. We'll create a Next.js frontend deployed on an ECS Fargate cluster, interacting with an API Gateway, Lambda functions, S3 buckets, and DynamoDB. Th

CNCF Arm64 Pilot: Impact and InsightsApr 15, 2025 am 08:27 AM

This pilot program, a collaboration between the CNCF (Cloud Native Computing Foundation), Ampere Computing, Equinix Metal, and Actuated, streamlines arm64 CI/CD for CNCF GitHub projects. The initiative addresses security concerns and performance lim

Building a Network Vulnerability Scanner with GoApr 01, 2025 am 08:27 AM

This Go-based network vulnerability scanner efficiently identifies potential security weaknesses. It leverages Go's concurrency features for speed and includes service detection and vulnerability matching. Let's explore its capabilities and ethical

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months agoByDDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks agoByDDD

Hot Tools

SublimeText3 English version

Recommended: Win version, supports code prompts!

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version

God-level code editing software (SublimeText3)

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Atom editor mac version download

The most popular open source editor

Hot Topics

Where is the login entrance for gmail email?

7635

CakePHP Tutorial

1391

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

148