


Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators
In March 2024, in the "SuperBench Large Model Comprehensive Capability Evaluation Report" recently released by the Basic Model Research Center of Tsinghua University, the report comprehensively evaluated 14 influential models at home and abroad.
In this report, the outstanding performance of Wenian 4.0 has attracted widespread attention. Its overall performance is close to the top international models, and it is gradually narrowing the gap with the world's leading models, showing that it has become the leading domestic model.
In the evaluation of human alignment ability, Text 4.0 showed outstanding strength and ranked first in the country without any doubt. At the same time, in the evaluation of Chinese reasoning and Chinese language ability, Text 4.0 is also the best. Compared with other models, its advantages are very obvious. Especially in the evaluation of Chinese understanding, the score of Text 4.0 is 0.41 points higher than the second-placed GLM-4, showing its profound skills in Chinese processing.
In the evaluation of mathematical capabilities for semantic understanding, Text 4.0 and Claude-3 models tied for first place in the world, while the well-known GPT-4 series models followed closely behind, ranking fourth and fifth. The scores of other models are mostly concentrated around 55 points, and there is a significant gap between the leading groups.
#In the evaluation of reading comprehension ability, Wenxin 4.0 also shines. It not only surpassed GPT-4 Turbo and Claude-3, but also surpassed GLM-4 and achieved the highest score.
In the security evaluation that enterprises are most concerned about, Text GPT 4.0 also showed excellent performance. It reached a high score of 89.1 points, surpassing the world-class GPT-4 series models and Claude-3. ranked first, while Claude-3 only ranked fourth in this review.
The report also mentioned that since Wenxinyiyan made its public debut on March 16 last year, it has achieved a breakthrough in the number of users in a short period of time, and currently has more than 200 million users. At the same time, the number of daily API calls is also extremely active, exceeding 200 million times.
The above is the detailed content of Wenxin 4.0 performed well in the SuperBench evaluation, leading in many indicators. For more information, please follow other related articles on the PHP Chinese website!

This tutorial guides you through building a serverless image processing pipeline using AWS services. We'll create a Next.js frontend deployed on an ECS Fargate cluster, interacting with an API Gateway, Lambda functions, S3 buckets, and DynamoDB. Th

This pilot program, a collaboration between the CNCF (Cloud Native Computing Foundation), Ampere Computing, Equinix Metal, and Actuated, streamlines arm64 CI/CD for CNCF GitHub projects. The initiative addresses security concerns and performance lim

This Go-based network vulnerability scanner efficiently identifies potential security weaknesses. It leverages Go's concurrency features for speed and includes service detection and vulnerability matching. Let's explore its capabilities and ethical


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

SublimeText3 English version
Recommended: Win version, supports code prompts!

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Mac version
God-level code editing software (SublimeText3)

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Atom editor mac version download
The most popular open source editor