Alibaba Cloud releases Tongyi Qianwen 2.0, which surpasses GPT-3.5 in performance and is accelerating its pursuit of GPT-4

On October 31, Alibaba Cloud officially released Tongyi Qianwen 2.0, a large model with hundreds of billions of parameters. Across 10 authoritative evaluations, the comprehensive performance of Tongyi Qianwen 2.0 exceeded that of GPT-3.5, and the model is now accelerating to catch up with GPT-4. On the same day, the Tongyi Qianwen app was officially launched in major mobile app stores, so anyone can try the latest model capabilities directly through the app.

Over the past six months, Tongyi Qianwen has made a huge leap in performance. Compared with version 1.0, released in April, Tongyi Qianwen 2.0 has improved significantly in understanding complex instructions, literary creation, general mathematics, knowledge memory, and resistance to hallucinations. At present, the comprehensive performance of Tongyi Qianwen exceeds that of GPT-3.5 and is accelerating to catch up with GPT-4.


Picture: Tongyi Qianwen 2.0's comprehensive performance has exceeded GPT-3.5 and is accelerating to catch up with GPT-4


On 10 mainstream benchmark evaluation sets, including MMLU, C-Eval, GSM8K, HumanEval, and MATH, Tongyi Qianwen 2.0's overall score surpassed that of Meta's Llama-2-70B. Against OpenAI's GPT-3.5 it recorded nine wins and one loss, and against GPT-4 it recorded four wins and six losses, further narrowing the gap with GPT-4.

The ability to understand Chinese and English is the basic skill of a large language model.

On English tasks, Tongyi Qianwen 2.0 scored 82.5 on the MMLU benchmark, second only to GPT-4. With a significantly larger number of parameters, Tongyi Qianwen 2.0 can better understand and process complex language structures and concepts. On Chinese tasks, Tongyi Qianwen 2.0 achieved the highest score on the C-Eval benchmark by a clear margin. This is because the model learned from more Chinese corpus data during training, which further strengthened its Chinese understanding and expression capabilities.

In areas such as mathematical reasoning and code understanding, Tongyi Qianwen 2.0 has made significant progress. On the reasoning benchmark GSM8K, Tongyi Qianwen ranked second, demonstrating strong computing and logical reasoning capabilities. On the HumanEval test, Tongyi Qianwen's score ranked just behind those of GPT-4 and GPT-3.5. HumanEval mainly measures a large model's ability to understand and execute code fragments, a capability that underpins the use of large models in scenarios such as programming assistance and automatic code repair.


Picture: Tongyi Qianwen 2.0 release


Tongyi Qianwen is also more mature and easier to use. Tongyi Qianwen 2.0 has been technically optimized for instruction compliance, tool use, and refined creation, so it can be better integrated into downstream application scenarios. The Tongyi large model official website has launched multi-modal and plug-in functions, supporting specific tasks such as image input and document parsing.

At the same time, eight industry-specific models trained on the Tongyi large model were launched. They include Tongyi Lingma (intelligent coding assistant), Tongyi Zhiwen (AI reading assistant), Tongyi Tingwu (AI assistant for work and study), Tongyi Xiaomi (intelligent customer service), Tongyi Renxin (personal health assistant), and Tongyi Farui (AI legal advisor). The eight industry models target the most popular vertical scenarios and use domain data for specialized training. Users can try the model functions directly on the official website, and developers can integrate the model capabilities into their own large model applications and services through web page embedding, API/SDK calls, and similar methods.

Picture: The Tongyi large model family has been fully upgraded, and eight industry models are online


As of October, Alibaba Cloud has cooperated in depth with more than 60 industry leaders to promote the adoption of Tongyi Qianwen in office work, cultural tourism, electric power, government affairs, medical insurance, transportation, manufacturing, finance, software development, and other fields.

Zhou Jingren revealed that Alibaba Cloud plans to open source the 72B version of Tongyi Qianwen in the near future. Alibaba Cloud has previously open sourced the 7B and 14B versions of the model, which have been downloaded more than 1 million times in total. Alibaba Cloud will continue to support developers from thousands of industries in building models and applications based on the Tongyi Qianwen open source models.


Picture: Tongyi Qianwen 72B will be open sourced soon


Statement
This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to have it removed.