search
HomeTechnology peripheralsAIClaude3 taught GPT4 a lesson! Open AI's strongest opponent is a late-night bomb, with full picture analysis!

Produced by 51CTO Technology Stack (WeChat ID: blog51cto)

Late at night, OpenAI’s strongest rival Anthropic released three new SOTA products in one go, including: Claude 3 Haiku, Claude 3 Sonnet and Claude 3 Opus. The industry exclaimed: Claude 3 brings a series of new industry benchmarks.

Among them, Opus means an epic movement, Sonnet means a sonnet, and Haiku means a haiku, a three-line short poem.

It is reported (Figure 9) that Haiku is the fastest and most cost-effective model in the AI ​​market. It can read informative and data-intensive research papers (~10k tokens) on arXiv containing charts and graphs in less than three seconds.

Figure 8, for the vast majority of workloads, Sonnet is 2 times faster than Claude 2 and Claude 2.1, and has a higher level of intelligence. It excels at tasks that require fast responses, such as knowledge retrieval or sales automation. Opus is similar in speed to Claude 2 and 2.1, but has a higher level of intelligence.

Opus is Anthropic’s smartest model currently, outperforming its peers on most common evaluation benchmarks for artificial intelligence systems, including Undergraduate Level Expert Knowledge (MMLU), Graduate Level Expert Reasoning (GPQA), and Basic Mathematics (GSM8K) etc. It demonstrates near-human-level understanding and fluency on complex tasks, leading the frontier of general intelligence.

The above three Claude 3 models all show greater capabilities in analysis and prediction, detailed content creation, code generation, and conversation in non-English languages ​​such as Spanish, Japanese, and French.

In addition, the Claude 3 series has near-perfect memory capabilities and ultra-long text windows, which will provide 200K context windows. And it can accept the input of more than 1 million tokens, and will be gradually released according to customer needs.

Overall, Claude3 has three shocking features:

1. The benchmark of domain experts. Three expert fields of finance/medicine/philosophy were selected as test benchmarks. NVIDIA Research Manager Jim Fan said that "it is recommended that all LLMs follow this so that different downstream applications know what will happen."

2. Rejection rate analysis. LLM has become an epidemic with overly cautious answers to many "innocent questions". Claude has been committed to safe AI research and has made efforts in this area.

3. Complex visual features comparable to other leading models. Works with a variety of visual formats including photos, diagrams, graphics and technical diagrams, PDFs, flowcharts or presentation slides.

How is the actual evaluation?
A netizen asked GPT4 and Claude3 to write a login interface code respectively. The actual running test results (Figure 13) show that the latter is better in App interface design.

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

##

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

Claude3给GPT4上了一课!Open AI最强对手深夜炸弹,附全图解析!

The above is the detailed content of Claude3 taught GPT4 a lesson! Open AI's strongest opponent is a late-night bomb, with full picture analysis!. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
得益于OpenAI技术,微软必应的搜索流量超过谷歌得益于OpenAI技术,微软必应的搜索流量超过谷歌Mar 31, 2023 pm 10:38 PM

截至3月20日的数据显示,自微软2月7日推出其人工智能版本以来,必应搜索引擎的页面访问量增加了15.8%,而Alphabet旗下的谷歌搜索引擎则下降了近1%。 3月23日消息,外媒报道称,分析公司Similarweb的数据显示,在整合了OpenAI的技术后,微软旗下的必应在页面访问量方面实现了更多的增长。​​​​截至3月20日的数据显示,自微软2月7日推出其人工智能版本以来,必应搜索引擎的页面访问量增加了15.8%,而Alphabet旗下的谷歌搜索引擎则下降了近1%。这些数据是微软在与谷歌争夺生

ChatGPT出现隐私漏洞,可能泄露用户和聊天机器人的对话标题ChatGPT出现隐私漏洞,可能泄露用户和聊天机器人的对话标题Apr 07, 2023 pm 11:21 PM

Reddit和Twitter上的用户从3月20日开始报告了ChatGPT的一个漏洞,并发布了一些屏幕截图,显示他们的ChatGPT网页历史记录中包含他们不熟悉的对话标题。虽然以这种方式似乎无法访问共享聊天内容,但OpenAI公司在关闭该漏洞时完全删除了聊天历史记录。根据行业媒体的报道,ChatGPT在当天还出现了重大中断,那些可以访问的用户注意到提供了不一致的服务。OpenAI公司在其状态页面上记录了中断情况,并在最初报告的几个小时内恢复了服务。OpenAI公司的首席执行官 Sam Altman

LLM之战,谷歌输了!越来越多顶尖研究员跳槽OpenAILLM之战,谷歌输了!越来越多顶尖研究员跳槽OpenAIApr 07, 2023 pm 05:48 PM

​前几天,谷歌差点遭遇一场公关危机,Bert一作、已跳槽OpenAI的前员工Jacob Devlin曝出,Bard竟是用ChatGPT的数据训练的。随后,谷歌火速否认。而这场争议,也牵出了一场大讨论:为什么越来越多Google顶尖研究员跳槽OpenAI?这场LLM战役它还能打赢吗?知友回复莱斯大学博士、知友「一堆废纸」表示,其实谷歌和OpenAI的差距,是数据的差距。「OpenAI对LLM有强大的执念,这是Google这类公司完全比不上的。当然人的差距只是一个方面,数据的差距以及对待数据的态度才

CIO分享:企业IT应谨慎使用生成式AI向前发展CIO分享:企业IT应谨慎使用生成式AI向前发展Apr 11, 2023 pm 03:49 PM

Vince Kellen是美国加州大学圣地亚哥分校(UCSD)的首席信息官,他深知ChatGPT、DALL-E和其他生成式AI技术有据可查的局限性:生成的答案可能并不真实,生成的图像也可能缺乏完整性,输出可能存在偏差。但无论如何他都在向前推进,他表示,员工们已经在使用ChatGPT来编写代码和工作内容描述了。OpenAI的文本生成技术ChatGPT以及图像生成技术DALL-E在一系列吸引了公众想象力的大型语言模型(也称为生成语言模型或者生成式AI)中是最突出的,这些模型响应书面请求以生成从文本文

美媒担忧:ChatGPT们生成的摘要足够好,读者不来看新闻怎么办美媒担忧:ChatGPT们生成的摘要足够好,读者不来看新闻怎么办Apr 08, 2023 pm 11:31 PM

据报道,美国新闻行业正将AI聊天机器人​视为一种新的生存威胁。他们担心人们会认为聊天机器人提供的文章摘要已经足够好,从而不再访问他们的网站,致使读者和广告商流失。然而,也有媒体高管认为,尽管存在潜在的威胁,但也有机会。他们正试图在行业变革中领先一步,以适应读者获取信息方式的演变。以下是翻译内容当你向微软Bing聊天机器人询问美国前总统唐纳德·特朗普(Donald Trump)是否被起诉时,它的回答会让传媒高管们感到害怕。机器人给出的三句摘要似乎很有用,它不仅提供了CNN、华盛顿邮报等新闻媒体的链

ChatGPT技术国产化尝试ChatGPT技术国产化尝试Apr 08, 2023 am 11:31 AM

本次分享题目为 ChatGPT 技术、国产化尝试和开源模型。分享包含三大部分的内容,第一部分总体介绍 ChatGPT 相关的技术:ChatGPT 技术的演进、目前存在什么样的问题、ChatGPT 技术学习的三个阶段、数据组织和效果评估;第二部分分享我们在 ChatGPT 技术国产化方面进行的尝试,包含实验过程中我们遇到的问题、进行的思考以及模型的效果和应用;第三部分介绍我们已经发布的中文开源大模型,使用自有数据训练出本地模型如何进行操作,在实验过程中可能遇到的问题,和开源的先进模型相比存在的差距

用ChatGPT秒建大模型!OpenAI全新插件杀疯了,接入代码解释器一键get用ChatGPT秒建大模型!OpenAI全新插件杀疯了,接入代码解释器一键getApr 04, 2023 am 11:30 AM

ChatGPT可以联网后,OpenAI还火速介绍了一款代码生成器,在这个插件的加持下,ChatGPT甚至可以自己生成机器学习模型了。 ​上周五,OpenAI刚刚宣布了惊爆的消息,ChatGPT可以联网,接入第三方插件了!而除了第三方插件,OpenAI也介绍了一款自家的插件「代码解释器」,并给出了几个特别的用例:解决定量和定性的数学问题;进行数据分析和可视化;快速转换文件格式。此外,Greg Brockman演示了ChatGPT还可以对上传视频文件进行处理。而一位叫Andrew Mayne的畅销作

GPT-4掀起新一轮AI风暴,被围堵的文心一言能否一战?GPT-4掀起新一轮AI风暴,被围堵的文心一言能否一战?Apr 11, 2023 pm 05:43 PM

将文心一言发布时间定在3月16日的百度,没能预料到会遭到来自OpenAI、谷歌、微软的轮番轰炸:先是3月15日凌晨,OpenAI发布大型多模态Transformer模型GPT-4;紧接着,宣布开放大规模语言模型PaLM的API接口,并推出面向开发者的工具MakerSuite;文心一言发布之后,巨头们也并没有歇着,3月16日晚间,微软更是发布由AI驱动的办公神器Microsoft 365 Copilot,号称让Word、PPT、Excel、OutLook、协同办公软件的生产力都飙增。文心一言对标C

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 English version

SublimeText3 English version

Recommended: Win version, supports code prompts!

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),