search
HomeTechnology peripheralsAIGrok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket

GPT-5 is not out yet, Grok has caught up.

On the same day that Google and OpenAI were grabbing news from each other, Musk’s xAI was not idle either.

On Wednesday afternoon Beijing time, xAI officially released the new generation Grok 2 large model.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Chatbot Arena, a third-party large model benchmark organization, also immediately updated the results list of the LMSYS list. The early model of Grok 2 (sus-column-r) can be ranked fourth just behind GPT-4o (version 0513), outperforming Claude 3.5 Sonnet and GPT-4-Turbo.

It excels at coding, complex problems and math.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Musk couldn't help but boast, "Grok's propulsion speed is like a rocket."
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Note that this is only the score of the early version. Chatbot Arena said that it will test the official version in the future.

Musk said that Grok-2 is an advanced language model with the most advanced reasoning capabilities. The new generation includes two versions: Grok-2 and Grok-2 mini. Both models are now released on the X platform for Grok users. Currently, X Premium and Premium+ users can already experience the Grok-2 and Grok-2 mini models.

Compared with the previous Grok-1.5, the early preview version of Grok-2 has achieved significant progress, demonstrating leading capabilities in chat, reasoning, coding, etc. Grok-2 and Grok-2 mini are currently in beta on X and will be available via an enterprise API later this month, xAI said.

Less than half an hour after the new model was released, a netizen was already showing off the results. He used Grok 2 mini to generate an image of "Me and Musk eating hot dogs."
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Try other methods to generate a portrait of Washington.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Some people also tried Grok 2 mini to generate a flying cat.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Someone else built a Tesla Model Y, does it look similar?
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Grok-2 performance PK

With xAI putting the early version of Grok-2 "sus-column-r" into Chatbot Arena, we see it competing with other popular switches Performance comparison of source models.

In terms of overall Elo score, Grok-2 performs better than Claude series models and most versions of GPT-4. Of course, the first one on the list is GPT-4o (version August 8), which OpenAI just released these days.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
The picture below shows the Win Rate comparison between Grok-2 and other popular models.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
The picture below shows a fact-based win rate comparison between the two versions of Grok 1.5 and Grok 2.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
xAI adopts this process to evaluate the Grok 2 model, using AI Tutors to truly interact with the model in various tasks. During each interaction, Grok 2 provides two responses to the AI ​​Tutors and then selects the best response based on specific criteria listed in the guide.

xAI focuses on evaluating model performance in two key areas, namely instruction following and providing accurate, authentic information. The results show significant improvements in Grok 2's ability to reason from retrieved content and use tools such as correctly identifying missing information, reasoning through sequences of events, discarding irrelevant posts, etc.

Benchmark Scores

xAI evaluated the Grok-2 model across a range of academic benchmarks including Reasoning, Reading Comprehension, Mathematics, Science, and Coding.

Both the Grok-2 and Grok-2 mini are significant improvements over the previous Grok-1.5 model. Performance is comparable to other cutting-edge models in areas such as graduate-level science knowledge (GPQA), general knowledge (MMLU, MMLU-Pro), and mathematics competition problems (MATH).

In addition, Grok-2 also performs well on vision-based tasks, with remarkable performance in visual mathematical reasoning (MathVista) and document-based question answering (DocVQA).
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Grok 2 interface and functionality "big makeover"

In the past few months, xAI has been continuously improving the Grok experience on the x platform. Now, with the launch of the next generation Grok 2, xAI has redesigned the interface, as shown below.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Of course, xAI provides some new features, such as a simple implementation of Conway's "Game of Life".
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Another example is multi-modal understanding ability (looking at pictures and talking).
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Among them, Grok-2 is xAI’s most advanced AI assistant, with text and visual understanding capabilities and integrated real-time information from the X platform, accessible through the Grok tab in the X application.

The Grok-2 mini is a small but powerful model that strikes a good balance between speed and answer quality.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket

Compared to its predecessor, Grok-2 is more intuitive, more controllable and more flexible, suitable for a variety of tasks, whether you are looking for answers, collaborative writing or solving coding tasks.

Additionally, xAI is partnering with startup Black Forest Labs to experiment with their FLUX.1 model to extend Grok’s capabilities on X.
Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket
Later this month, xAI will also release Grok-2 and Grok-2 mini to developers via a new enterprise API platform. The upcoming API is built on a new custom technology stack that allows for multi-region inference deployments for global low-latency access.

Of course, xAI also offers some enhanced security features, such as mandatory multi-factor authentication (e.g. using Yubikey, Apple TouchID or TOTP).

As you can see, since the launch of Grok-1 in November 2023, xAI has been advancing this series of models at an alarming rate. Soon, they will release a preview version with multi-modal understanding. The focus after xAI will be to improve the core reasoning capabilities of the model through new computing clusters.

Blog address: https://x.ai/blog/grok-2

The above is the detailed content of Grok-2 is here, it can generate images and recognize images, and its performance is comparable to GPT-4o. Musk: It is developing like a rocket. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
人体试验要泡汤?马斯克Neuralink面临联邦调查,实验动物死亡频发人体试验要泡汤?马斯克Neuralink面临联邦调查,实验动物死亡频发Apr 12, 2023 pm 05:37 PM

上周,马斯克举办了 Neuralink 的​​ Show & Tell ​​​演示活动,向世人展示了脑机接口的最新进展。会上,马斯克表示,从原型到生产非常困难,面临诸多挑战。Neuralink 一直在努力启动人体试验,并且已向 FDA 提交了开始人体试验所需的所有文件。马斯克估计,第一个 Neuralink 设备可能会在 5-6 个月内进入人脑。会上马斯克强调, Neuralink 尊重动物受试者,并且脑机接口设备植入动物体内之前已经进行了广泛的基准测试。两只猴子 Pager 和

马斯克反讽人工智能AI炒作:“机器学习”本质就是统计马斯克反讽人工智能AI炒作:“机器学习”本质就是统计Jun 13, 2023 pm 12:13 PM

驱动中国2023年6月12日消息,近日,特斯拉CEO埃隆·马斯克周六在推特上发布了一张图片,疑似讽刺当前关于“人工智能”的炒作现象。图文显示,一位戴着“MachineLearning”面罩的路人,将其面罩摘下是一张写着“Statistics(统计)”的面孔。寓意当前大火的人工智能AI本质就是数据统计的结果。值得注意的是,持此类意见的科技领袖恐不在少数,之前马斯克曾与苹果联合创始人史蒂夫沃兹尼亚克以及上千名AI研究人员联署公开信,呼吁暂停研究更先进的AI技术。然而,此信遭到许多专家甚至签名者的质疑

X 网站出现无法屏蔽的广告,会诱导用户点击X 网站出现无法屏蔽的广告,会诱导用户点击Oct 10, 2023 pm 04:37 PM

根据Mashable的报道,据称在X网站(原名Twitter)的移动应用程序中,用户在他们的ForYou信息流中发现了一些没有标注的广告。当用户点击这些广告时,会跳转到其他网站,而且没有办法屏蔽或举报这些广告这些新出现的广告与普通广告有所不同。普通广告只是来自X网站账号的帖子,并且带有一个“Ad”的标签。而这些新广告没有与之相关联的账号,仅由书面文本、照片和虚假头像组成,使其看起来像正常的推文。它们旨在诱导用户点击。以下是它们的样子:如果用户只是随意地滑动屏幕,嵌入的图片和吸引眼球的文本可能会让

马斯克,脑机接口,第一刀马斯克,脑机接口,第一刀Jun 04, 2023 am 09:49 AM

从“硅谷钢铁侠”到“现实钢铁侠”,马斯克成为“人类托尼・史塔克”,正在逐渐成为现实。就在几天前,马斯克脑机接口公司Neuralink宣布迎来重大进展——已经获得美国食品和药物管理局(FDA)的批准,将启动其首个人体临床研究,这意味着,他们的设备将植入人类的大脑中。据悉他们会专注于两个应用:恢复人类视力,帮助无法移动肌肉的人控制智能手机等设备。在去年11月,马斯克曾放出豪言,称Neuralink距离首次人体试验还有大约6个月的时间。可是后来,由于安全风险大、违反动物权益、涉嫌非法运输危险病原体..

新推特CEO曾为马斯克工作20年,带全家居住在办公室!新推特CEO曾为马斯克工作20年,带全家居住在办公室!May 06, 2023 pm 08:43 PM

自从马斯克在推特上搞的关于「自己要不要辞职CEO」的投票结果出炉以来,从媒体到公众都在关心一个问题:​谁来接班?据外媒theinformation报道,接班人可能是一位马斯克的忠实拥护者,马斯克的隧道挖掘公司TheBoringCompany的CEO,宇航工程师SteveDavis。据TheInformation报道,今年43岁SteveDavis是和马斯克一样,也是一位「拼命三郎」。此前,他已经在Twitter总部的办公室里睡了两个月。而且还是和他刚刚分娩的妻子,和他们刚刚出生的孩子一起搬过来

马斯克:致力于在 X 平台上实现“真正的金钱”运作,不计划推出自家加密货币马斯克:致力于在 X 平台上实现“真正的金钱”运作,不计划推出自家加密货币Sep 14, 2023 pm 09:53 PM

马斯克昨日在X平台上引用了此前设计了X平台图标的设计师的发言。他表示,X平台将专注于让真正的金钱在平台上运行,同时不会开发自己的加密货币据悉,X用户“DogeDesigner”昨日发布贴文声称:“X不会推出任何X币。该团队更专注于让真正的金钱在这个App上运作,而不是一些‘替代货币’。”▲图源X用户“DogeDesigner”的发言对此,马斯克回应称:“正确”。▲图源马斯克的发言我们之前报道过,这位“DogeDesigner”实际上是一家加密货币公司的首席执行官,同时也是X平台图标的设计师。马斯

马斯克返老还童?实际是AI生成的本人婴儿照马斯克返老还童?实际是AI生成的本人婴儿照Jun 07, 2023 am 10:00 AM

2023-06-0705:29:10作者:人宝宝最近,一张由AI生成的马斯克婴儿照片在社交媒体上流传开来,引起了网友热议和质疑。据悉,这张照片是一个名为“NotJeromePowel”的用户分享的。在发布时,他幽默地说:“据报道,埃隆·马斯克正在研究一种抗衰老配方,但结果失控了。”该照片很快引起了广泛关注,并获得了数万个点赞。马斯克本人也加入了这场对话,他以幽默的方式回应了粉丝们:“伙计们,我想我可能吃太多了。”并附有一个婴儿表情符号。发布照片的用户同样风趣回应称:“这样你就有足够的时间去火星了

马斯克计划与扎克伯格对决,但后者未在家且不会与其对打马斯克计划与扎克伯格对决,但后者未在家且不会与其对打Aug 17, 2023 pm 09:29 PM

据报道,马斯克对于扎克伯格不认真对待他们之间的争斗后,决定放弃笼斗计划。然而,马斯克却表示他将会突然出现在Meta首席执行官马克·扎克伯格的家门口,并进行一场拳击对决马斯克今天在X上发帖表示:“今晚我将在帕洛阿尔托进行特斯拉FSD试驾,计划将车开到@finkd的家。”Finkd是扎克伯格在X上的用户名。“如果我们有幸而扎克伯格真的开门,那么战斗就开始了!”马斯克还表示他将在X上直播这次“冒险”扎克伯格对此并不感冒,他的发言人伊斯卡・萨里克(IskaSaric)告诉《TheVerge》,“马克现在

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

SAP NetWeaver Server Adapter for Eclipse

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

Atom editor mac version download

Atom editor mac version download

The most popular open source editor

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.