search
HomeTechnology peripheralsAIAfter completing the complete evaluation of GPT-4, Microsoft's hot paper said that the first version of AGI is coming soon

In 1956, at a seminar held at Dartmouth College, the concept of artificial intelligence was formally proposed.

The term has since challenged psychologists, philosophers and computer scientists because it is so difficult to define. In 1994, 52 psychologists published a joint paper trying to capture its essence.

As time went by, researchers began to shift their attention to AI systems in specific fields, such as the 2016 AlphaGo challenge to the Korean champion chess player and a great victory. Then, in the late 1990s and early 2000s, researchers were no longer satisfied with specialized AI, so calls for the development of more general artificial intelligence systems grew. Along with this, the term Artificial General Intelligence (AGI) began to gain popularity in the early 2000s.

In recent times, as you have seen, large language models (LLM) have come into the spotlight. These neural networks are based on the Transformer architecture and trained on large text data sets. In particular, OpenAI's latest release of GPT-4 demonstrates the versatility of large-scale language models and is proficient in mathematics, writing, law, medicine and other fields.

We can’t help but ask, is GPT-4 an important step towards AGI?

The answer given by Microsoft is yes. In a recently released paper, they elaborated on this point of view. This article provides a comprehensive evaluation of GPT-4. Microsoft believes that "given the breadth and depth of GPT-4's capabilities, we believe it should reasonably be considered an early (but still incomplete) version of an artificial general intelligence (AGI) system."

Microsoft also stated, "The main goal of this article is to explore the capabilities and limitations of GPT-4. We believe that the intelligence of GPT-4 marks a true paradigm shift in computer science and other fields."

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Paper address: https://arxiv.org/pdf/2303.12712.pdf

Interesting Unfortunately, this popular paper was also found to have a lot of deletions, so someone found the unabridged version of the paper.

In the uncut version, this blogger also revealed a lot of hidden details. For example, the internal name of GPT-4 is DV-3, which is actually the hidden part of the paper. The third author was later deleted; these Microsoft researchers did not seem to know much about the technical details of GPT-4. In addition, the blogger also revealed that the part about toxic content was deleted when this paper was published (to prevent negative effects on OpenAI?).

We have pasted the blogger’s Twitter thread below for those who are interested to check it out.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

##Twitter thread: https://twitter.com/DV2559106965076/status/1638769434763608064

Back to the article itself.

According to the article, AGI is the specific ability to reason, plan, solve problems, think abstractly, understand complex ideas, learn quickly, and learn from experience. Starting from these capabilities, the paper conducts interesting experiments and evaluations.

The paper is divided into 10 chapters: Chapter 1 is the general part; Chapter 2 introduces multi-modality, mainly related to visual generation content; Chapter 3 code, generated according to instructions Code, understanding existing code; Chapter 4 Mathematical Ability; Chapter 5 Interaction with the World; Chapter 6 Interaction with Humans; Chapter 7 Discrimination; Chapter 8 GPT-4 Limitations; Chapter 9 Social Impact; Chapter 10: Future Directions and Conclusions.

Let’s use specific examples to see if GPT-4 has really entered the AGI era.

Multimodal and interdisciplinary composition

To test the model’s ability to combine art with programming, the study asked GPT-4 to write a piece of code in JavaScript to Generate Kandinsky-style random images. The first picture below was created by Wassily Kandinsky, and the second and third pictures were generated by GPT-4 and ChatGPT respectively:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

The following is the GPT-4 code implementation process:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Perform visual concept understanding: In this drawing task, input prompts to let the model combine the shapes of the letters Y, O, and H to draw a person. In fact, in the training process of GPT-4, there is no knowledge about the shape of letters. It can only vaguely learn that letters are related to some specific shapes from relevant training data. The results show that the results generated by GPT-4 are not bad:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

For sketch generation: GPT-4 can also be combined with Stable Diffusion. The picture below is a screenshot of 3D city modeling. The input prompt has a river flowing from left to right, a desert with pyramids built next to the river, and 4 buttons at the bottom of the screen, the colors are green, blue, brown and red. The following is the generated result:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

## You can also ask GPT-4 to generate and modify tunes using ABC notation:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Programming ability

GPT-4 has very strong programming ability, including writing code according to instructions and understanding existing code. The study specifically tested GPT-4's programming capabilities.

Code writing

Figure 3.1 below is an example of letting GPT-4 write python functions. This study Use LeetCode to determine whether the code is correct online.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

The study then let GPT-4 visualize the accuracy data on LeetCode in Table 2 above As a graph, the results are shown in Figure 3.2 below.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

##Front-end/Game Development## As shown in Figure 3.3 below, the study asked GPT-4 to write a 3D game in HTML using JavaScript. GPT-4 generated a game that met all requirements with zero samples.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Deep Learning Programming

Writing code for deep learning requires math , statistical knowledge, and familiarity with frameworks and libraries such as PyTorch, TensorFlow, Keras, etc. As shown in Figure 3.4 below, researchers require GPT-4 and ChatGPT to write custom optimizer modules, a task that is also challenging for human deep learning experts. Researchers provide natural language descriptions for GPT-4 and ChatGPT, which include a series of important operations, such as applying SVD and so on.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soonIn addition, the study also tested GPT-4’s ability to convert code into LaTex formulas, and the results are shown in Figure 3.5 below.

In terms of understanding code, this research attempts to let GPT-4 and ChatGPT "understand" a C/C program and predict the output results of the program. The performance of the two is as follows:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

The study then asked GPT-4 to interpret a piece of Python code:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Also There is a piece of pseudo code to explain:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Mathematical ability

For a long time, the mathematical ability of large language models has not seemed to be very good. So how does GPT-4 perform in this regard? After a series of evaluations in this article, the results show that GPT-4 has made a qualitative leap in mathematics compared to previous models, but it is still far from the expert level and does not have the ability to do mathematical research.

In comparison with ChatGPT, GPT-4 successfully generated the solution, while ChatGPT generated the wrong answer:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

On the AP issue, GPT-4 vs ChatGPT comparison results. GPT-4 used the correct approach, but a computational error led to the wrong final answer, while ChatGPT produced an incoherent argument.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

In addition, this article also tests GPT-4’s ability to use mathematical thinking and technology to solve real-world problems: The figure below shows how GPT-4 Successfully constructing a reasonable mathematical model for a complex system that requires extensive interdisciplinary knowledge, ChatGPT fails to make meaningful progress.

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

Since the paper is 154 pages long, this article only displays a large number of evaluation results. For more information, readers can refer to the original paper.

Finally, attach the table of contents:

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon

After completing the complete evaluation of GPT-4, Microsofts hot paper said that the first version of AGI is coming soon# #

The above is the detailed content of After completing the complete evaluation of GPT-4, Microsoft's hot paper said that the first version of AGI is coming soon. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
GPT-4接入Office全家桶!Excel到PPT动嘴就能做,微软:重新发明生产力GPT-4接入Office全家桶!Excel到PPT动嘴就能做,微软:重新发明生产力Apr 12, 2023 pm 02:40 PM

一觉醒来,工作的方式被彻底改变。微软把AI神器GPT-4全面接入Office,这下ChatPPT、ChatWord、ChatExcel一家整整齐齐。CEO纳德拉在发布会上直接放话:今天,进入人机交互的新时代,重新发明生产力。​新功能名叫Microsoft 365 Copilot(副驾驶),与改变了程序员的代码助手GitHub Copilot成为一个系列,继续改变更多人。现在AI不光能自动做PPT,而且能根据Word文档的内容一键做出精美排版。甚至连上台时对着每一页PPT应该讲什么话,都给一起安排

集成GPT-4的Cursor让编写代码和聊天一样简单,用自然语言编写代码的新时代已来集成GPT-4的Cursor让编写代码和聊天一样简单,用自然语言编写代码的新时代已来Apr 04, 2023 pm 12:15 PM

集成GPT-4的Github Copilot X还在小范围内测中,而集成GPT-4的Cursor已公开发行。Cursor是一个集成GPT-4的IDE,可以用自然语言编写代码,让编写代码和聊天一样简单。 GPT-4和GPT-3.5在处理和编写代码的能力上差别还是很大的。官网的一份测试报告。前两个是GPT-4,一个采用文本输入,一个采用图像输入;第三个是GPT3.5,可以看出GPT-4的代码能力相较于GPT-3.5有较大能力的提升。集成GPT-4的Github Copilot X还在小范围内测中,而

GPT-4的两个谣言和最新预测!GPT-4的两个谣言和最新预测!Apr 11, 2023 pm 06:07 PM

​作者 | 云昭3月9日,微软德国CTO Andreas Braun在AI kickoff会议上带来了一个期待已久的消息:“我们将于下周推出GPT-4,届时我们将推出多模式模式,提供完全不同的可能性——例如视频。”言语之中,他将大型语言模型(LLM)比作“游戏改变者”,因为他们教机器理解自然语言,然后机器以统计的方式理解以前只能由人类阅读和理解的东西。与此同时,这项技术已经发展到“适用于所有语言”:你可以用德语提问,也可以用意大利语回答。借助多模态,微软(-OpenAI)将“使模型变得全面”。那

再一次改变“AI”世界 GPT-4千呼万唤始出来再一次改变“AI”世界 GPT-4千呼万唤始出来Apr 10, 2023 pm 02:40 PM

近段时间,人工智能聊天机器人ChatGPT刷爆网络,网友们争先恐后去领略它的超高情商和巨大威力。参加高考、修改代码、构思小说……它在广大网友的“鞭策”下不断突破自我,甚至可以用一整段程序,为你拼接出一只小狗。而这些技能只是基于GPT-3.5开发而来,在3月15日,AI世界再次更新,最新版本的GPT-4也被OpenAI发布了出来。与之前相比,GPT-4不仅展现了更加强大的语言理解能力,还能够处理图像内容,在考试中的得分甚至能超越90%的人类。那么,如此“逆天”的GPT-4还具有哪些能力?它又是如何

当GPT-4反思自己错了:性能提升近30%,编程能力提升21%当GPT-4反思自己错了:性能提升近30%,编程能力提升21%Apr 04, 2023 am 11:55 AM

GPT-4 的思考方式,越来越像人了。 人类在做错事时,会反思自己的行为,避免再次出错,如果让 GPT-4 这类大型语言模型也具备反思能力,性能不知道要提高多少了。众所周知,大型语言模型 (LLM) 在各种任务上已经表现出前所未有的性能。然而,这些 SOTA 方法通常需要对已定义的状态空间进行模型微调、策略优化等操作。由于缺乏高质量的训练数据、定义良好的状态空间,优化模型实现起来还是比较难的。此外,模型还不具备人类决策过程所固有的某些品质,特别是从错误中学习的能力。不过现在好了,在最近的一篇论文

「数学天才」陶哲轩:GPT-4无法攻克一个未解决的数学问题,但对工作有帮助「数学天才」陶哲轩:GPT-4无法攻克一个未解决的数学问题,但对工作有帮助Apr 10, 2023 pm 02:21 PM

当红炸子鸡ChatGPT,也成为数学天才陶哲轩的研究工具了。近日,他在网上称自己发现了一些ChatGPT的小用例。首先,它很擅长解析代码格式的文档(在这种情况下是#arXiv搜索的API),然后返回一个正确格式的代码查询(后来它还提供了一些工作的python代码,以我要求的方式调用这个API,尽管我不得不手动安装一个包来使它运行)。其次,我让它想出一些,聪明的学生在本科线性代数课上可能会问的问题(为此我提供了一些样本题目),它给出了一些很好的例子,让我对课程可能方向,以及潜在的作业问题有所启发。

微软 Bing Chat 聊天机器人已升级使用最新 OpenAI GPT-4 技术微软 Bing Chat 聊天机器人已升级使用最新 OpenAI GPT-4 技术Apr 12, 2023 pm 10:58 PM

3 月 15 日消息,今天 OpenAI 发布了全新的 GPT-4 大型语言模型,随后微软官方宣布,Bing Chat 此前已经升级使用 OpenAI 的 GPT-4 技术。微软公司副总裁兼消费者首席营销官 Yusuf Mehdi 确认 Bing Chat 聊天机器人 AI 已经在 GPT-4 上运行,ChatGPT 基于最新版本 GPT-4,由 OpenAI 开发 。微软 Bing 博客网站上的一篇帖子进一步证实了这一消息。微软表示,如果用户在过去五周内的任何时间使用过新的 Bing 预览版,

GPT-4的早期实验,通用人工智能的火花GPT-4的早期实验,通用人工智能的火花Apr 07, 2023 pm 08:01 PM

最近,微软发布了一个长达154页的论文名称为《通用人工智能的火花,GPT-4的早期实验》。文章的主要观点是虽然GPT-4还不完整,但是已经可以被视为,一个通用人工智能的早期版本。由于全文将近7万字,本篇将论文的内容做了一下精炼和解读,有兴趣可阅读原文 https://arxiv.org/pdf/2303.12712.pdf来自微软的科学家们认为,GPT-4的智能水平已经非常接近于人类的水平,而且远超之前的诸如先前ChatGPT用的GPT-3.5这样的模型,可以将GPT-4视为通用人工智能系统,也

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
Repo: How To Revive Teammates
1 months agoBy尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

SublimeText3 Linux new version

SublimeText3 Linux new version

SublimeText3 Linux latest version

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools