


Sure enough, the only one that can beat yesterday’s OpenAI is today’s OpenAI.
Just now, OpenAI has shockingly released the large-scale multi-modal model GPT-4, which supports the input of images and text and generates text results.
Known as the most advanced AI system in history!
GPT-4 not only has the eyes to understand pictures, but also achieved almost perfect scores in all major exams including GRE, sweeping the Various benchmarks and performance indicators are overwhelming.
OpenAI spent 6 months iteratively adjusting GPT-4 using adversarial testing procedures and lessons learned from ChatGPT, thus achieving great results in terms of authenticity and controllability. Best results ever.
## Everyone still remembers that Microsoft and Google fought fiercely for three days in early February. On February 8, Microsoft released ChatGPT. At that time, Bing said it was "based on ChatGPT-like technology."
Today, the mystery was finally solved - the big model behind it is GPT-4!
Geoffrey Hinton, one of the three giants of the Turing Award, marveled at this, "After the caterpillar absorbs nutrients, it will Turn the cocoon into a butterfly. And mankind has extracted billions of gold nuggets of understanding, GPT-4, which is the butterfly of mankind."
By the way, ChatGPT Plus users can now Get started.
In a casual conversation, GPT-3.5 and The differences between GPT-4 are subtle. Only when the complexity of the task reaches a sufficient threshold does the difference emerge, with GPT-4 being more reliable, more creative, and able to handle more nuanced instructions than GPT-3.5.
To understand the differences between the two models, OpenAI was tested on various benchmarks and some mock exams designed for humans.
GPT-4 Among the various exams, there are several tests that are almost close to perfect scores:
- USABO Semifinal 2020 (United States Biology Olympiad Competition )
- GRE Writing
Additionally, OpenAI evaluated GPT-4 on traditional benchmarks designed for machine learning models. Judging from the experimental results, GPT-4 is much better than existing large-scale language models, as well as most SOTA models:
In addition, GPT-4’s performance in different languages: The accuracy of Chinese is about 80%, which is already better than the English performance of GPT-3.5.
Many existing ML benchmarks are written in English. To get a first look at GPT-4's capabilities in other languages, the researchers used Azure Translate to translate the MMLU benchmark (a set of 14,000 multiple-choice questions covering 57 topics) into multiple languages.
In 24 of the 26 languages tested, GPT-4 outperformed GPT-3.5 and other large language models (Chinchilla, PaLM) in English language performance:
OpenAI stated that it uses GPT-4 internally, so it also pays attention to the application effect of large language models in content generation, sales, and programming. Additionally, insiders use it to help humans evaluate AI output.
In this regard, Jim Fan, a disciple of Li Feifei and NVIDIA AI scientist, commented: "The strongest thing about GPT-4 is actually its reasoning ability. Its scores on GRE, SAT, and law school exams , almost no different from human candidates. In other words, GPT-4 can be admitted to Stanford on its own."
(Jim Fan himself graduated from Stanford!)
Netizen: It’s over. Once GPT-4 is released, we humans will no longer be needed...
Read pictures and do small cases, and even know the jokes better than netizens
The highlight of this upgrade of GPT-4 is of course multi-modality.
GPT-4 can not only analyze and summarize graphic icons, but can even read memes and explain where the memes are and why they are funny. In this sense, it can even kill many humans instantly.
OpenAI claims that GPT-4 is more creative and collaborative than previous models. It can generate, edit, and iterate users for creative and technical writing tasks, such as composing songs, writing screenplays, or learning the user's writing style.
GPT-4 can take images as input and generate captions, classifications, and analyses. For example, give it a picture of ingredients and ask it what it can do with these ingredients.
##In addition, GPT-4 can handle more than 25,000 words of text, allowing for long-form content creation, extended conversations, document search and analysis.
GPT-4 surpasses ChatGPT in its advanced inference capabilities. As follows:
Meme recognition
For example, show it a strange meme picture, and then ask what is funny in the picture.
After GPT-4 is obtained, it will first analyze the content of a wave of pictures and then give answers.
For example, analyze the following picture by picture.
GPT-4 immediately reacted: The "Lighting charging cable" in the picture looks like a big one The outdated VGA interface, plugged into this small and modern smartphone, has a strong contrast.
Given such a meme, where is the GPT-4 meme?
It replied fluently: The funny thing about this meme is that "the picture and text don't match".
The text clearly states that it is a photo of the Earth taken from space, but the picture is actually just a bunch of chicken nuggets arranged like a map.
GPT-4 can still understand comics: Why do we need to add layers to the neural network?
##It hits the nail on the head. This cartoon satirizes the differences between statistical learning and neural networks in improving model performance. .
Chart Analysis
Average Daily Meat for Georgia and West Asia What is the total consumption? Provide step-by-step reasoning before giving your answer.
Sure enough, GPT-4 clearly lists the steps to solve the problem——
1. Determine the average daily meat in Georgia category consumption.
2. Determine the average daily meat consumption in West Asia.
3. Add the values from steps 1 and 2.
Do physics problems
Require GPT-4 to solve Paris Comprehensive A physics question from science and engineering, the radiation detection principle of bolometer. It is worth noting that this is still a French question.
GPT-4 Start solving the problem: To answer question I.1.a, we need the temperature of each point T(x) is represented by the abscissa x of the conductive rod.
The subsequent problem-solving process is full of high energy.
Do you think this is all the capabilities of GPT-4?
Boss Greg Brockman went directly online to demonstrate. Through this video, you can intuitively feel the capabilities of GPT-4.
The most amazing thing is that GPT-4 has a strong ability to understand the code and help you generate code.
Greg directly drew a scribbled diagram on the paper, took a picture, sent it to GPT and said, write the web page code according to this layout, and then he wrote it.
In addition, if something goes wrong during the operation, throwing the error message, or even a screenshot of the error message, to GPT-4 can help. You give corresponding prompts.
Netizens said directly: GPT-4 conference will teach you step by step how to replace programmers.
By the way, you can also file taxes using GPT-4. You know, Americans spend a lot of time and money filing taxes every year.
Training process
Like the previous GPT model, the training of the GPT-4 basic model uses public Internet data and data licensed from OpenAI for the purpose of predicting the next word in a document.
The data is an internet-based corpus that includes correct/incorrect solutions to mathematical problems, weak/strong reasoning, contradictory/consistent statements, sufficient to represent A vast array of ideologies and ideas.
When the user gives a prompt to ask a question, the basic model can respond in various ways, but the answer may be far from the user's intention.
So, to align it with the user’s intent, OpenAI fine-tuned the model’s behavior using reinforcement learning based on human feedback (RLHF).
However, the model’s ability seems to come mainly from the pre-training process, and RLHF cannot improve test scores (if it is not actively reinforced, it will actually reduce test scores).
The basic model needs to prompt the project to know that it should answer the question. Therefore, the guidance of the model mainly comes from the post-training process.
A major focus of the GPT-4 model is to establish a predictably scalable deep learning stack. Because for large training like GPT-4, extensive model-specific tuning is not feasible.
Therefore, the OpenAI team developed infrastructure and optimizations that have predictable behavior at multiple scales.
To verify this scalability, the researchers accurately predicted the final loss of GPT-4 on an internal code base (not part of the training set) in advance by using the same The model trained by the method is used for inference, but the amount of calculation used is 1/10000.
OpenAI can now accurately predict metric losses optimized during training. For example, inferring from a model with a computational effort of 1/1000 and successfully predicting the pass rate of a subset of the HumanEval data set:
There are also some abilities that are still difficult to predict. For example, the Inverse Scaling competition aimed to find a metric that gets worse as the computational load of the model increases, and the hindsight neglect task was one of the winners. But GPT-4 reverses this trend:
OpenAI believes that machine learning capabilities that can accurately predict the future are important for technological security is crucial, yet it does not receive enough attention.
Now, OpenAI is investing more energy in developing related methods and calling on the industry to work together.
Contribution list
At the same time as GPT-4 was released, Open AI also disclosed the organizational structure and personnel list of GPT-4.
##Swipe up and down to view all
Professor Chen Baoquan of Peking University said,
No matter how good the movie is, no one will watch the final credits from beginning to end. Open AI’s show doesn’t even take this unusual approach. There is no doubt that this will be the list of "cast members" (contributors) that is not only the most read, but also carefully studied. The biggest attraction is the detailed classification of contributions, which is almost a rough department structure. .
This very "bold" disclosure actually has far-reaching significance. It reflects the core concept behind Open AI and also indicates the direction of future progress to a certain extent.
The above is the detailed content of The GPT-4 king is crowned! Your ability to read pictures and answer questions is amazing. You can get into Stanford by yourself.. For more information, please follow other related articles on the PHP Chinese website!
![[Ghibli-style images with AI] Introducing how to create free images with ChatGPT and copyright](https://img.php.cn/upload/article/001/242/473/174707263295098.jpg?x-oss-process=image/resize,p_40)
The latest model GPT-4o released by OpenAI not only can generate text, but also has image generation functions, which has attracted widespread attention. The most eye-catching feature is the generation of "Ghibli-style illustrations". Simply upload the photo to ChatGPT and give simple instructions to generate a dreamy image like a work in Studio Ghibli. This article will explain in detail the actual operation process, the effect experience, as well as the errors and copyright issues that need to be paid attention to. For details of the latest model "o3" released by OpenAI, please click here⬇️ Detailed explanation of OpenAI o3 (ChatGPT o3): Features, pricing system and o4-mini introduction Please click here for the English version of Ghibli-style article⬇️ Create Ji with ChatGPT

As a new communication method, the use and introduction of ChatGPT in local governments is attracting attention. While this trend is progressing in a wide range of areas, some local governments have declined to use ChatGPT. In this article, we will introduce examples of ChatGPT implementation in local governments. We will explore how we are achieving quality and efficiency improvements in local government services through a variety of reform examples, including supporting document creation and dialogue with citizens. Not only local government officials who aim to reduce staff workload and improve convenience for citizens, but also all interested in advanced use cases.

Have you heard of a framework called the "Fukatsu Prompt System"? Language models such as ChatGPT are extremely excellent, but appropriate prompts are essential to maximize their potential. Fukatsu prompts are one of the most popular prompt techniques designed to improve output accuracy. This article explains the principles and characteristics of Fukatsu-style prompts, including specific usage methods and examples. Furthermore, we have introduced other well-known prompt templates and useful techniques for prompt design, so based on these, we will introduce C.

ChatGPT Search: Get the latest information efficiently with an innovative AI search engine! In this article, we will thoroughly explain the new ChatGPT feature "ChatGPT Search," provided by OpenAI. Let's take a closer look at the features, usage, and how this tool can help you improve your information collection efficiency with reliable answers based on real-time web information and intuitive ease of use. ChatGPT Search provides a conversational interactive search experience that answers user questions in a comfortable, hidden environment that hides advertisements

In a modern society with information explosion, it is not easy to create compelling articles. How to use creativity to write articles that attract readers within a limited time and energy requires superb skills and rich experience. At this time, as a revolutionary writing aid, ChatGPT attracted much attention. ChatGPT uses huge data to train language generation models to generate natural, smooth and refined articles. This article will introduce how to effectively use ChatGPT and efficiently create high-quality articles. We will gradually explain the writing process of using ChatGPT, and combine specific cases to elaborate on its advantages and disadvantages, applicable scenarios, and safe use precautions. ChatGPT will be a writer to overcome various obstacles,

An efficient guide to creating charts using AI Visual materials are essential to effectively conveying information, but creating it takes a lot of time and effort. However, the chart creation process is changing dramatically due to the rise of AI technologies such as ChatGPT and DALL-E 3. This article provides detailed explanations on efficient and attractive diagram creation methods using these cutting-edge tools. It covers everything from ideas to completion, and includes a wealth of information useful for creating diagrams, from specific steps, tips, plugins and APIs that can be used, and how to use the image generation AI "DALL-E 3."

Unlock ChatGPT Plus: Fees, Payment Methods and Upgrade Guide ChatGPT, a world-renowned generative AI, has been widely used in daily life and business fields. Although ChatGPT is basically free, the paid version of ChatGPT Plus provides a variety of value-added services, such as plug-ins, image recognition, etc., which significantly improves work efficiency. This article will explain in detail the charging standards, payment methods and upgrade processes of ChatGPT Plus. For details of OpenAI's latest image generation technology "GPT-4o image generation" please click: Detailed explanation of GPT-4o image generation: usage methods, prompt word examples, commercial applications and differences from other AIs Table of contents ChatGPT Plus Fees Ch

How to use ChatGPT to streamline your design work and increase creativity This article will explain in detail how to create a design using ChatGPT. We will introduce examples of using ChatGPT in various design fields, such as ideas, text generation, and web design. We will also introduce points that will help you improve the efficiency and quality of a variety of creative work, such as graphic design, illustration, and logo design. Please take a look at how AI can greatly expand your design possibilities. table of contents ChatGPT: A powerful tool for design creation


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

SublimeText3 English version
Recommended: Win version, supports code prompts!

Safe Exam Browser
Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Dreamweaver CS6
Visual web development tools

Atom editor mac version download
The most popular open source editor
