


OpenAI releases a new consistency model, GAN speed reaches 18FPS, and can generate high-quality images in real time.
The popularity of ChatGPT and Midjourney has made the technology diffusion model behind them the foundation of the “generative AI” revolution.
Even, it is highly sought after by researchers in the industry, and its popularity far exceeds that of GAN, which once attacked the world.
Just when diffusion models were at their most powerful, some netizens suddenly announced in a high profile:
The era of Diffusion models is over! Consistency models are crowned king!
what on earth is it? ? ?
It turns out that OpenAI released a blockbuster and valuable paper "Consistency Models" in March, and released the model weights on GitHub today.
##Paper address: https://arxiv.org/abs/2303.01469
Project address: https://github.com/openai/consistency_models
"Consistency Model" in training speed It subverts the diffusion model and can "generate in one step", completing simple tasks an order of magnitude faster than the diffusion model, and using 10-2000 times less calculations.
Some netizens said that it is equivalent to generating 64 images with a resolution of 256x256 in about 3.5 seconds, which is
18 images per second!
##This research
Some netizens even said that the “consistency model” is the future research direction. I believe we will definitely laugh at the diffusion model in the future.
So, the diffusion model also disappears?
Faster, stronger, no need for confrontation
Currently, this paper is still an unfinalized version, and research is still ongoing.
In 2021, OpenAI CEO Sam Altman wrote a blog discussing how Moore’s Law should be applied to all fields.
## Altman publicly talked about artificial intelligence on Twitter some time ago and said that artificial intelligence is achieving "leapfrog". He said, "A new version of Moore's Law may soon appear, with the number of intelligences in the universe doubling every 18 months."
To others, Altman’s optimism may seem unfounded.
But the latest research conducted by the team led by OpenAI’s chief scientist Ilya Sutskever provides strong support for Altman’s claim.
It is said that 2022 is the first year of AIGC, because many models are based on the diffusion model.
The popularity of the diffusion model gradually replaced GAN and became the most effective image generation model in the current industry. For example, DALL.E 2 and Google Imagen are both diffusion models.
However, the newly proposed "consistency model" has been proven to be able to output the same quality content as the diffusion model in a shorter time.
This is because this "consistency model" uses a single-step generation process similar to GAN.
In contrast, the diffusion model uses a repeated sampling process to gradually eliminate noise in the image.
This method, although impressive, relies on performing hundreds to thousands of steps to achieve good results, which is not only expensive to operate, but also slow.
The continuous iterative generation process of the diffusion model consumes 10-2000 more calculations than the "consistency model" times, even slowing down inference during training.
The power of the "Consistency Model" lies in its ability to make a trade-off between sample quality and computing resources when necessary.
Additionally, this model is capable of performing zero-shot data editing tasks such as image patching, colorization, or stroke-guided image editing.
Zero-shot image editing using a consensus model trained by distillation on LSUN Bedroom 256^256
The "Consistency Model" also converts data into noise when using mathematical equations and ensures that the resulting output is consistent for similar data points, thereby enabling them to smooth transition.
This type of equation is called "Probability Flow Ordinary Differential Equation" (Probability Flow ODE).
This study named such models "consistency" because they maintain this self-consistency between input data and output data.
These models can be trained in either distillation mode or isolation mode.
In distillation mode, the model is able to extract data from a pre-trained diffusion model, enabling it to be executed in a single step.
In detached mode, the model does not depend on the diffusion model at all, making it a completely independent model.
It is worth noting that both training methods remove "adversarial training" from them.
I have to admit that adversarial training will indeed produce a more powerful neural network, but the process is more circuitous. That is, it introduces a set of misclassified adversarial samples and then retrains the target neural network with the correct labels.
Therefore, adversarial training will also lead to a slight decrease in the accuracy of deep learning model predictions, and it may even bring unexpected side effects in robotic applications.
Experimental results show that the distillation technique used to train the "consistency model" is better than that used for the diffusion model.
The "Consistency Model" achieved the latest state-of-the-art FID scores of 3.55 and 6.20 on the CIFAR10 image set and ImageNet 64x64 data set, respectively.
#This is simply realized, diffusion model The quality of GANs, the speed, is doubly perfect.
In February, Sutskever posted a tweet suggesting that
Many people believe that great AI progress must include a new "idea." But that’s not the case: Many of AI’s greatest advances have come in the form of, well, that familiar humble idea that, if done well, becomes incredible.The latest research proves just that, and tweaking an old concept can change everything.
Author IntroductionAs the co-founder and chief scientist of OpenAI, Ilya Sutskever No need to go into details, just take a look at this group photo of the "top performers".
## (far right of the picture)
Yang Song (Song Yang)Previously, he received a bachelor's degree in mathematics and physics from Tsinghua University and a master's and doctorate in computer science from Stanford University. In addition, he has interned at Google Brain, Uber ATG, and Microsoft Research. As a machine learning researcher, he focuses on developing scalable methods to model, analyze and generate complex high-dimensional data. His interests span multiple areas, including generative modeling, representation learning, probabilistic reasoning, artificial intelligence security, and AI for science. Mark Chen is the head of OpenAI’s multimodal and cutting-edge research department, He is also the coach of the U.S. Computer Olympiad team. Previously, he earned a bachelor's degree in mathematics and computer science from MIT and worked as a quantitative trader at several proprietary trading firms, including Jane Street Capital. After joining OpenAI, he led the team to develop DALL-E 2 and introduced vision into GPT-4. In addition, he led the development of Codex, participated in the GPT-3 project, and created Image GPT. ##Prafulla Dhariwal is a Research Scientist at OpenAI, working on generative models and autonomous Supervised learning. Before that, he was an undergraduate at MIT, studying computing, mathematics, and physics. Interestingly, the diffusion model can beat GAN in the field of image generation, which was what he proposed in the 2021 NeurIPS paper. Mark Chen
Prafulla Dhariwal
OpenAI opened the source code of the consistency model today .
Finally back to Open AI.
Faced with so many crazy breakthroughs and announcements every day. Netizens asked: Should we take a break or speed up?
This will significantly save researchers the cost of training models compared to diffusion models.
Some netizens also gave future use cases of the "consistency model": real-time editing, NeRF rendering, real-time games render.
There is currently no demo demonstration, but it is worth confirming that the speed of image generation can be greatly improved and is always the winner.
We upgraded directly from dial-up to broadband.
Brain-computer interface, plus ultra-realistic images generated in almost real time.
The above is the detailed content of OpenAI releases a new consistency model, GAN speed reaches 18FPS, and can generate high-quality images in real time.. For more information, please follow other related articles on the PHP Chinese website!
![Can't use ChatGPT! Explaining the causes and solutions that can be tested immediately [Latest 2025]](https://img.php.cn/upload/article/001/242/473/174717025174979.jpg?x-oss-process=image/resize,p_40)
ChatGPT is not accessible? This article provides a variety of practical solutions! Many users may encounter problems such as inaccessibility or slow response when using ChatGPT on a daily basis. This article will guide you to solve these problems step by step based on different situations. Causes of ChatGPT's inaccessibility and preliminary troubleshooting First, we need to determine whether the problem lies in the OpenAI server side, or the user's own network or device problems. Please follow the steps below to troubleshoot: Step 1: Check the official status of OpenAI Visit the OpenAI Status page (status.openai.com) to see if the ChatGPT service is running normally. If a red or yellow alarm is displayed, it means Open

On 10 May 2025, MIT physicist Max Tegmark told The Guardian that AI labs should emulate Oppenheimer’s Trinity-test calculus before releasing Artificial Super-Intelligence. “My assessment is that the 'Compton constant', the probability that a race to

AI music creation technology is changing with each passing day. This article will use AI models such as ChatGPT as an example to explain in detail how to use AI to assist music creation, and explain it with actual cases. We will introduce how to create music through SunoAI, AI jukebox on Hugging Face, and Python's Music21 library. Through these technologies, everyone can easily create original music. However, it should be noted that the copyright issue of AI-generated content cannot be ignored, and you must be cautious when using it. Let’s explore the infinite possibilities of AI in the music field together! OpenAI's latest AI agent "OpenAI Deep Research" introduces: [ChatGPT]Ope

The emergence of ChatGPT-4 has greatly expanded the possibility of AI applications. Compared with GPT-3.5, ChatGPT-4 has significantly improved. It has powerful context comprehension capabilities and can also recognize and generate images. It is a universal AI assistant. It has shown great potential in many fields such as improving business efficiency and assisting creation. However, at the same time, we must also pay attention to the precautions in its use. This article will explain the characteristics of ChatGPT-4 in detail and introduce effective usage methods for different scenarios. The article contains skills to make full use of the latest AI technologies, please refer to it. OpenAI's latest AI agent, please click the link below for details of "OpenAI Deep Research"

ChatGPT App: Unleash your creativity with the AI assistant! Beginner's Guide The ChatGPT app is an innovative AI assistant that handles a wide range of tasks, including writing, translation, and question answering. It is a tool with endless possibilities that is useful for creative activities and information gathering. In this article, we will explain in an easy-to-understand way for beginners, from how to install the ChatGPT smartphone app, to the features unique to apps such as voice input functions and plugins, as well as the points to keep in mind when using the app. We'll also be taking a closer look at plugin restrictions and device-to-device configuration synchronization

ChatGPT Chinese version: Unlock new experience of Chinese AI dialogue ChatGPT is popular all over the world, did you know it also offers a Chinese version? This powerful AI tool not only supports daily conversations, but also handles professional content and is compatible with Simplified and Traditional Chinese. Whether it is a user in China or a friend who is learning Chinese, you can benefit from it. This article will introduce in detail how to use ChatGPT Chinese version, including account settings, Chinese prompt word input, filter use, and selection of different packages, and analyze potential risks and response strategies. In addition, we will also compare ChatGPT Chinese version with other Chinese AI tools to help you better understand its advantages and application scenarios. OpenAI's latest AI intelligence

These can be thought of as the next leap forward in the field of generative AI, which gave us ChatGPT and other large-language-model chatbots. Rather than simply answering questions or generating information, they can take action on our behalf, inter

Efficient multiple account management techniques using ChatGPT | A thorough explanation of how to use business and private life! ChatGPT is used in a variety of situations, but some people may be worried about managing multiple accounts. This article will explain in detail how to create multiple accounts for ChatGPT, what to do when using it, and how to operate it safely and efficiently. We also cover important points such as the difference in business and private use, and complying with OpenAI's terms of use, and provide a guide to help you safely utilize multiple accounts. OpenAI


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft

Notepad++7.3.1
Easy-to-use and free code editor

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool

EditPlus Chinese cracked version
Small size, syntax highlighting, does not support code prompt function
