
OpenAI releases a new consistency model: GAN-level speed at 18 FPS, generating high-quality images in real time

WBOY (forwarded) · 2023-04-22 09:58

The popularity of ChatGPT and Midjourney has pushed the diffusion model, the technology behind image generators like Midjourney, to the foundation of the "generative AI" revolution.

It is highly sought after by researchers in industry, and its popularity now far exceeds that of the GAN, which once swept the field.

Just as diffusion models were at the height of their power, some netizens loudly declared:

The era of diffusion models is over! Consistency models are the new king!


What on earth is going on?

It turns out that OpenAI published the high-profile paper "Consistency Models" in March, and released the model weights on GitHub today.


Paper address: https://arxiv.org/abs/2303.01469

Project address: https://github.com/openai/consistency_models

"Consistency Model" in training speed It subverts the diffusion model and can "generate in one step", completing simple tasks an order of magnitude faster than the diffusion model, and using 10-2000 times less calculations.


So, how fast is this?

Some netizens noted that this is equivalent to generating 64 images at 256×256 resolution in about 3.5 seconds, which works out to

18 images per second!
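A quick sanity check on that figure (a trivial sketch; the 3.5-second timing comes from the tweet above, not from an independent benchmark):

```python
# Back-of-the-envelope throughput check using the numbers quoted above.
images = 64      # batch of 256x256 samples
seconds = 3.5    # reported wall-clock time
print(f"{images / seconds:.1f} images per second")  # ~18.3
```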


Moreover, one of the main advantages of the new model is that it achieves high-quality samples without any "adversarial training".

This research was written by Ilya Sutskever, a student of Turing Award winner Geoffrey Hinton and a key driving force behind AlexNet, together with Mark Chen and Prafulla Dhariwal, who developed DALL-E 2. You can imagine how hard-core the content is.

Some netizens even said that the "consistency model" is the research direction of the future, and that one day we will look back and laugh at the diffusion model.

So, will the diffusion model disappear?

Faster, stronger, and no adversarial training needed

The paper is currently an unfinalized preprint, and the research is still ongoing.

In 2021, OpenAI CEO Sam Altman wrote a blog post arguing that Moore's Law should apply to everything.


Not long ago, Altman talked publicly about artificial intelligence on Twitter, saying that AI is making leapfrog progress: "A new version of Moore's Law may soon appear, with the amount of intelligence in the universe doubling every 18 months."


To others, Altman’s optimism may seem unfounded.

But the latest research conducted by the team led by OpenAI’s chief scientist Ilya Sutskever provides strong support for Altman’s claim.


2022 has been called the breakout year of AIGC, in large part because so many models were built on the diffusion model.

As it grew in popularity, the diffusion model gradually displaced the GAN and became the most effective image generation approach in industry today; DALL·E 2 and Google's Imagen, for example, are both diffusion models.

However, the newly proposed "consistency model" has been shown to produce content of the same quality as a diffusion model in far less time.

This is because the "consistency model" uses a single-step generation process, similar to a GAN.

In contrast, the diffusion model uses a repeated sampling process to gradually eliminate noise in the image.

Although impressive, this approach relies on hundreds to thousands of sampling steps to achieve good results, which makes it both expensive to run and slow.
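To make the contrast concrete, here is a minimal sketch of the two sampling styles. It assumes hypothetical trained networks `score_model` (a denoiser driving a diffusion sampler) and `consistency_model` (a consistency function); the actual APIs in openai/consistency_models differ.

```python
import torch

def diffusion_sample(score_model, sigmas, shape):
    """Iterative sampling: one network call per noise level (Euler steps along the PF ODE)."""
    x = torch.randn(shape) * sigmas[0]                   # start from pure noise at sigma_max
    for sigma, sigma_next in zip(sigmas[:-1], sigmas[1:]):
        denoised = score_model(x, sigma)                 # network predicts the clean image
        d = (x - denoised) / sigma                       # ODE direction at this noise level
        x = x + d * (sigma_next - sigma)                 # step toward the next (smaller) level
    return x                                             # hundreds of calls in practice

def consistency_sample(consistency_model, sigma_max, shape):
    """Single-step sampling: one network call maps noise straight to an image."""
    x_T = torch.randn(shape) * sigma_max
    return consistency_model(x_T, sigma_max)
```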


The diffusion model's continuous, iterative generation process consumes 10 to 2,000 times more computation than the "consistency model", which also makes inference slow.

The power of the "consistency model" lies in its ability to trade off between sample quality and computational cost when necessary.
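The paper describes a simple way to spend extra compute for better samples: alternate between denoising with the consistency function and re-adding a smaller amount of noise. Below is a minimal sketch of that multistep procedure, with `f`, `taus`, `sigma_max`, and `eps` as assumed inputs rather than the repository's real interface.

```python
import torch

def multistep_consistency_sample(f, taus, sigma_max, eps, shape):
    """More steps -> better quality, at the cost of more network calls.
    `f(x, sigma)` is a hypothetical consistency model returning a clean image;
    `taus` is a short, decreasing list of intermediate noise levels in (eps, sigma_max)."""
    x = f(torch.randn(shape) * sigma_max, sigma_max)     # one-step sample first
    for tau in taus:
        z = torch.randn(shape)
        x_tau = x + (tau**2 - eps**2) ** 0.5 * z         # re-noise to level tau
        x = f(x_tau, tau)                                # denoise again in one call
    return x
```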

Additionally, the model can perform zero-shot data editing tasks such as image inpainting, colorization, and stroke-guided image editing.


Zero-shot image editing using a consistency model trained by distillation on LSUN Bedroom 256×256
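As an illustration of how such zero-shot editing can work, here is a rough inpainting sketch. It is not the paper's exact algorithm, just the common trick of re-imposing the known pixels after every denoising step; `f`, `taus`, `sigma_max`, and `eps` are assumed, as above.

```python
import torch

def zero_shot_inpaint(f, y, mask, taus, sigma_max, eps):
    """Illustrative zero-shot inpainting: keep pixels where mask == 1 from the
    reference image `y`, and let the consistency model fill in the rest."""
    x = f(torch.randn_like(y) * sigma_max, sigma_max)
    x = mask * y + (1 - mask) * x                        # keep the known region
    for tau in taus:
        z = torch.randn_like(y)
        x_tau = x + (tau**2 - eps**2) ** 0.5 * z         # re-noise
        x = f(x_tau, tau)                                # denoise in one call
        x = mask * y + (1 - mask) * x                    # re-impose the constraint
    return x
```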

The "Consistency Model" also converts data into noise when using mathematical equations and ensures that the resulting output is consistent for similar data points, thereby enabling them to smooth transition.

This type of equation is called the "probability flow ordinary differential equation" (Probability Flow ODE).


The study calls these models "consistency models" because they maintain this self-consistency between their inputs and outputs.
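In the notation of the paper (with noise level σ(t) = t on the interval [ε, T]), the probability flow ODE and the self-consistency property it induces can be written as:

```latex
% Probability flow ODE in the parametrization used by the paper:
\frac{\mathrm{d}\mathbf{x}_t}{\mathrm{d}t} \;=\; -\,t\,\nabla_{\mathbf{x}}\log p_t(\mathbf{x}_t),
\qquad t \in [\epsilon, T].

% A consistency function maps any point of an ODE trajectory back to its starting point,
% so all points on the same trajectory share one output (self-consistency),
% with the boundary condition that it is the identity at t = \epsilon:
f(\mathbf{x}_t, t) \;=\; f(\mathbf{x}_{t'}, t') \quad \forall\, t, t' \in [\epsilon, T],
\qquad f(\mathbf{x}_\epsilon, \epsilon) \;=\; \mathbf{x}_\epsilon .
```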

These models can be trained in either distillation mode or isolation mode.

In distillation mode, the model distills knowledge from a pre-trained diffusion model, enabling it to generate in a single step.

In isolation mode, the model does not depend on a diffusion model at all, making it a completely independent model.
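As a rough idea of what "isolation mode" training looks like, here is a minimal sketch loosely following the consistency-training procedure described in the paper: a student network and an EMA "target" network are asked to agree across two adjacent noise levels of the same image, using the same Gaussian sample. `model` and `ema_model` are hypothetical networks with the interface f(x, sigma); the real training code in the repository is considerably more involved (LPIPS losses, noise-level schedules, and careful parametrization).

```python
import torch
import torch.nn.functional as F

def consistency_training_step(model, ema_model, optimizer, x, sigmas, mu=0.95):
    """One 'isolation mode' (consistency training) step on a clean batch x.
    `sigmas` is an increasing list of noise levels from ~eps up to sigma_max."""
    n = torch.randint(0, len(sigmas) - 1, (1,)).item()    # pick adjacent noise levels
    sigma_n, sigma_np1 = sigmas[n], sigmas[n + 1]
    z = torch.randn_like(x)                               # one shared Gaussian sample

    pred = model(x + sigma_np1 * z, sigma_np1)            # student at the noisier level
    with torch.no_grad():
        target = ema_model(x + sigma_n * z, sigma_n)      # EMA target at the cleaner level

    loss = F.mse_loss(pred, target)                       # the paper prefers an LPIPS metric
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

    with torch.no_grad():                                 # EMA update of the target network
        for p_t, p in zip(ema_model.parameters(), model.parameters()):
            p_t.mul_(mu).add_(p, alpha=1 - mu)
    return loss.item()
```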


It is worth noting that both training methods do away with "adversarial training".

Admittedly, adversarial training can indeed produce a more robust neural network, but the process is roundabout: it introduces a set of misclassified adversarial samples and then retrains the target network on them with the correct labels.

As a result, adversarial training also leads to a slight drop in the prediction accuracy of deep learning models, and it can even bring unexpected side effects in robotics applications.
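For context, here is a minimal FGSM-style sketch of the adversarial-training loop described above (generate misclassified samples, then retrain on them with the correct labels). `classifier` is any hypothetical image classifier with inputs in [0, 1]; this is only for illustration and is unrelated to the consistency-model training code.

```python
import torch
import torch.nn.functional as F

def adversarial_training_step(classifier, optimizer, x, y, eps=8 / 255):
    """One adversarial-training step: craft FGSM examples, then train on them."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(classifier(x_adv), y)
    loss.backward()                                        # gradient w.r.t. the input
    x_adv = (x_adv + eps * x_adv.grad.sign()).clamp(0, 1).detach()  # adversarial samples

    optimizer.zero_grad()
    adv_loss = F.cross_entropy(classifier(x_adv), y)       # retrain with the true labels
    adv_loss.backward()
    optimizer.step()
    return adv_loss.item()
```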

Experimental results show that the distillation technique used to train "consistency models" outperforms the distillation techniques used for diffusion models.

The "Consistency Model" achieved the latest state-of-the-art FID scores of 3.55 and 6.20 on the CIFAR10 image set and ImageNet 64x64 data set, respectively.


In short, it achieves the quality of a diffusion model at the speed of a GAN: the best of both worlds.

In February, Sutskever posted a tweet suggesting that:

Many people believe that great AI progress must involve some new "idea." But that is not the case: many of AI's greatest advances have come in the form of a familiar, humble idea that, executed well, becomes incredible.


The latest research proves just that: tweaking an old concept can change everything.

Author Introduction

As co-founder and chief scientist of OpenAI, Ilya Sutskever needs no introduction; just take a look at this group photo of "top performers".

(Ilya Sutskever is at the far right of the group photo.)

Yang Song (Song Yang)


Song Yang, the first author of the paper, is a research scientist at OpenAI.

Previously, he received a bachelor's degree in mathematics and physics from Tsinghua University and a master's and doctorate in computer science from Stanford University. In addition, he has interned at Google Brain, Uber ATG, and Microsoft Research.

As a machine learning researcher, he focuses on developing scalable methods to model, analyze and generate complex high-dimensional data. His interests span multiple areas, including generative modeling, representation learning, probabilistic reasoning, artificial intelligence security, and AI for science.

Mark Chen


Mark Chen is the head of multimodal and frontier research at OpenAI, and he is also a coach of the US Computing Olympiad team.

Previously, he earned a bachelor's degree in mathematics and computer science from MIT and worked as a quantitative trader at several proprietary trading firms, including Jane Street Capital.

After joining OpenAI, he led the team to develop DALL-E 2 and introduced vision into GPT-4. In addition, he led the development of Codex, participated in the GPT-3 project, and created Image GPT.

Prafulla Dhariwal


Prafulla Dhariwal is a research scientist at OpenAI working on generative models and self-supervised learning. Before that, he was an undergraduate at MIT, studying computer science, mathematics, and physics.

Interestingly, the finding that diffusion models can beat GANs at image generation came from his NeurIPS 2021 paper.


Netizens: finally back to being "Open" AI

OpenAI open-sourced the consistency model today.


Finally back to being "Open" AI.


Faced with so many crazy breakthroughs and announcements every day, netizens asked: should we take a break, or speed up?


Compared with diffusion models, this will save researchers a significant amount of model-training cost.


Some netizens also suggested future use cases for the "consistency model": real-time editing, NeRF rendering, and real-time game rendering.

There is no demo yet, but what is certain is that image generation speed can be greatly improved, and speed is always a winner.

We upgraded directly from dial-up to broadband.


Brain-computer interface, plus ultra-realistic images generated in almost real time.



Statement: This article is reproduced from 51cto.com.