It doesn't matter if you don't know how to use Photoshop: AI collage technology can already make the fake look real

This article is reprinted with the authorization of AI New Media Qubit (public account ID: QbitAI). Please contact the source for reprinting.

This is a seemingly ordinary Japanese bento.

But would you believe that every compartment of food was actually Photoshopped in, and that the source images looked like this:

△ Cut out the images and paste them in directly, and the result looks fake at a glance

The one behind it is not a Photoshop expert but an AI with a very literal name: Collage Diffusion.

Just hand it a few small images, and the AI works out what each one contains and then blends the elements naturally into one large picture, with nothing that looks fake at first glance.

The effect amazed many netizens.

Some Photoshop enthusiasts even said outright:

This is simply a godsend... I hope to see it in Automatic1111 soon (a web UI commonly used by Stable Diffusion users, which also has a plug-in version integrated into Photoshop).

Why is the effect so natural?

In fact, the AI generated several versions of the "Japanese bento", and all of them look natural.

Why are there multiple versions? Because users can customize the result: they can fine-tune individual details without the overall picture going off the rails.

Besides the "Japanese bento", it has produced many other impressive works.

For example, this is the material given to the AI, with obvious signs of cut-and-paste editing:

And this is the picture the AI put together. I, for one, cannot spot any signs of editing:

In the past two years, text-to-image diffusion models have become really popular; DALL·E 2 and Imagen are both applications built on them. The advantage of these diffusion models is that the generated images are diverse and of high quality.

However, text can at best act as a loose constraint on the target image, so users usually have to spend a lot of time adjusting prompts, and must also pair them with additional control components to get good results.

Take the Japanese bento shown above as an example:

If the user only enters "a bento box containing rice, edamame, ginger and sushi", the prompt says neither where each food is placed nor what each food looks like. But to spell all of that out, the user might have to write a short essay...

In view of this, the Stanford team decided to start from another angle.

They decided to borrow a traditional idea and generate the final image by collage, and developed a new diffusion model on that basis.

What's interesting is that, to put it bluntly, this model itself can be considered "pieced together" from classic techniques.

The first is layering: a layer-based image editing UI is used to decompose the source images into RGBA layers (R, G, and B stand for red, green, and blue; A stands for transparency). These layers are then arranged on a canvas, and each layer is paired with a text prompt.

Through layering, individual elements in the image can be modified.
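The layered setup described above can be sketched in a few lines of Python. This is only an illustration, not the team's code: the names `Layer`, `over`, and `flatten`, and the sample pixel values, are assumptions made for this example. It shows a layer as RGBA pixels plus a canvas position and a text prompt, and the standard "over" alpha-compositing operator for stacking layers on a canvas.

```python
from dataclasses import dataclass

Pixel = tuple[float, float, float, float]  # (R, G, B, A), each in [0, 1]


@dataclass
class Layer:
    """Hypothetical collage element: RGBA pixels, canvas position, text prompt."""
    pixels: list[list[Pixel]]  # rows of RGBA pixels
    top: int                   # canvas row of the layer's top-left corner
    left: int                  # canvas column of the layer's top-left corner
    prompt: str                # text paired with this layer


def over(src: Pixel, dst: Pixel) -> Pixel:
    """Standard 'over' operator: draw src on top of dst (non-premultiplied alpha)."""
    sr, sg, sb, sa = src
    dr, dg, db, da = dst
    out_a = sa + da * (1.0 - sa)
    if out_a == 0.0:
        return (0.0, 0.0, 0.0, 0.0)
    weight = da * (1.0 - sa)  # how much of dst still shows through src
    return (
        (sr * sa + dr * weight) / out_a,
        (sg * sa + dg * weight) / out_a,
        (sb * sa + db * weight) / out_a,
        out_a,
    )


def flatten(layers: list[Layer], height: int, width: int) -> list[list[Pixel]]:
    """Composite layers bottom-to-top onto a transparent canvas."""
    canvas = [[(0.0, 0.0, 0.0, 0.0)] * width for _ in range(height)]
    for layer in layers:
        for r, row in enumerate(layer.pixels):
            for c, px in enumerate(row):
                y, x = layer.top + r, layer.left + c
                if 0 <= y < height and 0 <= x < width:
                    canvas[y][x] = over(px, canvas[y][x])
    return canvas


# Made-up example: an opaque white 2x2 "rice" patch with a half-transparent
# pink "ginger" pixel placed on its lower-right corner.
rice = Layer([[(1.0, 1.0, 1.0, 1.0)] * 2] * 2, top=0, left=0, prompt="white rice")
ginger = Layer([[(1.0, 0.5, 0.5, 0.5)]], top=1, left=1, prompt="pickled ginger")
collage = flatten([rice, ginger], height=2, width=3)
```

Pasting layers together this way is exactly the "cut out and paste" step that looks fake on its own; the point of the sketch is only to make concrete what a layer carries (pixels, transparency, position, prompt).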

Layering is a mature technique in computer graphics, but previously the layer information was generally flattened into a single image as the output.

In this new "collage diffusion model", the layer information becomes the input for subsequent operations.
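To see what is lost when layers are flattened, consider the small sketch below. The article does not say how the model consumes the layers, so this is not a description of its method; `Layer`, `owner_map`, and the `threshold` parameter are hypothetical names used only for illustration. The sketch records, for each canvas pixel, which layer sits on top, so the prompt paired with that layer can still be looked up. A single flattened image no longer carries this information.

```python
from dataclasses import dataclass


@dataclass
class Layer:
    """Hypothetical layer: per-pixel opacity, canvas position, text prompt."""
    alpha: list[list[float]]  # opacity in [0, 1] for each pixel
    top: int
    left: int
    prompt: str


def owner_map(layers, height, width, threshold=0.5):
    """For each canvas pixel, index of the topmost covering layer (-1 if none)."""
    owners = [[-1] * width for _ in range(height)]
    for index, layer in enumerate(layers):  # later layers sit on top
        for r, row in enumerate(layer.alpha):
            for c, a in enumerate(row):
                y, x = layer.top + r, layer.left + c
                if a >= threshold and 0 <= y < height and 0 <= x < width:
                    owners[y][x] = index
    return owners


# Made-up example: rice covers two pixels, ginger overlaps the second one.
layers = [
    Layer([[1.0, 1.0]], top=0, left=0, prompt="white rice"),
    Layer([[1.0]], top=0, left=1, prompt="pickled ginger"),
]
owners = owner_map(layers, height=1, width=3)
prompts = [layers[i].prompt if i >= 0 else None for i in owners[0]]
```

Keeping the layers separate means each region of the canvas stays tied to its own source image and text, which is the kind of structure a single prompt for the whole picture cannot express.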

In addition to layering, the model is paired with existing diffusion-based image harmonization techniques to improve the visual quality of the image.

In short, the algorithm restricts changes to certain attributes of objects (such as visual features) while allowing other attributes (orientation, lighting, perspective, occlusion) to change.

This balances fidelity to the source against naturalness, producing pictures that are similar "in spirit" and show nothing out of place.

It is also very easy to use. In interactive editing mode, users can create a collage in a few minutes.

They can customize the spatial arrangement of the scene (that is, place the images taken from elsewhere in suitable positions), and they can also adjust the individual components of the generated image. The same source images can yield different results.

△ The rightmost column is the output of this AI

In non-interactive mode (that is, the user does not arrange the collage but simply hands a pile of small images to the AI), the AI can automatically create a large, natural-looking picture from the small images it receives.

Research Team

Finally, let's talk about the research team behind it. They are a group of faculty and students from the Computer Science Department of Stanford University.

The paper's first author, Vishnu Sarukkai, is currently a graduate student in the Department of Computer Science at Stanford, where he is pursuing a combined master's and Ph.D. program.

His main research directions are: computer graphics, computer vision and machine learning.

In addition, Linden Li, a co-author of the paper, is also a graduate student in the Department of Computer Science at Stanford.

While studying, he interned at NVIDIA for four months, where he worked with NVIDIA's deep learning research team and helped train a vision Transformer model with an additional 100M parameters.

Paper address: https://arxiv.org/abs/2303.00262

Statement
This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to have it removed.