What are the trends in the development of artificial intelligence in 2022?
#Be sure to mention the rise of “multi-modal AI”, especially text-to-image generation tools.
From DALL-E to Imagen, Parti, Nuwa, etc., they can all produce high-quality images that are amazing.
The most typical example of this is OpenAI’s Dall-E2.
Since Dall-E came out, you may have seen it generate many painting-style pictures, such as astronauts riding horses in space.
However, there are very few images that express abstract concepts through Dall-E.
No, Gabriele Sgroi, a machine learning scientist, came to explore how DALL-E accomplishes this task.
He tested oil pastels and painting styles on themes of sadness, love, anger, happiness, justice and injustice.
Oil Pastel Style
Sadness
Anger
## Happiness
like
Sadness
##爱
#ANGER
happiness
Justice
Injustice
##Gabriele Sgroi believes that painting will be more Be insightful rather than limiting emotional images to people's facial expressions.
#All images in this article (including the cover image) were generated using DALL-E to select all images provided by the first generation from a given prompt.
It can be seen from these examples that although a given emotion is not always clearly identifiable, DALL-E has an overall strong sense of style in painting. Show more abstract and complex pictures.
Among them, most of the pictures representing justice depict a Greek goddess, but the images representing injustice are really confusing.
# Overall, Sgroi observed that the results depend heavily on the chosen style.
#And in most cases, DALL-E will write the name of the emotion on the resulting drawing.
Overall, DALL-E appears to show a level of understanding of the emotions tested, correctly relating them to facial expressions and the colors or symbols typically associated with them pair.
Sgroi said it would be interesting to further investigate the differences in representations of the same emotions across styles and to examine whether the observed bias between positive and negative emotions holds true in other examples. still exists, it will be interesting.
Did DALL-E fail?Ironically, DALL-E 2 claims to be good at understanding the text prompts used to generate images.
#However, some netizens discovered that when the text cannot be understood currently, the text content will be placed in the generated image.
#For example, a painting "This is Not a Pipe" by the artist Rene Magritte.
There is also an artificial intelligence Janelle Shane who asked DALL-E 2 to generate a company logo, but found that no picture could Spell the words correctly.
Waffle House generation example
Also , you could say the DALL-E 2 understands some scientific laws.
#Because it can easily depict falling objects, or astronauts floating in space.
#However, if you want to generate an anatomy, an X-ray image, a mathematical proof, or a blueprint, the resulting image may be superficially correct, but fundamentally All wrong.
#For example, in the picture of the solar system drawn to scale, it can be said to be a mess, with the shape of the earth in the lower left corner and an object similar to a poached egg in the upper left corner.
It tries to make up something visually similar without understanding the meaning, explained OpenAI researcher Aditya Ramesh.
So DALL-E 2 doesn’t know what science is, it only knows how to read text and draw illustrations.
And when DALL-E 2 generates human faces, they are so realistic that it is almost unbelievable.
During training, OpenAI introduced deepfake protection measures to prevent it from remembering faces that often appear on the Internet.
#If the uploaded image contains real faces, even unknown people, the system will refuse to generate the content.
#However, another problem arises, OpenAI said that the system is optimized for images with a single focus of attention
For example, the generation of a detailed portrait of "an astronaut staring at the earth with a longing expression on his face" is very successful.
However, when DALL-E was asked to generate images of multiple people at once, it crashed directly. So it gets really bad at generating group shots and crowd scenes.
In addition, DALL-E will also generate some biased images.
#Currently, the OpenAI team has begun to correct biases through machine learning.
For example, during the training of DALL-E 2, the researchers adjusted the training method and increased the weight of female images so they were more likely to be generated .
DALL-E will bring more surprises in the future.
The above is the detailed content of Can AI map emotions? See how DALL-E expresses abstraction. For more information, please follow other related articles on the PHP Chinese website!

This article explains the Term Frequency-Inverse Document Frequency (TF-IDF) technique, a crucial tool in Natural Language Processing (NLP) for analyzing textual data. TF-IDF surpasses the limitations of basic bag-of-words approaches by weighting te

Unleash the Power of AI Agents with LangChain: A Beginner's Guide Imagine showing your grandmother the wonders of artificial intelligence by letting her chat with ChatGPT – the excitement on her face as the AI effortlessly engages in conversation! Th

Mistral Large 2: A Deep Dive into Mistral AI's Powerful Open-Source LLM Meta AI's recent release of the Llama 3.1 family of models was quickly followed by Mistral AI's unveiling of its largest model to date: Mistral Large 2. This 123-billion paramet

Understanding Noise Schedules in Diffusion Models: A Comprehensive Guide Have you ever been captivated by the stunning visuals of digital art generated by AI and wondered about the underlying mechanics? A key element is the "noise schedule,&quo

Building a Contextual Chatbot with GPT-4o: A Comprehensive Guide In the rapidly evolving landscape of AI and NLP, chatbots have become indispensable tools for developers and organizations. A key aspect of creating truly engaging and intelligent chat

This article explores seven leading frameworks for building AI agents – autonomous software entities that perceive, decide, and act to achieve goals. These agents, surpassing traditional reinforcement learning, leverage advanced planning and reasoni

Understanding Type I and Type II Errors in Statistical Hypothesis Testing Imagine a clinical trial testing a new blood pressure medication. The trial concludes the drug significantly lowers blood pressure, but in reality, it doesn't. This is a Type

Sumy: Your AI-Powered Summarization Assistant Tired of sifting through endless documents? Sumy, a powerful Python library, offers a streamlined solution for automatic text summarization. This article explores Sumy's capabilities, guiding you throug


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

MinGW - Minimalist GNU for Windows
This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

SublimeText3 English version
Recommended: Win version, supports code prompts!

SublimeText3 Chinese version
Chinese version, very easy to use

SAP NetWeaver Server Adapter for Eclipse
Integrate Eclipse with SAP NetWeaver application server.

PhpStorm Mac version
The latest (2018.2.1) professional PHP integrated development tool