Google's new AI is hot! You can draw the longest word in the world

王林

Apr 09, 2023, 09:51 PM

Friend, do you know what this English word is?

Pneumonoultramicroscopicsilicovolcanoconiosis.

This is the longest word recognized in English dictionaries. Its 45 letters mean "a disease caused by the deposition of volcanic silica particles in the lungs" (commonly known as volcanic silicosis).

But what if, instead of asking you to spell this word, you... draw it?

(You can't even read it, and you want to draw it???)

Parti, the latest AI proposed by Google, handles this problem with ease.

Feed the word to Parti and it generates several plausible pictures of diseased lungs:

But this is just a small test of Parti’s capabilities. According to Google, it is currently the most advanced “text-to-image” AI.

For example, if you tell it: "Combine the Sydney Opera House with the Eiffel Tower", the output will be like this:

(If you didn't know better, you would think it came from a pictorial.)

Moreover, its algorithm differs from that of Google's own Imagen, and Parti can be said to have taken "AI painting" to a new level.

Even Jeff Dean, the head of Google AI, tweeted about it several times and was clearly having fun:

Scalable to 20 billion parameters: more realistic, "smarter"

In fact, Parti's capabilities don't stop there.

For one thing, because the model scales to 20 billion parameters, the images it generates are more detailed and realistic.

Whether the prompt is just a few words or a short paragraph of more than fifty words, Parti renders it clearly.

For example: "The back of a violin."

Or a night scene described after Van Gogh's "Starry Night". (P.S. That prompt is 67 words long.)

Parti has no trouble with it, and delivers the picture in a whole range of styles in one go~

This is Parti's second great strength: not only are the details in place, the style can also vary.

Even for odd descriptions like "a raccoon wearing a formal suit and a top hat, holding a cane and a garbage bag", it still produces inventive results without losing detail.

In terms of styles, there are Van Gogh style, Egyptian Pharaoh style, pixel style, traditional Chinese painting style, abstract style...

Sometimes it even makes puns.

(Toad'ay, toad)

As for test results, Parti achieves state-of-the-art FID scores on both MS-COCO and Localized Narratives (LN, whose descriptions are four times longer).

In particular, its zero-shot FID score on MS-COCO is just 7.23 and its fine-tuned FID score is 3.22 (lower is better), beating the earlier Imagen and DALL-E 2.
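For readers unfamiliar with the metric: FID (Fréchet Inception Distance) compares the statistics of features extracted from generated images with those of real images, and a lower value means the two sets are closer. The sketch below is a simplified illustration, not the evaluation code behind the numbers above. It assumes the feature covariance is diagonal, so the distance can be computed one dimension at a time; the function name `frechet_distance_diagonal` and the toy feature vectors are made up for this example. Real FID uses full covariance matrices of Inception-network features.

```python
import math
from statistics import fmean, pvariance


def frechet_distance_diagonal(feats_a, feats_b):
    """Frechet distance between two Gaussians fitted to feature sets.

    Simplifying assumption: covariances are treated as diagonal, so the
    matrix square root in the full FID formula reduces to a per-dimension
    square root. This is an illustration, not a drop-in FID implementation.
    """
    dims = len(feats_a[0])
    total = 0.0
    for d in range(dims):
        col_a = [row[d] for row in feats_a]
        col_b = [row[d] for row in feats_b]
        mean_a, mean_b = fmean(col_a), fmean(col_b)
        var_a, var_b = pvariance(col_a), pvariance(col_b)
        # Squared difference of means plus the variance mismatch term.
        total += (mean_a - mean_b) ** 2
        total += var_a + var_b - 2.0 * math.sqrt(var_a * var_b)
    return total


# Toy "features": the second set is the first shifted by 1 in dimension 0.
real = [[0.0, 0.0], [2.0, 2.0]]
fake = [[1.0, 0.0], [3.0, 2.0]]
print(frechet_distance_diagonal(real, real))  # 0.0
print(frechet_distance_diagonal(real, fake))  # 1.0
```

Identical feature sets give a distance of zero, and the distance grows as the means or variances drift apart, which is why a score of 3.22 indicates a closer match to real images than 7.23.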

All components are Transformers

Just a month later, Google has taken AI painting to a new level, yet the authors say the secret is very simple.

Parti treats text-to-image generation as a sequence-to-sequence modeling problem. This is similar to machine translation: text tokens are fed to the encoder as input, but the target output is an image rather than text.

Structurally, it has just three components: an encoder, a decoder and an image tokenizer, all based on the standard Transformer.
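The sequence-to-sequence idea can be sketched as a decoding loop: the decoder emits image tokens one at a time, each conditioned on the text tokens and on the image tokens produced so far. This is only a toy illustration. `toy_next_token_scores` is a hypothetical stand-in for the Transformer encoder-decoder, and the vocabulary size, greedy decoding and token counts are assumptions chosen to keep the example small, not details taken from Parti.

```python
from typing import Callable, List, Sequence

ScoreFn = Callable[[Sequence[int], Sequence[int]], List[float]]


def generate_image_tokens(text_tokens: Sequence[int],
                          next_token_scores: ScoreFn,
                          num_image_tokens: int) -> List[int]:
    """Greedy autoregressive decoding of image tokens from text tokens."""
    image_tokens: List[int] = []
    for _ in range(num_image_tokens):
        # Score every candidate token given the text and the tokens so far.
        scores = next_token_scores(text_tokens, image_tokens)
        # Greedy choice: keep the highest-scoring token.
        best = max(range(len(scores)), key=scores.__getitem__)
        image_tokens.append(best)
    return image_tokens


VOCAB_SIZE = 8  # hypothetical; a real image codebook is far larger


def toy_next_token_scores(text_tokens: Sequence[int],
                          image_tokens: Sequence[int]) -> List[float]:
    """Hypothetical stand-in for the model: favors one token per step."""
    favored = (sum(text_tokens) + len(image_tokens)) % VOCAB_SIZE
    return [1.0 if token == favored else 0.0 for token in range(VOCAB_SIZE)]


tokens = generate_image_tokens([3, 1, 4], toy_next_token_scores, num_image_tokens=4)
print(tokens)  # [0, 1, 2, 3]
```

In a full system, the resulting token sequence would then be handed to the image tokenizer's decoder to turn the tokens back into pixels.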

First, the Transformer-based image tokenizer ViT-VQGAN encodes each image as a sequence of discrete tokens.
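The core of turning an image into discrete tokens is vector quantization: each patch embedding is replaced by the index of its nearest entry in a learned codebook. The snippet below is a minimal sketch of that lookup only. The tiny two-dimensional codebook and the patch vectors are invented for illustration, and ViT-VQGAN's actual encoder, codebook size and training procedure are not shown.

```python
from typing import List, Sequence


def squared_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def quantize(patch_vectors: Sequence[Sequence[float]],
             codebook: Sequence[Sequence[float]]) -> List[int]:
    """Map each patch vector to the index of its nearest codebook entry."""
    return [
        min(range(len(codebook)),
            key=lambda i: squared_distance(vec, codebook[i]))
        for vec in patch_vectors
    ]


# Hypothetical 3-entry codebook and three patch embeddings.
codebook = [[0.0, 0.0], [1.0, 1.0], [4.0, 4.0]]
patches = [[0.1, -0.2], [3.5, 4.2], [0.9, 1.3]]
print(quantize(patches, codebook))  # [0, 2, 1]
```

The list of indices is the "sentence" of image tokens that a sequence model can then learn to predict.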

Then the Transformer encoder-decoder is scaled up to 20 billion parameters.

Apart from the earliest GAN-based work, previous research on text-to-image generation falls roughly into two approaches.

One is based on autoregressive models: text features are first mapped to image features, and a sequence architecture such as the Transformer then learns the relationship between the language input and the image output.

A key component of this approach is the image tokenizer, which converts each image into a sequence of discrete tokens. DALL-E and CogView, for example, take this approach.

The other, which has made rapid progress recently, is diffusion-based text-to-image models such as DALL-E 2 and Imagen.

They drop the image tokenizer and use a diffusion model to generate images directly. These models produce higher-quality images and achieve better zero-shot FID scores on MS-COCO.

Parti's success shows that autoregressive models can also improve text-to-image generation.

The Parti team also introduced and released a new benchmark, PartiPrompts, which measures a model's ability across 12 categories and 11 challenges.

But Parti still has limitations, and the researchers also showed some failure cases:

For example, negation in a prompt has no effect~

A plate without bananas, and a glass without orange juice next to it.

It also makes some common-sense mistakes, such as getting relative scale wrong. In this picture, for example, the robot is several times taller than the race car.

A shiny robot wearing a racing suit and black visor stands proudly in front of an F1 car. The sun sets over the cityscape. Comic book illustration.

Google "competes with itself"

This study comes from Google Research, and most of the team members are Chinese.

Core researchers include Yuanzhong Xu and Thang Luong, among others, all currently doing AI-related research at Google.

(Thang Luong has been cited some 20,000 times on Google Scholar.)

△Left: Yuanzhong Xu; Right: Thang Luong

Interestingly, Imagen, which also draws from a single sentence and also comes from Google, is closely connected to Parti.

Parti's GitHub project documentation says:

Thanks to the Imagen team for sharing their recent and complete results with us before Imagen's release.

Their key findings on CF-guidance (classifier-free guidance) were particularly helpful for the final Parti model.
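For context, classifier-free guidance is commonly described as combining two predictions, one conditioned on the prompt and one not, and pushing the result toward the conditioned one. The sketch below shows that common formulation applied to next-token logits. The function name, the toy logits and the guidance scale are assumptions for illustration, and the exact form used in Parti may differ.

```python
from typing import List, Sequence


def apply_cf_guidance(cond_logits: Sequence[float],
                      uncond_logits: Sequence[float],
                      scale: float) -> List[float]:
    """Blend conditional and unconditional logits.

    scale = 0 returns the unconditional logits, scale = 1 the conditional
    ones, and scale > 1 extrapolates further toward the prompt.
    """
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


cond = [2.0, 0.0]    # toy logits given the text prompt
uncond = [1.0, 1.0]  # toy logits with the prompt dropped
print(apply_cf_guidance(cond, uncond, scale=2.0))  # [3.0, -1.0]
```

A larger scale makes the output follow the prompt more closely, usually at some cost in diversity.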

Burcu Karagol Ayan, one of Imagen's authors, also took part in the Parti project.

(It's as if Google is "competing with itself".)

What's more, Aditya Ramesh, an author of the "next-door" DALL-E 2, took part in discussions on Parti's MS-COCO evaluation.

The authors of DALL-Eval also helped with Parti's data work.

One More Thing

To be fair, "text-to-image generation" is not only a darling of researchers.

Netizens never tire of "playing" with it (and their imagination knows no bounds).

A while ago, a request for Imagen to draw a Song Dynasty-style "tiger wearing VR" escalated into a full-blown AI painting battle.

△Picture: Art by Imagen

DALL·E, Midjourney and others "heard the news and came running" to join in.

△ Drawing by DALL·E

Some people even combined Wordle with DALL-E mini:

......

Back to Parti: fun as it is, some netizens still asked a question that "cuts straight to the soul":

When will it be commercialized? There is no point in just "playing behind closed doors" by yourself.

Parti paper address:

https://parti.research.google/

GitHub project address:

https://github.com/google-research/parti

Reference links:

[1] https://twitter.com/lmthang/status/1539664610596225024

[2] https://gizmodo.com/new-browser-game-combines-dall-e-mini-and-wordle-1849105289

[3] https://imagen.research.google/

Statement

This article is reproduced from 51CTO.COM. If there is any infringement, please contact admin@php.cn to have it removed.
