Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ant's generative AI black technology-AI-php.cn

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ant's generative AI black technology

WBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWBOYWB

Sep 29, 2023 pm 11:57 PM

digital manindustrygenerative aiiccv

Open a digital human, which is full of generative AI.

On the evening of September 23, at the opening ceremony of the Hangzhou Asian Games, the lighting of the main torch showed the "little flames" of hundreds of millions of online digital torchbearers gathering on the Qiantang River. A digital human image is formed. Then, the digital human torchbearer and the sixth torchbearer on site walked to the torch stage together and lit the main torch together

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

As the core idea of the opening ceremony, the digital torch bearer The Internet's torch-lighting form has become a hot search topic and attracted people's attention. Rewritten content: As the core idea of the opening ceremony, the torch lighting method of Digital Reality Internet has aroused heated discussions and attracted people's attention.

Digital People Ignition is an unprecedented initiative, with hundreds of millions of people participating. , involving a large number of advanced and complex technologies. One of the most important issues is how to make digital people "move". It can be clearly seen that with the rapid development of generative artificial intelligence and large-scale models, more new changes have appeared in digital human research

At the upcoming global computer vision conference ICCV 2023 in early October, We noticed that a study on generating 3D digital human motion was included in the conference. The related paper is titled "Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models" and was jointly published by Zhejiang University and Ant Group.

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

According to the introduction, this research solves to a certain extent the problem of digital humans synthesizing complex movements over long distances, and can achieve effects that cannot be achieved with original models or path planning. Technology related to digital human driving has also been used in the online delivery of 100 million digital human beings in the Asian Games

Generative AI driver to make digital humans move

Many times , we need to synthesize 3D human motion in a given 3D scene so that virtual humans can naturally walk around the scene and interact with objects. This effect has many applications in AR/VR, film production, and video games.

Here, traditional character control motion generation methods aim to generate short-term or repetitive motions guided by the user's control signals, while new research focuses on generating a given starting position and target object model. Longer human-computer interaction content.

Although this idea is more effective, it is obviously more challenging. First, human-object interactions should be coherent, which requires the ability to model long-range interactions between humans and objects. Second, in the context of content generation, generative models should be able to synthesize motions of different sizes, since there are multiple ways for real people to approach and interact with target objects.

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

^{Figure 1. Generation of interactive images between people and objects. Given an object, the new method first predicts a set of milestone events, where the ring represents the position and the person in pink represents the original pose. The algorithm fills in actions between milestones. The diagram shows the new method using the same object to generate different milestones and actions. The flow of time is shown with a color code, with darker blue representing further frames.}

In terms of methods for generating digital human movements, existing synthesis methods can be roughly divided into online generation and offline generation. Most online methods focus on real-time control of the character. Given a target object, they typically use autoregressive models to cyclically generate future motion through feedback predictions. Although this method has been widely used in interactive scenarios such as video games, its quality is still unsatisfactory for long-term generation.

In order to improve the quality of motion, some recent offline methods have begun to adopt a multi-level framework, first generating trajectories and then synthesizing motion. Although this strategy can produce reasonable paths, the diversity of paths is limited

In this new study, the authors propose a new offline method for synthesizing long-term and diverse Interaction between people and objects. The innovation of this method lies in the hierarchical generation strategy. First, the strategy predicts a series of milestones and then generates human actions between those milestones

Specifically, given a starting position and a target object, the author designed a milestone generation module to synthesize a set of nodes along the movement trajectory. Each milestone encodes the local pose and indicates the transition during human movement. point. Based on these milestones, the algorithm employs a motion generation module to generate complete motion sequences. Thanks to the existence of these milestones, we can simplify the generation of long sequences to the synthesis of several short motion sequences.

The local pose of each milestone is generated by a transformer model that considers global dependencies to produce time-consistent results, further facilitating coherent motion

In addition to the hierarchical generation framework, The researchers further used diffusion models to synthesize human-object interactions. Some previous motion synthetic diffusion models combined transformers and denoising diffusion probabilistic models (DDPM).

It is worth mentioning that due to the long motion sequences, applying them directly to the new settings requires a lot of calculations and may cause GPU memory explosion. Because the new hierarchical generation framework converts long-term generation into the synthesis of multiple short sequences, the GPU memory required is reduced to the same level as short-term motion generation.

Therefore, researchers can effectively use Transformer DDPM to synthesize long-term motion sequences, thereby improving the generation quality

To achieve this purpose, researchers designed a hierarchical generation framework, as shown in the figure below Show

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

First, they use GoalNet to predict interaction targets on objects, and then generate target poses to explicitly model human-object interactions. Next, they use the milestone generation module to estimate the length of the milestone, thereby generating the milestone trajectory from the starting point to the target, and place the milestone pose

In this way, the long-distance motion generation is decomposed into multiple short-distance Motion generated combinations. Finally, the authors designed a motion generation module to synthesize trajectories between milestones and fill in actions.

Artificial Intelligence (AI) Posture Generation

Researchers refer to the posture in which a person interacts with an object and remains stationary as the target posture. Previously, most methods used cVAE models to generate human poses, but researchers found that this method performed poorly in their own studies.

To address this challenge, they adopted the VQ-VAE model to model the data distribution. This model utilizes discrete representation to cluster data into a limited set of points. Furthermore, based on observations, different human poses may have similar properties. For example, when a person is sitting down, the hand movements may be different, but the leg position may be the same. Therefore, they divided the joints into L (L = 5) different non-overlapping groups

As shown in Figure 3, the target pose was divided into independent joint groups

Based on the starting pose and target pose, we can let the algorithm generate the milestone trajectory and synthesize the local pose at the milestone. Since the length of the motion data is unknown and can be arbitrary (for example, a person may quickly walk to the chair and sit down, or he may walk slowly around the chair and then sit down), it is necessary to predict the length of the milestone, represented by N . Then, N landmark points are synthesized and local poses are placed on these points.

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

The last step is action generation. The method used by the researchers is not to predict actions frame by frame, but to synthesize the entire sequence hierarchically based on the generated milestones. They first generate trajectories and then synthesize actions. Specifically, within two consecutive milestones, they complete the trajectory first. Then, fill in the movement guided by successive milestone gestures. These two steps are completed using two Transformer DDPM respectively.

The researcher will carefully design the conditions of DDPM for each step to generate the target output

The rewritten content is: the effect of being ahead of other products

The researchers compared the results of different methods on the SAMP dataset. It can be seen that the method proposed in the paper has lower FD, higher user research score and higher APD. Furthermore, their method achieves higher trajectory diversity than SAMP.

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

This new method can produce satisfactory results in complex scenes. The percentage of penetration frames generated by this method is 3.8%, and that of SAMP is 4.9%

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

On SAMP, COUCH and other data sets, the methods mentioned in the study have achieved Better results than baseline methods

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology

Complete full-link layout

Digital human is a multi-modal combination of voice, semantics, vision, etc. A combination of dynamic technologies. While generative AI has recently made breakthroughs, the field of digital humans is experiencing leapfrog development. The modeling, generation interaction, rendering and other aspects that previously required manual work are now being fully artificialized. As engineers continue to Optimization, the experience of this technology on the mobile terminal is also getting better. The just-concluded online Asian Games torch relay event is a good example: if we want to become a torch bearer, we only need to click on the mini program of the Alipay App.

It is said that in order to ensure the smooth progress of the opening ceremony project, Ant Group’s engineers conducted more than 100,000 tests on hundreds of different models of mobile phones, typed more than 200,000 lines of code, and passed self-research The combination of Web3D interactive engine Galacean, AI digital human, cloud services, blockchain and other technologies ensures that everyone can become a digital torchbearer and participate in the torch relay. The Asian Games Digital Torchbearer Platform can reach hundreds of millions of users and supports 97% of common smartphone devices.

In order to allow digital torchbearers to participate realistically, Ant’s technical team developed 58 face-pinching controllers. By using facial recognition and AI algorithms, they can map a digital torchbearer's face based on each person's facial features. At the same time, users can also freely adjust face shape, hairstyle, nose, mouth, eyebrows and other features to achieve free dress-up. This technology can provide 2 trillion different digital image choices

In addition, after the opening ceremony lighting ceremony, each digital torch bearer can receive an exclusive digital ignition certificate with each digital torch painted on it. With a unique image of your hand, this certificate will be stored on the blockchain through distributed technology.

Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ants generative AI black technology It is easy to see from the content of the research paper and the Asian Games projects that there is support from a complete digital human technology system behind it. It is understood that Ant Group is actively exploring digital human technology and has completed the self-research layout of the full-link core technology of digital human.

Unlike most companies on the market, Ant Group’s digital human technology is self-developed and has chosen a development direction that is combined with generative AI. In terms of technical deployment, it covers the entire life cycle of digital human modeling, rendering, driving, and interaction. Combining AIGC and large models significantly reduces the full-link production cost of digital humans. Currently, it can support 2D and 3D digital people, and provides a variety of solutions such as broadcast type and interactive type.

According to public information, it can be summarized that the Ant Digital Human Platform currently has four technical advantages and features:

In addition to the Asian Games, the Ant Digital People Platform also supports Ant Group’s Alipay, digital finance, government affairs, Wufu and other businesses, and this year began to apply it to short videos, live broadcasts, mini programs and other carriers to partners Provide basic services.

It can be predicted that in the near future, as digital humans powered by generative AI continue to upgrade, we will also experience better interactions in more scenarios, and truly enter a smart life integrating digital and real things.

The above is the detailed content of Digital people light the main torch of the Asian Games, and this ICCV paper reveals Ant's generative AI black technology. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:机器之心. If there is any infringement, please contact admin@php.cn delete

Are You At Risk Of AI Agency Decay? Take The Test To Find OutApr 21, 2025 am 11:31 AM

This article explores the growing concern of "AI agency decay"—the gradual decline in our ability to think and decide independently. This is especially crucial for business leaders navigating the increasingly automated world while retainin

How to Build an AI Agent from Scratch? - Analytics VidhyaApr 21, 2025 am 11:30 AM

Ever wondered how AI agents like Siri and Alexa work? These intelligent systems are becoming more important in our daily lives. This article introduces the ReAct pattern, a method that enhances AI agents by combining reasoning an

Revisiting The Humanities In The Age Of AIApr 21, 2025 am 11:28 AM

"I think AI tools are changing the learning opportunities for college students. We believe in developing students in core courses, but more and more people also want to get a perspective of computational and statistical thinking," said University of Chicago President Paul Alivisatos in an interview with Deloitte Nitin Mittal at the Davos Forum in January. He believes that people will have to become creators and co-creators of AI, which means that learning and other aspects need to adapt to some major changes. Digital intelligence and critical thinking Professor Alexa Joubin of George Washington University described artificial intelligence as a “heuristic tool” in the humanities and explores how it changes

Understanding LangChain Agent FrameworkApr 21, 2025 am 11:25 AM

LangChain is a powerful toolkit for building sophisticated AI applications. Its agent architecture is particularly noteworthy, allowing developers to create intelligent systems capable of independent reasoning, decision-making, and action. This expl

What are the Radial Basis Functions Neural Networks?Apr 21, 2025 am 11:13 AM

Radial Basis Function Neural Networks (RBFNNs): A Comprehensive Guide Radial Basis Function Neural Networks (RBFNNs) are a powerful type of neural network architecture that leverages radial basis functions for activation. Their unique structure make

The Meshing Of Minds And Machines Has ArrivedApr 21, 2025 am 11:11 AM

Brain-computer interfaces (BCIs) directly link the brain to external devices, translating brain impulses into actions without physical movement. This technology utilizes implanted sensors to capture brain signals, converting them into digital comman

Insights on spaCy, Prodigy and Generative AI from Ines MontaniApr 21, 2025 am 11:01 AM

This "Leading with Data" episode features Ines Montani, co-founder and CEO of Explosion AI, and co-developer of spaCy and Prodigy. Ines offers expert insights into the evolution of these tools, Explosion's unique business model, and the tr

A Guide to Building Agentic RAG Systems with LangGraphApr 21, 2025 am 11:00 AM

This article explores Retrieval Augmented Generation (RAG) systems and how AI agents can enhance their capabilities. Traditional RAG systems, while useful for leveraging custom enterprise data, suffer from limitations such as a lack of real-time dat

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months agoByDDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks agoByDDD

Hot Tools

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

SAP NetWeaver Server Adapter for Eclipse

Integrate Eclipse with SAP NetWeaver application server.

MinGW - Minimalist GNU for Windows

This project is in the process of being migrated to osdn.net/projects/mingw, you can continue to follow us there. MinGW: A native Windows port of the GNU Compiler Collection (GCC), freely distributable import libraries and header files for building native Windows applications; includes extensions to the MSVC runtime to support C99 functionality. All MinGW software can run on 64-bit Windows platforms.

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

Hot Topics

Where is the login entrance for gmail email?

7627

CakePHP Tutorial

1389

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

140