search
HomeTechnology peripheralsAILet Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.

Team members are all from Stanford University, and the CTO is also a die-hard fan of Taylor Swift.

The AI ​​video field is going crazy.

The carnival caused by Luma is not over yet, there is another challenger in the AI ​​video circle——

Stanford Proteus produced by the university team.

Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.

According to reports, Proteus is a low-latency basic model that can generate highly realistic and expressive characters.

For example, let the protagonist in a world-famous painting - Mona Lisa or the girl with a pearl earring - laugh unbridled and have a natural and smooth facial expression: Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.Let Audrey Hepburn change her past lady image and start hip-hop rap: Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Also let Si in "Harry Potter" Professor Nepp sings "Despacito":
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Proteus has just been released, and a number of big guys sent "congratulatory letters":

AI scientist Jia Yangqing praised that the quality of real-time artificial intelligence avatars is surprisingly good.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
NVIDIA scientist Jim Fan said that this project is impressive.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Early investor Brian Zhan wrote that the biggest problem with existing AI video tools, such as Runway and Pika, is that they can produce hallucinations, especially when generating videos containing humans. hour. Apparate Labs takes AI video generation to the next stage by solving problems such as temporal coherence and object constancy.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Outrageous! Lu Xun talked about tongue twisters

Proteus is a new generation of basic model for real-time human expression generation.

#You must know that even the most advanced and powerful generation models currently cannot fully realize the real-time generation of human expressions.

Existing models are slow and fail to provide intuitive control over the complex facial expressions and body movements of generated characters, and they still suffer from realism and expressiveness What is lacking.

Proteus adopts the most advanced latent diffusion model of transformer architecture. Its innovative latent space design ensures high real-time efficiency, and with the development of architecture and algorithm With continuous optimization, Proteus is able to achieve video streaming of more than 100 frames per second (100+ FPS).

In other words, with just a simple photo, Proteus is not only able to imitate human laughter, rapping, singing, blinking, smiling and conversation, but it can also perform much more So many vivid expressions and movements.

For example, the always serious Lu Xun talked about tongue twisters:
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford. Or let Madame Curie sing a cappella "Le Festin》: Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.Or hold a roundtable meeting for scientists:
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
According to the Proteus R&D team, they It is expected that Proteus can become a voice-controllable visual avatar, provide an intuitive interactive interface for artificial intelligence dialogue entities, and be seamlessly compatible with many multi-modal large language models to provide customized services for various application scenarios.

In response to this, many netizens have opened their minds -

"Just use Einstein's By fine-tuning the large language model with data, coupled with his vivid facial expressions, the great Einstein can become a teaching assistant and teach physics classes in person, so teenagers no longer have to worry about failing to learn science."
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Some netizens said, I love it so much. This year is definitely the year of AI video.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
The team behind Qidian

##This model is praised by big guys, small and Beautiful model, what kind of team is behind it?

According to the official website, this was developed by Apparate Labs at Stanford University.

Currently there are only 6 people in the team. Judging from the names and photos, 3 of them are Chinese.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
CEO and co-founder Dr. William Shen studied in the Department of Computer Science at Stanford University, co-supervised by well-known professors Silvio Savarese and Leonidas J. Guibas .

Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.

His research covers multiple fields of artificial intelligence, including computer vision, robotics, graphics, generative models and embodied intelligence. His papers have won many awards, such as winning the Best Paper Award at IEEE-CVPR and being a finalist for the Best Student Paper Award on RSS.

Previously, he also received a bachelor’s degree in computer science from Stanford University with an excellent GPA of 4.0.

Chief Technology Officer and Co-Founder Connor Lin is also a top student.

He studied at Carnegie Mellon University for his bachelor's and master's degrees, studying under Professor Keenan Crane. In 2020, he will go to Stanford University to pursue a PhD in computer science. He is currently a fourth-year doctoral student, co-supervised by professors Leonidas Guibas and Gordon Wetzstein.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Connor Lin’s research focuses on 3D prior knowledge and neural representation for 3D reconstruction, generation and editing. He was supported by the David Cheriton Stanford Graduate Scholarship.

During his PhD studies, he interned at Google Research, NVIDIA Research and Adobe Research. Previously, he worked as a software engineer at Google, responsible for the development of portrait mode for Pixel phones.

In addition, this guy has a wide range of interests. He likes travel and sports, cooking, badminton, swimming, board games and music. He is also a die-hard fan of Taylor Swift

Like Connor Lin, chief scientist Linqi (Alex) Zhou is also a doctoral student at Stanford University, supervised by Professor Stefano Ermon.
Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.
Previously, Linqi Zhou received a bachelor's degree in computer science and applied mathematics from the University of California, Los Angeles, under the supervision of Professors Song-Chun Zhu and Ying-Nian Wu.

He conducts research mainly in the fields of computer vision and machine learning, and is committed to building models that can understand the world in a structured and probabilistic way.

Reference link:
##https://apparate.ai/ stream.html

The above is the detailed content of Let Lu Xun speak tongue twisters and Hepburn play hip-hop. Another video model went viral and was founded by a Chinese doctor from Stanford.. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
10 Generative AI Coding Extensions in VS Code You Must Explore10 Generative AI Coding Extensions in VS Code You Must ExploreApr 13, 2025 am 01:14 AM

Hey there, Coding ninja! What coding-related tasks do you have planned for the day? Before you dive further into this blog, I want you to think about all your coding-related woes—better list those down. Done? – Let’

Cooking Up Innovation: How Artificial Intelligence Is Transforming Food ServiceCooking Up Innovation: How Artificial Intelligence Is Transforming Food ServiceApr 12, 2025 pm 12:09 PM

AI Augmenting Food Preparation While still in nascent use, AI systems are being increasingly used in food preparation. AI-driven robots are used in kitchens to automate food preparation tasks, such as flipping burgers, making pizzas, or assembling sa

Comprehensive Guide on Python Namespaces & Variable ScopesComprehensive Guide on Python Namespaces & Variable ScopesApr 12, 2025 pm 12:00 PM

Introduction Understanding the namespaces, scopes, and behavior of variables in Python functions is crucial for writing efficiently and avoiding runtime errors or exceptions. In this article, we’ll delve into various asp

A Comprehensive Guide to Vision Language Models (VLMs)A Comprehensive Guide to Vision Language Models (VLMs)Apr 12, 2025 am 11:58 AM

Introduction Imagine walking through an art gallery, surrounded by vivid paintings and sculptures. Now, what if you could ask each piece a question and get a meaningful answer? You might ask, “What story are you telling?

MediaTek Boosts Premium Lineup With Kompanio Ultra And Dimensity 9400MediaTek Boosts Premium Lineup With Kompanio Ultra And Dimensity 9400Apr 12, 2025 am 11:52 AM

Continuing the product cadence, this month MediaTek has made a series of announcements, including the new Kompanio Ultra and Dimensity 9400 . These products fill in the more traditional parts of MediaTek’s business, which include chips for smartphone

This Week In AI: Walmart Sets Fashion Trends Before They Ever HappenThis Week In AI: Walmart Sets Fashion Trends Before They Ever HappenApr 12, 2025 am 11:51 AM

#1 Google launched Agent2Agent The Story: It’s Monday morning. As an AI-powered recruiter you work smarter, not harder. You log into your company’s dashboard on your phone. It tells you three critical roles have been sourced, vetted, and scheduled fo

Generative AI Meets PsychobabbleGenerative AI Meets PsychobabbleApr 12, 2025 am 11:50 AM

I would guess that you must be. We all seem to know that psychobabble consists of assorted chatter that mixes various psychological terminology and often ends up being either incomprehensible or completely nonsensical. All you need to do to spew fo

The Prototype: Scientists Turn Paper Into PlasticThe Prototype: Scientists Turn Paper Into PlasticApr 12, 2025 am 11:49 AM

Only 9.5% of plastics manufactured in 2022 were made from recycled materials, according to a new study published this week. Meanwhile, plastic continues to pile up in landfills–and ecosystems–around the world. But help is on the way. A team of engin

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
3 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

Safe Exam Browser

Safe Exam Browser

Safe Exam Browser is a secure browser environment for taking online exams securely. This software turns any computer into a secure workstation. It controls access to any utility and prevents students from using unauthorized resources.

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools