search
HomeTechnology peripheralsAIUsing Apple Vision Pro to control robots from a distance, NVIDIA: It's not difficult to 'integrate man and machine”

Huang Renxun said: "The next wave of AI is robots, and one of the most exciting developments is humanoid robots." Today, Project GR00T has taken another important step.

Yesterday, NVIDIA founder Huang Jensen talked about its universal basic model of humanoid robot "Project GR00T" in his SIGGRAPH 2024 Keynote speech. The model receives a series of updates in terms of functionality.

Assistant Professor at the University of Texas at Austin and NVIDIA Senior Research Scientist Yuke Zhu tweeted, demonstrating in the video how NVIDIA integrates the general household robot large-scale simulation training framework RoboCasa and MimicGen system into the NVIDIA Omniverse platform and Isaac robot development platform .

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛

                                                                                                           Covers Nvidia's three computing platforms, including AI, Omniverse and Jetson Thor, Leverage them to simplify and accelerate developer workflows. Through the joint empowerment of these computing platforms, we are expected to enter the era of humanoid robots driven by physical AI.

The biggest highlight is that developers can use Apple Vision Pro to remotely control humanoid robots to perform tasks.

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛Meanwhile, another Nvidia senior research scientist, Jim Fan, said the updates to Project GR00T are exciting. NVIDIA uses a systematic approach to scaling robotics data to solve the toughest challenges in robotics.

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛The idea is also very simple: humans collect demonstration data on real robots, and NVIDIA expands these data by a thousand times or more in simulation. With GPU-accelerated simulation, people can now exchange computing power for the time-consuming, labor-intensive and costly work of humans collecting data.

He talks about how not so long ago he thought remote control was fundamentally unscalable because in the atomic world we were always limited to 24 hours/robots/days. NVIDIA's new synthetic data pipeline on GR00T breaks this limitation in the bit world.

                                                                                                

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛

Regarding Nvidia’s latest progress in the field of humanoid robots, some netizens said that Apple Vision Pro has found the best solution. Cool use case.

NVIDIA is beginning to lead the next wave: physical AI用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛

NVIDIA also detailed the technical process of accelerating humanoid robots in a blog. The full content is as follows:

To accelerate the development of humanoid robots worldwide, NVIDIA announced a set of services, models and computing platforms for the world's leading robot manufacturers, AI model developers and software manufacturers to develop, train and build the next generation of humanoid robots.
This suite of products includes new NVIDIA NIM microservices and frameworks for robotics simulation and learning, NVIDIA OSMO orchestration services for running multi-stage robotics workloads, and AI and simulation-enabled remote operations workflows that allow development Researchers use small amounts of human demonstration data to train robots.

Jensen Huang said: "The next wave of AI is robots, and one of the most exciting developments is humanoid robots. We are advancing the development of the entire NVIDIA robot stack, open to humanoid robot developers and companies around the world Access, allowing them to use the platforms, acceleration libraries and AI models that best fit their needs.

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛

Accelerate development with NVIDIA NIM and OSMO

NIM microservices are powered by NVIDIA inference software. of pre-built containers that enable developers to reduce deployment time from weeks to minutes.

Two new AI microservices will allow robotics experts to enhance generative physics AI simulation workflows in NVIDIA Isaac Sim.

The MimicGen NIM microservice generates synthetic motion data from remote data recorded from spatial computing devices such as the Apple Vision Pro. Robocasa NIM microservices generate robotic tasks and simulation environments in OpenUSD.

Cloud-native managed service NVIDIA OSMO is now available, allowing users to orchestrate and scale complex robotics development workflows across distributed computing resources, whether on-premises or in the cloud. The emergence of OSMO greatly simplifies robot training and simulation workflows, shortening deployment and development cycles from months to less than a week.

Provides advanced data capture workflow for humanoid robot developers

Training the underlying models behind humanoid robots requires a large amount of data. One way to obtain human demonstration data is to use remote operations, but this is becoming increasingly expensive and lengthy.

Through the NVIDIA AI and Omniverse remote manipulation reference workflow demonstrated at the SIGGRAPH computer graphics conference, researchers and AI developers can generate large amounts of synthetic motion and perception data from a very small number of remotely captured human demonstrations.

用苹果Vision Pro隔空操控机器人,英伟达:「人机合一」也不难嘛

First, developers used Apple Vision Pro to capture a handful of remote demos. They then simulated the recordings in NVIDIA Isaac Sim and used the MimicGen NIM microservice to generate synthetic datasets from the recordings.

Developers use real and synthetic data to train the Project GR00T humanoid robot base model, saving a lot of time and reducing costs. They then used the Robocasa NIM microservice in Isaac Lab, a robot learning framework, to generate experiences to retrain the robot model. Throughout the entire workflow, NVIDIA OSMO seamlessly distributes computing tasks to different resources, saving developers weeks of administrative workload.

Expand access to NVIDIA humanoid robot developer technology

NVIDIA offers three computing platforms to simplify the development of humanoid robots: NVIDIA AI supercomputer for training models; built on Omniverse the NVIDIA Isaac Sim, which allows robots to learn and perfect skills in a simulated world; and the NVIDIA Jetson Thor humanoid robotics computer used to run the model. Developers can access and use all or parts of the platform based on their specific needs.

Through the new NVIDIA Humanoid Developer Program, developers can gain early access to new products and the latest versions of NVIDIA Isaac Sim, NVIDIA Isaac Lab, Jetson Thor and Project GR00T universal humanoid base models.

1x, Boston Dynamics, ByteDance, Field AI, Figure, Fourier, Galbot, LimX Dynamics, Mentee, Neura Robotics, RobotEra and Skild AI are the first companies to join the early access program.

Developers can now join the NVIDIA Humanoid Developer Program to gain access to NVIDIA OSMO and Isaac Lab, and will soon gain access to NVIDIA NIM microservices.

Blog link:
https://nvidianews.nvidia.com/news/nvidia-accelerates-worldwide-humanoid-robotics-development

The above is the detailed content of Using Apple Vision Pro to control robots from a distance, NVIDIA: It's not difficult to 'integrate man and machine”. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Are You At Risk Of AI Agency Decay? Take The Test To Find OutAre You At Risk Of AI Agency Decay? Take The Test To Find OutApr 21, 2025 am 11:31 AM

This article explores the growing concern of "AI agency decay"—the gradual decline in our ability to think and decide independently. This is especially crucial for business leaders navigating the increasingly automated world while retainin

How to Build an AI Agent from Scratch? - Analytics VidhyaHow to Build an AI Agent from Scratch? - Analytics VidhyaApr 21, 2025 am 11:30 AM

Ever wondered how AI agents like Siri and Alexa work? These intelligent systems are becoming more important in our daily lives. This article introduces the ReAct pattern, a method that enhances AI agents by combining reasoning an

Revisiting The Humanities In The Age Of AIRevisiting The Humanities In The Age Of AIApr 21, 2025 am 11:28 AM

"I think AI tools are changing the learning opportunities for college students. We believe in developing students in core courses, but more and more people also want to get a perspective of computational and statistical thinking," said University of Chicago President Paul Alivisatos in an interview with Deloitte Nitin Mittal at the Davos Forum in January. He believes that people will have to become creators and co-creators of AI, which means that learning and other aspects need to adapt to some major changes. Digital intelligence and critical thinking Professor Alexa Joubin of George Washington University described artificial intelligence as a “heuristic tool” in the humanities and explores how it changes

Understanding LangChain Agent FrameworkUnderstanding LangChain Agent FrameworkApr 21, 2025 am 11:25 AM

LangChain is a powerful toolkit for building sophisticated AI applications. Its agent architecture is particularly noteworthy, allowing developers to create intelligent systems capable of independent reasoning, decision-making, and action. This expl

What are the Radial Basis Functions Neural Networks?What are the Radial Basis Functions Neural Networks?Apr 21, 2025 am 11:13 AM

Radial Basis Function Neural Networks (RBFNNs): A Comprehensive Guide Radial Basis Function Neural Networks (RBFNNs) are a powerful type of neural network architecture that leverages radial basis functions for activation. Their unique structure make

The Meshing Of Minds And Machines Has ArrivedThe Meshing Of Minds And Machines Has ArrivedApr 21, 2025 am 11:11 AM

Brain-computer interfaces (BCIs) directly link the brain to external devices, translating brain impulses into actions without physical movement. This technology utilizes implanted sensors to capture brain signals, converting them into digital comman

Insights on spaCy, Prodigy and Generative AI from Ines MontaniInsights on spaCy, Prodigy and Generative AI from Ines MontaniApr 21, 2025 am 11:01 AM

This "Leading with Data" episode features Ines Montani, co-founder and CEO of Explosion AI, and co-developer of spaCy and Prodigy. Ines offers expert insights into the evolution of these tools, Explosion's unique business model, and the tr

A Guide to Building Agentic RAG Systems with LangGraphA Guide to Building Agentic RAG Systems with LangGraphApr 21, 2025 am 11:00 AM

This article explores Retrieval Augmented Generation (RAG) systems and how AI agents can enhance their capabilities. Traditional RAG systems, while useful for leveraging custom enterprise data, suffer from limitations such as a lack of real-time dat

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools