CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture-AI-php.cn

Home

Technology peripherals

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

王林

Apr 12, 2023 pm 01:43 PM

robotalgorithmModel

Most of the furniture that people come into contact with in daily life are "articulated objects", such as drawers with pull-out rails, doors with vertical rotation axes, doors with horizontal rotation Shaft oven, because the main parts of these objects are connected by various joints.

Due to the existence of these joints, the various parts of the parts of the connected object are kinematically constrained by the joints, so these parts have only one degree of freedom (1 DoF). These items are everywhere in our lives, especially in our daily homes. They are an important part of our daily lives. We as humans see that no matter what kind of furniture we have, we can quickly figure out how to manipulate and control it. It's as if we know how every joint of these objects moves.

So can robots predict how furniture will move like humans? This kind of predictive ability is hard to come by, and if robots could learn this ability, it would be a huge boost for home robots.

Recently, two students in the R-PAD laboratory of Professor David Held of the CMU School of Robotics, Ben Eisner and Harry Zhang, have made breakthroughs in manipulating complex joint objects and launched a 3D-based FlowBot 3D for Neural Networks, an algorithm that effectively expresses and predicts the motion trajectories of parts of articulated objects, such as everyday furniture. The algorithm contains two parts.

The first part is the perception part, which uses a 3D deep neural network to predict the three-dimensional instantaneous motion trajectory from the point cloud data of the manipulated furniture object ( 3D Articulated Flow).

The second part of the algorithm is the policy part, which uses the predicted 3D Articulated Flow to select the robot's next action. Both are fully learned in the simulator and can be implemented directly in the real world without retraining or tuning. With the help of the FlowBot 3D algorithm, the robot can manipulate articulated objects such as everyday furniture at will, just like humans.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

##This paper is currently the world’s top robotics conference Robotics Science and Systems (RSS) 2022 best paper candidates (top 3%), and will be exhibited in New York, USA in July, competing with 7 other outstanding articles for the honor of the best paper.

Paper address: https://arxiv.org/pdf/2205.04382.pdf
Project homepage: https:// sites.google.com/view/articulated-flowbot-3d

FlowBot 3D relies solely on the simulator and performs supervised learning on simulated data to learn articulated objects such as everyday furniture Instantaneous motion trajectory of the part (3D Articulated Flow). 3D Articulated Flow is a visual point cloud trajectory representation method that can greatly simplify the complexity of the robot's next strategy and improve generalization and efficiency. The robot can complete the task of manipulating joint objects by simply following this instantaneous trajectory and re-predicting this trajectory in a closed loop.

Previously, the conventional method in academia for manipulating joint objects such as furniture was to calculate the movement direction of the part through the geometric characteristics of the manipulated object (such as the position and direction of the connected parts). , or by imitating expert strategies (usually from humans) to learn the operation of specific objects to complete complex actions of joint object manipulation. These traditional methods in academia do not have good generalization, and the efficiency of utilizing data is low. Training requires the collection of a large amount of human demonstration data. Unlike these, FlowBot 3D is the first purely simulator-based learning that does not require humans to provide any demonstration data, and the algorithm allows the robot to calculate the optimal object manipulation path by learning the instantaneous motion trajectory of each part, so the The algorithm has great generalizability. It is this feature that allows FlowBot 3D to generalize to objects invisible during simulator training, successfully manipulating real, everyday furniture items directly in the real world.

The following animations demonstrate the manipulation process of FlowBot 3D. On the left is the manipulated video, and on the right is the predicted instantaneous motion trajectory of the point cloud 3D Articulated Flow. The FlowBot 3D algorithm first enables the robot to identify which part on an object can be manipulated and then predict the direction of movement of that part.

Open the refrigerator door:

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

Open the toilet seat:

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

##Open the drawer:

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

##The reviewer of this paper said: Overall, this paper It is a considerable contribution to robot control science.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

So, how does FlowBot 3D learn this skill?

When humans see a new furniture item, such as a door, we know that the door rotates through a door axis, and we know the constraints of the door axis. This door can only rotate in one direction, so we can follow the direction imagined in our minds to open the door. Therefore, if you want a robot to be truly dexterous and effective in predicting the manipulation methods and motion trajectories of joint objects such as furniture, an effective way is to let the robot understand the kinematic constraints of these parts, so that it can predict the movement of these objects. trajectory.

The specific method of FlowBot 3D is not complicated and relies only on the simulator without the need for complicated real human data. In addition, another benefit of the simulator is that in the simulator, the 3D data files (URDF) of these household objects contain the kinematic constraints of each part and the specific parameters of the constraints, so the motion trajectory of each part is in the simulator. can be calculated accurately.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

Two modules for FlowBot 3D.

During simulator training, the robot observes the three-dimensional point cloud data of the manipulated object as input data to the robot vision module. The vision module (perception module) uses PointNet to predict the 3D articulated flow of the instantaneous motion trajectory of each point in the input point cloud under the action of external force (for example, after the drawer is opened 1cm, the door opens outward 5 degrees), using the three-dimensional coordinate vector expressed in poor form. The actual data of this motion trajectory can be accurately calculated through forward kinematics. By subtracting the current three-dimensional vector coordinate from the next three-dimensional vector coordinate, the motion trajectory of the manipulated object part can be obtained. Therefore, during training, only the L2 loss of the predicted 3D Articulated flow needs to be minimized for supervised learning.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

In this picture, the blue points are the observed point cloud data, and the red arrows represent the predicted facade. Motion trajectory 3D Articulated Flow.

By learning in this way, FlowBot 3D can learn the movement direction of each part under kinematic constraints and the situation where each point on the part is subject to the same force. The relative speed and relative direction of motion (velocity). Common household joint items are prismatic and revolute. For twitch parts, such as drawers, the movement direction and speed of each point on the drawer surface are the same when receiving the same external force. For rotating parts, such as doors, the direction of movement of each point on the door surface is the same when receiving the same external force, but the speed increases the further away from the rotation axis. The researchers used the physical laws in robotics (screw theory) to prove that the longest 3D Articulated Flow can maximize the acceleration of the object. According to Newton's second law, this strategy is the optimal solution.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

#Based on theoretical basis, in actual operation , what the robot needs to do is to predict the movement trajectory of each point through the vision module of FlowBot 3D. In each point trajectory, find the point corresponding to the longest 3D articulated flow direction as the manipulation point, and predict this manipulation in a closed loop The movement trajectory of the point. If the selected manipulation point cannot be successfully grasped (for example, the surface does not meet the grasping conditions of the robot hand), then FlowBot 3D will select the point with the second longest length that meets the grasping conditions.

In addition, due to the characteristics of PointNet, FlowBot 3D predicts the motion trajectory of each point and does not rely on the geometric characteristics of the object itself. It has a strong influence on the possible occlusion of the object by the robot. robustness. In addition, because this algorithm is closed-loop, the robot can correct its possible errors in the next step of prediction.

FlowBot 3D’s performance in the real worldFlowBot 3D has the ability to overcome generalization challenges in the real world. The design concept of FlowBot 3D is that as long as it can accurately predict the movement trajectory of the manipulated object 3D articulated flow, then the next step is to follow this trajectory to complete the task.

Another important point is that FlowBot 3D uses a single training model to manipulate multiple categories of items, including categories that have not been seen in training. And in the real world, the robot only needs to use the model obtained through this pure simulator training to manipulate a variety of real objects. Therefore, in the real world, since the kinematic constraints of household objects are overwhelmingly the same as in the simulator, FlowBot 3D can be directly generalized to the real world.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

Household items used by FlowBot3D in real-world experiments (including trash cans, refrigerators, toilet seats, boxes, safes, etc.

In the simulator, the robot is trained using some categories of household items, including staplers, bins, drawers, windows, refrigerators, etc. In the simulator and real-world tests, test data New objects from training categories and categories not seen during training.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

#FlowBot 3D manipulation tasks in the simulator.

In comparison, imitation-based simulations commonly seen in academia The learning method requires manual guidance to learn how to manipulate new objects, making it unrealistic for these robots to be implemented in the real world, especially in home robot scenarios. In addition, 3D point cloud data is stronger than the 2D RGB data used by other methods. Because point cloud allows the robot to understand each joint and the relationship between joints, it can understand and predict the movement trajectory of parts at a higher level, greatly enhancing generalization.

Experimental results show that FlowBot 3D can achieve a distance of less than 10% to "full open" when operating most objects (whether they are categories seen or not seen during training), and Success Ridge can reach More than 90. In comparison, other methods based on imitation learning (DAgger) or reinforcement learning (SAC) are far behind and lack generalization.

CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture

In short, FlowBot 3D is a job with great potential. It can be deployed efficiently in the real world without the need for fine-tuning. This work also shows that advances in computer vision can change the field of robotics, especially the visual expression of motion trajectories called 3D articulated flow, which can be applied to multiple tasks to simplify the robot strategy selection and decision-making process. With this generalizable expression, simulator learning methods will have the potential to be directly deployed in the real world, which will greatly reduce the cost of future home robot training and learning.

FlowBot 3D’s next planCurrently, the research team is trying to apply the flow understanding and prediction method to objects other than joint objects, such as how to use flow Predict object trajectories with 6 degrees of freedom. At the same time, the author is trying to use flow as a general visual expression to apply it to other robot learning tasks, such as reinforcement learning, thereby increasing learning efficiency, robustness, and generalizability.

Associate Professor David Held’s homepage: https://davheld.github.io/Ben Eisner’s homepage: https://beisner.me/Harry Zhang’s homepage: https:// harryzhangog.github.io/

The above is the detailed content of CMU publishes new dexterous robot algorithm that accurately learns how to operate everyday furniture. For more information, please follow other related articles on the PHP Chinese website!

Statement

This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete

Are You At Risk Of AI Agency Decay? Take The Test To Find OutApr 21, 2025 am 11:31 AM

This article explores the growing concern of "AI agency decay"—the gradual decline in our ability to think and decide independently. This is especially crucial for business leaders navigating the increasingly automated world while retainin

How to Build an AI Agent from Scratch? - Analytics VidhyaApr 21, 2025 am 11:30 AM

Ever wondered how AI agents like Siri and Alexa work? These intelligent systems are becoming more important in our daily lives. This article introduces the ReAct pattern, a method that enhances AI agents by combining reasoning an

Revisiting The Humanities In The Age Of AIApr 21, 2025 am 11:28 AM

"I think AI tools are changing the learning opportunities for college students. We believe in developing students in core courses, but more and more people also want to get a perspective of computational and statistical thinking," said University of Chicago President Paul Alivisatos in an interview with Deloitte Nitin Mittal at the Davos Forum in January. He believes that people will have to become creators and co-creators of AI, which means that learning and other aspects need to adapt to some major changes. Digital intelligence and critical thinking Professor Alexa Joubin of George Washington University described artificial intelligence as a “heuristic tool” in the humanities and explores how it changes

Understanding LangChain Agent FrameworkApr 21, 2025 am 11:25 AM

LangChain is a powerful toolkit for building sophisticated AI applications. Its agent architecture is particularly noteworthy, allowing developers to create intelligent systems capable of independent reasoning, decision-making, and action. This expl

What are the Radial Basis Functions Neural Networks?Apr 21, 2025 am 11:13 AM

Radial Basis Function Neural Networks (RBFNNs): A Comprehensive Guide Radial Basis Function Neural Networks (RBFNNs) are a powerful type of neural network architecture that leverages radial basis functions for activation. Their unique structure make

The Meshing Of Minds And Machines Has ArrivedApr 21, 2025 am 11:11 AM

Brain-computer interfaces (BCIs) directly link the brain to external devices, translating brain impulses into actions without physical movement. This technology utilizes implanted sensors to capture brain signals, converting them into digital comman

Insights on spaCy, Prodigy and Generative AI from Ines MontaniApr 21, 2025 am 11:01 AM

This "Leading with Data" episode features Ines Montani, co-founder and CEO of Explosion AI, and co-developer of spaCy and Prodigy. Ines offers expert insights into the evolution of these tools, Explosion's unique business model, and the tr

A Guide to Building Agentic RAG Systems with LangGraphApr 21, 2025 am 11:00 AM

This article explores Retrieval Augmented Generation (RAG) systems and how AI agents can enhance their capabilities. Traditional RAG systems, while useful for leveraging custom enterprise data, suffer from limitations such as a lack of real-time dat

See all articles

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Assassin's Creed Shadows: Seashell Riddle Solution

3 weeks agoByDDD

What's New in Windows 11 KB5054979 & How to Fix Update Issues

2 weeks agoByDDD

Where to find the Crane Control Keycard in Atomfall

3 weeks agoByDDD

Assassin's Creed Shadows - How To Find The Blacksmith And Unlock Weapon And Armour Customisation

1 months agoByDDD

Roblox: Dead Rails - How To Complete Every Challenge

3 weeks agoByDDD

Hot Tools

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),