
The "secret" of robot implementation: continuous learning, knowledge transfer and autonomous participation

王林 | 2023-04-12


This article is reproduced from Lei Feng.com. If you need to reprint, please go to the official website of Lei Feng.com to apply for authorization.

On May 23, 2022, ICRA 2022 (IEEE International Conference on Robotics and Automation), the annual top international conference in the field of robotics, was held as scheduled in Philadelphia, USA.

This was the 39th edition of ICRA, the flagship conference of the IEEE Robotics and Automation Society and the primary international forum for robotics researchers to present and discuss their work.

At this year's ICRA, three of Amazon's leading robotics experts, Sidd Srinivasa, Tye Brady and Philipp Michel, briefly discussed the challenges of building robotic systems that interact with humans in the real world.

Note: From left to right are Sidd Srinivasa, director of artificial intelligence for Amazon Robotics; Tye Brady, chief technologist of Amazon Robotics (global); and Philipp Michel, senior manager of applied science at Amazon Scout.

Sidd Srinivasa is a world-renowned robotics expert, an IEEE Fellow, currently the Boeing Endowed Professor at the University of Washington, and the leader of Amazon Robotics' AI work. He is responsible for the algorithms behind the autonomous robots that assist Amazon fulfillment center employees, including robots that pick and package products and cart-style robots that can autonomously lift, unload, and transport goods.

Tye Brady is the chief technologist of Amazon Robotics (global) and holds a master's degree in aerospace engineering from MIT. Philipp Michel, like Sidd Srinivasa a doctoral alumnus of the CMU Robotics Institute, is a senior manager of applied science on Amazon's Scout robot project.

They each offered their views on the challenges of deploying robots in the real world. AI Technology Review has compiled the conversation below without changing its original meaning:

Q: Your research in the field of robotics solves different problems. What are the similarities between these problems?

Sidd Srinivasa: An important difficulty in robotics research is that we live in an open world: we often don't even know what "inputs" we are about to face. In our fulfillment centers, I have more than 20 million items to deal with, and hundreds more are added every day. Most of the time our robots don't know what the item they are picking up is, yet they need to pick it up carefully and package it quickly without damaging it.

Philipp Michel: For Scout, the difficulty lies in the objects encountered on the sidewalk and the environments the robot has to travel through. We have delivery operations deployed in four states across the United States. Weather conditions, lighting conditions... it was clear from the beginning that we had to handle a large number of variables for the robot to adapt to such complex environments.

Tye Brady: In developing our fulfillment robots, we have a significant advantage: we operate in a semi-structured environment. We can set our own traffic rules for the robots, and understanding the environment really helps our scientists and engineers gain a deep understanding of the objects we need to move, manipulate, classify, and identify in order to fulfill orders. In other words, we get to pursue the technology in the real world.

Philipp Michel: Another thing we have in common is that we rely heavily on learning from data to solve problems. Scout collects real-world data as it performs its tasks, and we then iteratively develop machine learning solutions for perception, localization, and navigation.

Sidd Srinivasa: I completely agree (on learning from data to solve problems). I think machine learning and adaptive control are the key to super-linear scaling. If we deploy thousands of robots, we can't have thousands of scientists and engineers working on them; we need to rely on real-world data to achieve that super-linear growth.

In addition, I think the open world forces us to think about how to do "continuous learning". Our machine learning models are trained on a particular input data distribution, but because this is an open world we run into "covariate shift": the data we see in deployment no longer matches the distribution the model was trained on, which often makes machine learning models unreasonably overconfident.

Therefore, a lot of our work goes into building "watchdogs" (supervisory components) that recognize when the input data distribution has drifted away from the distribution the model was trained on. We then perform "importance sampling" so that we can pick out the data that has changed and retrain the machine learning model.
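To make this concrete, here is a minimal Python sketch of such a "watchdog", assuming the perception stack exposes a feature embedding for each input. The class name, the Mahalanobis-distance score, the KS-test threshold, and the select-the-top-k heuristic are all illustrative choices, not Amazon's actual pipeline.

```python
"""Toy covariate-shift 'watchdog': flag when incoming data drifts away from the
training distribution and pick the most out-of-distribution samples for retraining.
All names and thresholds here are illustrative."""
import numpy as np
from scipy.stats import ks_2samp

class DriftWatchdog:
    def __init__(self, train_embeddings: np.ndarray, p_threshold: float = 0.01):
        # Summary statistics of the training-time feature distribution.
        self.mean = train_embeddings.mean(axis=0)
        self.cov_inv = np.linalg.pinv(np.cov(train_embeddings, rowvar=False))
        # 1-D reference: distances of training points to their own mean.
        self.ref_dist = self._mahalanobis(train_embeddings)
        self.p_threshold = p_threshold

    def _mahalanobis(self, x: np.ndarray) -> np.ndarray:
        d = x - self.mean
        return np.sqrt(np.einsum("ij,jk,ik->i", d, self.cov_inv, d))

    def check(self, batch_embeddings: np.ndarray):
        """Return (drift_detected, per-sample scores) for an incoming batch."""
        scores = self._mahalanobis(batch_embeddings)
        # Two-sample KS test: has the distance distribution shifted?
        _, p_value = ks_2samp(self.ref_dist, scores)
        return p_value < self.p_threshold, scores

    def select_for_retraining(self, batch_embeddings: np.ndarray, k: int = 32):
        """Crude 'importance sampling': keep the k most out-of-distribution samples."""
        _, scores = self.check(batch_embeddings)
        return np.argsort(scores)[-k:]

# Usage sketch with synthetic data
rng = np.random.default_rng(0)
train = rng.normal(size=(5000, 16))               # stand-in for training embeddings
shifted = rng.normal(loc=1.5, size=(256, 16))     # stand-in for drifted field data
watchdog = DriftWatchdog(train)
drifted, _ = watchdog.check(shifted)
if drifted:
    idx = watchdog.select_for_retraining(shifted)  # send these for labeling/retraining
```

In practice, the flagged samples would be labeled and folded back into the training set, closing the continuous-learning loop Srinivasa describes.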

Philipp Michel: This is one of the reasons we want to operate the robot in many different places: it lets us find out early what real-world data the robot may encounter, which in turn forces us to develop solutions that handle that new data.

Sidd Srinivasa: That is indeed a good idea. One of the advantages of having many robots is the system's ability to recognize what has changed, retrain, and then share that knowledge with the other robots.

Think of a story about a sorting robot: somewhere in the world, a robot encounters a new packaging type. At first it struggles because it has never seen anything like it before and cannot recognize it. Once a solution is found, that robot can transmit the new packaging type to all the other robots in the fleet, so that when this packaging type appears somewhere else, the other robots already know what to do with it. It is like having a "backup": when new data appears at one point, the other points learn about it, because the system can retrain itself and share the information.

Philipp Michel: Our robots do something similar. If they encounter new obstacles they haven't seen before, we try to adapt the model to recognize and handle those obstacles, and then deploy the new model to all the robots.
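The "retrain once, deploy everywhere" pattern both speakers describe can be sketched as a simple fleet-update loop. Everything below (the class names, the set-of-known-items stand-in for a model, the version counter) is hypothetical and only illustrates the flow: one robot reports a novel input, a central service retrains, and the update is broadcast to the whole fleet.

```python
"""Illustrative fleet-update loop; all class and method names are hypothetical."""
from dataclasses import dataclass, field

@dataclass
class Robot:
    robot_id: str
    model_version: int = 0

    def recognizes(self, item: str, known_items: set) -> bool:
        return item in known_items

@dataclass
class FleetServer:
    known_items: set = field(default_factory=set)
    model_version: int = 0
    fleet: list = field(default_factory=list)

    def report_novel(self, item: str):
        """A robot reports an item its current model cannot handle."""
        self.known_items.add(item)    # stand-in for collecting labeled data
        self.model_version += 1       # stand-in for retraining the model
        self.broadcast()

    def broadcast(self):
        for robot in self.fleet:
            robot.model_version = self.model_version  # stand-in for deploying new weights

# Usage sketch
server = FleetServer(known_items={"box", "envelope"})
server.fleet = [Robot(f"robot-{i}") for i in range(3)]

item = "padded mailer"                        # a packaging type no robot has seen yet
if not server.fleet[0].recognizes(item, server.known_items):
    server.report_novel(item)                 # retrain once, share with the whole fleet

assert all(r.model_version == server.model_version for r in server.fleet)
```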

One of the things that keeps me up at night is the thought that our robots will encounter objects on the sidewalk that they won't see again for the next three years. For example, someone puts a gargoyle on the sidewalk to decorate their lawn for Halloween, or someone places an umbrella on a picnic table so that it no longer looks like a "picnic table". In cases like that, all of our machine learning algorithms fail to recognize that it is a picnic table.

So part of our research is about how to balance common things we don't need to agonize over against specific categories of things we do. If something is an open manhole cover, the robot must be good at identifying it, otherwise it will fall in. But if it's just a random box, we probably don't need to know exactly what kind of box it is, only that it's an object we want to drive around.
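A minimal sketch of the policy Michel describes: a short list of safety-critical categories gets fine-grained treatment, while everything else collapses into a generic "obstacle to avoid". The category names, confidence threshold, and action strings below are assumptions for illustration, not Scout's actual perception interface.

```python
"""Toy reaction policy: fine-grained recognition only for safety-critical classes."""
from typing import NamedTuple

SAFETY_CRITICAL = {"open_manhole", "stairs_dropoff", "moving_vehicle"}

class Detection(NamedTuple):
    label: str        # raw detector output, e.g. "picnic_table", "box", "open_manhole"
    confidence: float

def plan_reaction(detections: list[Detection], conf_threshold: float = 0.5) -> list[str]:
    actions = []
    for det in detections:
        if det.label in SAFETY_CRITICAL and det.confidence >= conf_threshold:
            # Safety-critical: the specific class matters (stop, reroute, ask for help).
            actions.append(f"STOP_AND_REROUTE ({det.label})")
        elif det.confidence >= conf_threshold:
            # Everything else: the exact class is irrelevant, just avoid the object.
            actions.append("AVOID (generic obstacle)")
        else:
            # Low confidence: slow down rather than guess a fine-grained class.
            actions.append("SLOW_DOWN (uncertain object)")
    return actions

print(plan_reaction([Detection("halloween_gargoyle", 0.9),
                     Detection("open_manhole", 0.8)]))
# ['AVOID (generic obstacle)', 'STOP_AND_REROUTE (open_manhole)']
```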

Sidd Srinivasa: Another challenge is that when you change your model, there can be unintended consequences. A changed model may not affect the robot's perception, but it may change the way the robot "brakes", causing the ball bearings to wear out after two months. For end-to-end systems, a lot of interesting future research is about understanding how a change in one part of the system affects the performance of the whole system.

Philipp Michel: We spend a lot of time thinking about whether and how we should split up the different parts of the robot's stack. Tighter integration between the parts can bring many benefits, but it also has limits. One extreme is learning directly from camera input to motor torques, which is very challenging in any real-world robotics application. At the other extreme is the traditional robotics stack, neatly divided into pieces such as localization, perception, planning, and control.

We also spend a lot of time thinking about how the stack should evolve over time and what performance gains come from bringing these pieces closer together. At the same time, we want a system that stays as interpretable as possible, so we try to make the most of learned components across the stack while preserving interpretability and the safety features.
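For reference, the "traditional" modular stack Michel contrasts with end-to-end learning can be sketched with explicit, inspectable interfaces between stages; any stage (for example, perception) can be swapped for a learned model without giving up interpretable boundaries. All of the types and method names below are illustrative.

```python
"""Sketch of a modular robotics stack with explicit interfaces between stages."""
from dataclasses import dataclass
from typing import Protocol

@dataclass
class Pose:            # where the robot thinks it is
    x: float
    y: float
    heading: float

@dataclass
class Obstacle:        # what perception thinks is in the way
    x: float
    y: float
    radius: float

class Localizer(Protocol):
    def estimate(self, sensor_data: dict) -> Pose: ...

class Perception(Protocol):
    def detect(self, sensor_data: dict) -> list[Obstacle]: ...

class Planner(Protocol):
    def plan(self, pose: Pose, obstacles: list[Obstacle], goal: Pose) -> list[Pose]: ...

class Controller(Protocol):
    def follow(self, path: list[Pose], pose: Pose) -> tuple[float, float]: ...

def control_cycle(loc: Localizer, per: Perception, pla: Planner, con: Controller,
                  sensor_data: dict, goal: Pose) -> tuple[float, float]:
    """One control cycle; every intermediate value can be logged and inspected."""
    pose = loc.estimate(sensor_data)        # localization
    obstacles = per.detect(sensor_data)     # perception (a learned model could live here)
    path = pla.plan(pose, obstacles, goal)  # planning
    return con.follow(path, pose)           # control: returns (linear, angular) velocity
```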

Sidd Srinivasa: That's a great point, and I completely agree with Philipp. One model to rule them all may not be the right answer. But often we end up building machine learning models that share a backbone with multiple task-specific heads. What is an object, and what does it mean to segment an object? The task might be picking, stacking, or packing, and each requires a specialized head riding on a backbone that is shared across tasks.

Philipp Michel: Some of the constraints we deal with are battery, range, temperature, space, and compute. So we need to be efficient with our models, optimize them, and take advantage of a shared backbone as much as possible, with, as Sidd mentioned, different heads for different tasks.
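Here is a minimal PyTorch-style sketch of the shared-backbone, multi-head design both speakers mention: one feature extractor runs once per image, and lightweight task-specific heads reuse its output, which saves compute on a resource-constrained robot. The head names ("pick", "stack", "pack") and layer sizes are placeholders, not Amazon's actual architecture.

```python
"""Shared backbone with multiple task-specific heads (illustrative sizes)."""
import torch
import torch.nn as nn

class SharedBackboneModel(nn.Module):
    def __init__(self, num_classes_per_head: dict):
        super().__init__()
        # Shared backbone: computed once per image, reused by every head.
        self.backbone = nn.Sequential(
            nn.Conv2d(3, 32, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        # One lightweight head per task, all reading the same 64-dim feature.
        self.heads = nn.ModuleDict({
            task: nn.Linear(64, n_out) for task, n_out in num_classes_per_head.items()
        })

    def forward(self, images: torch.Tensor) -> dict:
        features = self.backbone(images)                  # run the backbone once
        return {task: head(features) for task, head in self.heads.items()}

# Usage sketch
model = SharedBackboneModel({"pick": 10, "stack": 5, "pack": 3})
outputs = model(torch.randn(2, 3, 128, 128))
print({task: out.shape for task, out in outputs.items()})
```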


Caption: Amazon Scout is an autonomous delivery robot that can move at walking speed on public sidewalks and is currently undergoing field testing in four states in the United States.

Q: When I asked about the commonalities between your projects, one thing that came to mind is that your robots all work in the same environment as humans. Why does this complicate the issue?

Sidd Srinivasa: Robots are moving ever closer to human life, and we must respect all the complex interactions that take place in the human world. Beyond walking, driving, and performing tasks, there are complex social interactions. What's important for a robot is, first, to be aware of them and, second, to participate in them.

It's really hard. When you're driving, it is sometimes hard to tell what other people are thinking and to decide how to act based on what they're thinking. Just reasoning about the problem is hard, and closing the loop is even harder.

If a robot is playing chess against a human, it's much easier to predict what the opponent is going to do, because the rules are clearly laid out. If you assume your opponent plays optimally, you will do well even if they are actually suboptimal; some two-player games guarantee this.

But the real world is not like that. Even in cooperative games designed to be win-win, we find that it is actually hard to predict accurately during the game, even when the collaborators have good intentions.

Philipp Michel: And behavior in the human world varies enormously. Some pets completely ignore the robot, while others walk right up to it. The same goes for pedestrians: some turn a blind eye to the robot, others walk straight toward it. Children in particular are extremely curious and highly interactive. We need to be able to handle all of these situations safely, and that variability is what makes it exciting.

