search
HomeTechnology peripheralsAIWhile I was still chatting with ChatGPT, someone had already started using it to control the robot's work.

Have you ever wanted to tell a robot what to do with your own words, like you would a human?

For example, just tell your home assistant robot "Please heat up my lunch" and it will find the microwave on its own. Amazing, right?

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

Although language is the most intuitive way for humans to express their intentions, for a long time, people still relied heavily on handwritten code to implement robots. control. However, when ChatGPT appears, this situation will change.

In a recent study, a Microsoft team is exploring how to use OpenAI’s new AI language model ChatGPT to make natural human-computer interaction possible.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

Paper link: https://www.microsoft.com/en-us/research/uploads/prod/ 2023/02/ChatGPT___Robotics.pdf

ChatGPT is a language model trained on a large corpus of text and human interactions, so it can generate coherent and grammatical responses to a wide range of prompts and questions. Correct response. The goal of this research is to see if ChatGPT can think beyond text and reason about the real world to help robots complete tasks. Researchers hope this will make it easier for people to interact with robots without having to learn complex programming languages ​​or the details of robotic systems.

The key challenge of the research is to teach ChatGPT how to solve problems by taking into account the laws of physics, the operating environment, and the way the robot uses body movements to change its surroundings.

It turns out that ChatGPT can do a lot on its own, but it still needs some help. In the paper, the team describes a series of design principles that can be used to guide language models in solving robotic tasks, including (but not limited to) ad hoc prompt structures, high-level APIs, and feedback via human text. The researchers believe this work is just the beginning of a transformation in developing robotic systems and hope this study will inspire other researchers to join this interesting research area.

Challenges faced by today’s robotics technology and the help ChatGPT can provide

The current robot operation process is from Starting with an engineer or technical user, they are required to translate task requirements into system code. Engineers will be in the loop of the workflow, constantly writing new code and specifications to correct the robot's behavior. Overall, the process is slow (users need to write low-level code), expensive (requires highly skilled users with in-depth knowledge of robotics), and inefficient (requires multiple interactions to function properly).

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

But ChatGPT opens up a new bot paradigm and allows for potentially non-technical Users participate in the loop, monitoring the robot's performance while providing high-level feedback to the large language model (LLM). By following the researched design principles, ChatGPT can generate code for robotic scenarios. Without any fine-tuning, the study exploits the knowledge of LLM to control different robot shapes for various tasks. In their work, the researchers demonstrated multiple examples of ChatGPT solving robotic challenges, as well as complex robot deployments in the operational, aerial, and navigation domains.

Bots and ChatGPT: Design Principles

Prompting LLM is a highly empirical science. The research established a set of methods and design principles for writing prompts for robot tasks through trial and error:

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

  • First, the study defines a set of high-level robot APIs or function libraries. This library can be specific to a particular robot and map to an existing low-level implementation in the robot's control stack or perception library. It is important to use descriptive names for high-level APIs so that ChatGPT can infer their behavior;
  • Next, the researchers wrote a text prompt for ChatGPT that described the task goal while explicitly stating the Which functions are available. Prompts can also include information about task constraints, or how ChatGPT should form its answers (a specific encoding language, using auxiliary parsing elements);
  • The user evaluates ChatGPT's code output in the loop, either by direct inspection, Or use an emulator. If desired, users can provide feedback to ChatGPT on the quality and safety of their answers using natural language.

If the user is satisfied with the solution, the code can finally be deployed to the robot.

Enough with the theory, what exactly can ChatGPT do?

Let’s look at a few examples, you can find more case studies in the code repository.

Zero-Sample Mission Plan​

Researchers let ChatGPT control real drones, and it proved to be the most effective among non-technical users and robots A very intuitive language-based interface. When user instructions are ambiguous, ChatGPT asks clarifying questions and writes complex code structures for the drone to visually inspect the architecture, such as zigzag patterns. It even learned to take selfies!

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

Researchers ChatGPT was also used in a simulated industrial inspection scenario using the Microsoft AirSim simulator. The model is able to effectively parse the user's high-level intentions and geometric cues to accurately control the drone.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

##In the loop Users of: When complex tasks require dialogue​

Next, the researchers used ChatGPT for robotic arm operation scenarios and used conversational feedback to teach the model how to convert the initially provided API composition into more complex high-level functions: ChatGPT automated programming. The model is able to logically link learned skills together using curriculum-based strategies to perform actions such as stacking blocks.

Additionally, this mockup shows a perfect example of connecting the textual and physical domains when building the Microsoft logo out of wooden blocks. Not only is it able to recall the logo from an internal knowledge base, it is also able to "draw" the logo (as SVG code) and then use the skills learned above to figure out which of the existing robot's movements can make up its appearance.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

Next, the researchers asked ChatGPT to write an algorithm that would allow the drone to reach Aim in the air without hitting obstacles. They told the model that the drone had a forward-facing range sensor, and ChatGPT immediately programmed most of the key building blocks for the algorithm. This task requires some conversation with a human, and ChatGPT's ability to make localized code improvements using only linguistic feedback is impressive.

Perception-Action Loop: The robot perceives the world before acting

The ability to perceive the world (perception) before doing something (action) It is the basis of any robotic system. So the researchers decided to test ChatGPT's understanding of this concept and asked it to explore an environment until it found a user-specified object. The study provides the model with features such as object detection and object distance APIs, and verifies that the code it generates successfully implements the sense-action loop.

During the experimental phase, the researchers conducted additional experiments to evaluate whether ChatGPT could make real-time decisions about where the robot should go based on sensor feedback (rather than having ChatGPT generate a code loop to make the decision) these decisions). Interestingly, it happened to verify that a textual description of the camera image could be fed into each step of the conversation, and the model was able to figure out how to control the robot until it reached a specific object.

PromptCraft, LLM Collaborative open source tool for robotics research

Good Prompt engineering is crucial to the success of large language models, such as ChatGPT for robotic tasks. Unfortunately, Prompt is an empirical science, and there is a lack of comprehensive and accessible resources, including a mixed bag of examples to help researchers and enthusiasts in the field. To bridge this gap, the researchers introduced "PromptCraft", a collaborative open source platform where anyone can share examples of Prompt strategies for different robot categories, and the researchers published all the tools used in this study. Prompt and dialogue.

In addition to the Prompt design, the research hopes to include multiple bot simulators and interfaces that allow users to test their ChatGPT-generated algorithms. As a start, the research has also released an AirSim environment integrated with ChatGPT that anyone can use to develop these ideas.

While I was still chatting with ChatGPT, someone had already started using it to control the robots work.

ChatGPT-AirSim interface

Let robots go out of the laboratory and into the world

The release of these technologies is something worth celebrating, Because this will expand the audience for robotics. Microsoft researchers believe that language-based robot control will lay the foundation for bringing robots from scientific laboratories into the lives of everyday users.

This article would like to emphasize that the output of ChatGPT is not meant to be deployed directly on a robot without careful analysis. The researchers encourage users to harness the power of simulation in order to evaluate these algorithms before potential real-life deployment and always take necessary safety precautions. The work described in this article represents only a small portion of what is possible at the intersection of large-scale language models operating in the field of robotics, and hopefully provides inspiration for more research.

Original link: https://www.microsoft.com/en-us/research/group/autonomous-systems-group-robotics/articles/chatgpt-for-robotics/

The above is the detailed content of While I was still chatting with ChatGPT, someone had already started using it to control the robot's work.. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Word文本框没有旋转按钮怎么办Word文本框没有旋转按钮怎么办Dec 08, 2022 am 09:50 AM

Word文本框没有旋转按钮的解决办法:打开兼容模式文档后按F12键另存为高版本,再打开就可以了。

令人惊艳的4个ChatGPT项目,开源了!令人惊艳的4个ChatGPT项目,开源了!Mar 30, 2023 pm 02:11 PM

自从 ChatGPT、Stable Diffusion 发布以来,各种相关开源项目百花齐放,着实让人应接不暇。今天,着重挑选几个优质的开源项目分享给大家,对我们的日常工作、学习生活,都会有很大的帮助。

Word文档拆分后的子文档字体格式变了怎么办Word文档拆分后的子文档字体格式变了怎么办Feb 07, 2023 am 11:40 AM

Word文档拆分后的子文档字体格式变了的解决办法:1、在大纲模式拆分文档前,先选中正文内容创建一个新的样式,给样式取一个与众不同的名字;2、选中第二段正文内容,通过选择相似文本的功能将剩余正文内容全部设置为新建样式格式;3、进入大纲模式进行文档拆分,操作完成后打开子文档,正文字体格式就是拆分前新建的样式内容。

学术专用版ChatGPT火了,一键完成论文润色、代码解释、报告生成学术专用版ChatGPT火了,一键完成论文润色、代码解释、报告生成Apr 04, 2023 pm 01:05 PM

用 ChatGPT 辅助写论文这件事,越来越靠谱了。 ChatGPT 发布以来,各个领域的从业者都在探索 ChatGPT 的应用前景,挖掘它的潜力。其中,学术文本的理解与编辑是一种极具挑战性的应用场景,因为学术文本需要较高的专业性、严谨性等,有时还需要处理公式、代码、图谱等特殊的内容格式。现在,一个名为「ChatGPT 学术优化(chatgpt_academic)」的新项目在 GitHub 上爆火,上线几天就在 GitHub 上狂揽上万 Star。项目地址:https://github.com/

30行Python代码就可以调用ChatGPT API总结论文的主要内容30行Python代码就可以调用ChatGPT API总结论文的主要内容Apr 04, 2023 pm 12:05 PM

阅读论文可以说是我们的日常工作之一,论文的数量太多,我们如何快速阅读归纳呢?自从ChatGPT出现以后,有很多阅读论文的服务可以使用。其实使用ChatGPT API非常简单,我们只用30行python代码就可以在本地搭建一个自己的应用。 阅读论文可以说是我们的日常工作之一,论文的数量太多,我们如何快速阅读归纳呢?自从ChatGPT出现以后,有很多阅读论文的服务可以使用。其实使用ChatGPT API非常简单,我们只用30行python代码就可以在本地搭建一个自己的应用。使用 Python 和 C

vscode配置中文插件,带你无需注册体验ChatGPT!vscode配置中文插件,带你无需注册体验ChatGPT!Dec 16, 2022 pm 07:51 PM

​面对一夜爆火的 ChatGPT ,我最终也没抵得住诱惑,决定体验一下,不过这玩意要注册需要外国手机号以及科学上网,将许多人拦在门外,本篇博客将体验当下爆火的 ChatGPT 以及无需注册和科学上网,拿来即用的 ChatGPT 使用攻略,快来试试吧!

用ChatGPT秒建大模型!OpenAI全新插件杀疯了,接入代码解释器一键get用ChatGPT秒建大模型!OpenAI全新插件杀疯了,接入代码解释器一键getApr 04, 2023 am 11:30 AM

ChatGPT可以联网后,OpenAI还火速介绍了一款代码生成器,在这个插件的加持下,ChatGPT甚至可以自己生成机器学习模型了。 ​上周五,OpenAI刚刚宣布了惊爆的消息,ChatGPT可以联网,接入第三方插件了!而除了第三方插件,OpenAI也介绍了一款自家的插件「代码解释器」,并给出了几个特别的用例:解决定量和定性的数学问题;进行数据分析和可视化;快速转换文件格式。此外,Greg Brockman演示了ChatGPT还可以对上传视频文件进行处理。而一位叫Andrew Mayne的畅销作

ChatGPT教我学习PHP中AOP的实现(附代码)ChatGPT教我学习PHP中AOP的实现(附代码)Mar 30, 2023 am 10:45 AM

本篇文章给大家带来了关于php的相关知识,其中主要介绍了我是怎么用ChatGPT学习PHP中AOP的实现,感兴趣的朋友下面一起来看一下吧,希望对大家有帮助。

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
2 weeks agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

EditPlus Chinese cracked version

EditPlus Chinese cracked version

Small size, syntax highlighting, does not support code prompt function

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

DVWA

DVWA

Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software