


Some netizens questioned whether Microsoft is building Skynet because ChatGPT can already control robots without engineers needing to write code.
While I was still bragging and chatting with ChatGPT, someone was already using it to control the robot.
is none other than OpenAI’s sponsor dad, who just recently “reinvented the search engine” with ChatGPT Microsoft.
So far, the technical threshold for developers to train robots is not only high, but also has a long road ahead:
Engineers need to be at work In the process loop, new codes and specifications are constantly written by hand to correct the robot's behavior; in addition, different programming languages and environments may be required to control different robots.
With the help of ChatGPT, engineers don’t even need to write code by hand - they directly use human words to describe what they want to do, AI can automatically translate into machine language.
This means that on the one hand, the efficiency of interaction between professionals and robots has taken off; on the other hand, the technical threshold has also been greatly reduced, making it easier for laymen to You can even participate in debugging and create more usage methods.
A simple example: let drones automatically inspect shelves.
First, the operator only needs to make a request to ChatGPT in natural language; then, AI can automatically translate it into code and direct the drone's actions. (You can also specify the flight path of the drone.)
No wonder Tesla’s former AI director Andrej Karpathy made fun of it :
The latest popular programming language is English.
One AI commands multiple robots
In fact, ChatGPT can do a lot of tricks.
For example, an operator says to the AI: "I'm thirsty, please help me find something to drink."
At this time, the AI will not go straight to find water. Instead, he will ask smartly:
What kind of drink do you want to drink? There are several drinks here, such as coconut water, cola, etc.
Of course the operator is not a vegetarian. He did not directly tell the AI which one to choose, but said: "I just came from the gym. Come back, please help me find a healthier drink."
Then the more magical operation began:
The AI first guessed that he wanted to drink coconut water, and then wrote a paragraph on its own. Code (even with comments) :
After writing, direct the drone to find coconut water:
In addition to drones, ChatGPT can also easily control other small robots, including cameras, robotic arms, etc.
For example, let the camera find things in the room that can heat lunch.
There is also a command robot arm to spell out a Microsoft logo. (Secretly carrying private goods)
Seeing this, some netizens were enlightened and asked:
Are they building the all-powerful Skynet?
Some people even joked that the AI may even be able to write instructions for launching a nuclear bomb:
But having said that, it is actually far from what netizens said. After all, humans are still needed to participate.
How to achieve it?
As can be seen from the previous article, this flexible AI not only communicates smoothly with people, but can also communicate quickly with machines.
This is mainly due to a series of API and advanced function libraries specially developed by the Microsoft team.
They did not let the large language model (LLM) behind ChatGPT generate a fixed type of code; because the robot is a Diverse domains, which may involve a lot of fine-tuning in different scenarios.
Under the novel operating framework, different robots have their own corresponding specific function libraries.
——An AI can adapt to different objects and different tasks.
On the one hand, these function libraries can be connected to the robot control system to manage the underlying hardware, as well as the code and function modules that perform basic movements.
On the other hand, in order for ChatGPT to follow the rules of the function library, predefined function naming is crucial. Clear function names can establish good functional connections between APIs and ultimately generate high-quality answers.
One of the requirements is that all API names must describe the overall functional behavior. For example, the detect_object(object_name) function can be linked internally to an OpenCV function or computer vision model.
After designing the library and API, Microsoft wrote a text prompt (prompt) for ChatGPT, describing the target task and clearly stating which functions in the function library are available; in addition, this can Specifies which programming language ChatGPT uses to generate code.
It is worth mentioning that the effect of AI-generated content is positively correlated with the quality of human prompts. To this end, Microsoft has also developed a collaborative open source platform PromptCraft, where anyone can share Prompt strategies for different types of robots.
At this point, the behind-the-scenes deployment is basically completed, and then the user can indirectly control the robot by "speaking human words".
If you want to check whether there are bugs in the code generated by AI, you can check it directly in the chat box at any time, or test it through the simulator. Humans can use natural language to guide the AI to make corrections.
In addition, you can wait until the user is satisfied with the solution before deploying the ChatGPT generated code to the robot.
Finally, if it were you, what would you want to do using ChatGPT to control the robot?
Paper address:https://www.microsoft.com/en-us/research/uploads/prod/2023/02/ChatGPT___Robotics.pdfReference link:
[3] https://github.com/microsoft/PromptCraft-Robotics#promptcraft-robotics
The above is the detailed content of Some netizens questioned whether Microsoft is building Skynet because ChatGPT can already control robots without engineers needing to write code.. For more information, please follow other related articles on the PHP Chinese website!

Since 2008, I've championed the shared-ride van—initially dubbed the "robotjitney," later the "vansit"—as the future of urban transportation. I foresee these vehicles as the 21st century's next-generation transit solution, surpas

Revolutionizing the Checkout Experience Sam's Club's innovative "Just Go" system builds on its existing AI-powered "Scan & Go" technology, allowing members to scan purchases via the Sam's Club app during their shopping trip.

Nvidia's Enhanced Predictability and New Product Lineup at GTC 2025 Nvidia, a key player in AI infrastructure, is focusing on increased predictability for its clients. This involves consistent product delivery, meeting performance expectations, and

Google's Gemma 2: A Powerful, Efficient Language Model Google's Gemma family of language models, celebrated for efficiency and performance, has expanded with the arrival of Gemma 2. This latest release comprises two models: a 27-billion parameter ver

This Leading with Data episode features Dr. Kirk Borne, a leading data scientist, astrophysicist, and TEDx speaker. A renowned expert in big data, AI, and machine learning, Dr. Borne offers invaluable insights into the current state and future traje

There were some very insightful perspectives in this speech—background information about engineering that showed us why artificial intelligence is so good at supporting people’s physical exercise. I will outline a core idea from each contributor’s perspective to demonstrate three design aspects that are an important part of our exploration of the application of artificial intelligence in sports. Edge devices and raw personal data This idea about artificial intelligence actually contains two components—one related to where we place large language models and the other is related to the differences between our human language and the language that our vital signs “express” when measured in real time. Alexander Amini knows a lot about running and tennis, but he still

Caterpillar's Chief Information Officer and Senior Vice President of IT, Jamie Engstrom, leads a global team of over 2,200 IT professionals across 28 countries. With 26 years at Caterpillar, including four and a half years in her current role, Engst

Google Photos' New Ultra HDR Tool: A Quick Guide Enhance your photos with Google Photos' new Ultra HDR tool, transforming standard images into vibrant, high-dynamic-range masterpieces. Ideal for social media, this tool boosts the impact of any photo,


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Atom editor mac version download
The most popular open source editor

SublimeText3 Linux new version
SublimeText3 Linux latest version

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Zend Studio 13.0.1
Powerful PHP integrated development environment

SecLists
SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.