search
HomeTechnology peripheralsAIResearchers develop robot that can understand English commands and perform household chores

A team of researchers from Princeton University, Stanford University, and Google used OpenAI’s GPT-3 Davinci model to develop a robot named TidyBot that can understand English instructions and perform household chores. This robot can automatically complete tasks such as sorting laundry, picking up garbage on the floor, and picking up toys according to the user's preferences.

Researchers develop robot that can understand English commands and perform household chores

The GPT-3 Davinci model is a deep learning model, part of the GPT model family, that can understand and generate natural language. The model has powerful summarization capabilities and can learn complex object attributes and relationships from large amounts of text data. The researchers used this ability to have the robot place objects based on several example objects provided by the user, such as "yellow shirt in the drawer, dark purple shirt in the closet, white socks in the drawer" and then let the model conclude The user's general preference rules and apply them to future interactions.

The researchers wrote in the paper: "Our basic insight is that the summarization capabilities of LLM (Large Language Model) are a good match for the generalization needs of personalized robots. LLM demonstrates the ability to achieve generalization through summarization Amazing ability to exploit complex object properties and relationships learned from massive text datasets."

They also write: "Unlike traditional methods that require expensive data collection and model training, we show that LLM can Achieve generalization in the field of robotics directly out of the box, leveraging the powerful summarization capabilities they learn from massive amounts of text data."

The researchers demonstrated a robot on the paper's website that can do laundry. Separate into light and dark colors, recycle drink cans, throw away trash, pack bags and cutlery, put scattered items back in their place, and put toys in drawers.

The researchers first tested a text-based benchmark dataset in which user preferences were entered and the model was asked to create personalization rules to determine item attribution. The model summarizes the examples into general rules and uses the summary to determine where to place new items. Baseline scenes are defined in four rooms, each with 24 scenes. Each scene contains between two and five places to place items, and there are an equal number of seen and unseen items for the model to classify. The test achieved 91.2 percent accuracy on unseen items, they wrote.

When they applied this method to a real-world robot, TidyBot, they found that it was able to successfully pick up 85 percent of the objects. TidyBot was tested in eight real-life scenarios, each with a set of ten objects, and the robot was run three times in each scenario. According to IT House, in addition to LLM, TidyBot also uses an image classifier called CLIP and an object detector called OWL-ViT.

Danfei Xu, an assistant professor at Georgia Institute of Technology’s School of Interactive Computing, said when talking about Google’s PaLM-E model that LLM gives robots more problem-solving capabilities. "Most previous mission planning systems relied on some form of search or optimization algorithms, which were less flexible and difficult to build. LLM and multimodal LLM enable these systems to benefit from Internet-scale data and easily use to solve new problems," he said.

The above is the detailed content of Researchers develop robot that can understand English commands and perform household chores. For more information, please follow other related articles on the PHP Chinese website!

Statement
This article is reproduced at:51CTO.COM. If there is any infringement, please contact admin@php.cn delete
Tool Calling in LLMsTool Calling in LLMsApr 14, 2025 am 11:28 AM

Large language models (LLMs) have surged in popularity, with the tool-calling feature dramatically expanding their capabilities beyond simple text generation. Now, LLMs can handle complex automation tasks such as dynamic UI creation and autonomous a

How ADHD Games, Health Tools & AI Chatbots Are Transforming Global HealthHow ADHD Games, Health Tools & AI Chatbots Are Transforming Global HealthApr 14, 2025 am 11:27 AM

Can a video game ease anxiety, build focus, or support a child with ADHD? As healthcare challenges surge globally — especially among youth — innovators are turning to an unlikely tool: video games. Now one of the world’s largest entertainment indus

UN Input On AI: Winners, Losers, And OpportunitiesUN Input On AI: Winners, Losers, And OpportunitiesApr 14, 2025 am 11:25 AM

“History has shown that while technological progress drives economic growth, it does not on its own ensure equitable income distribution or promote inclusive human development,” writes Rebeca Grynspan, Secretary-General of UNCTAD, in the preamble.

Learning Negotiation Skills Via Generative AILearning Negotiation Skills Via Generative AIApr 14, 2025 am 11:23 AM

Easy-peasy, use generative AI as your negotiation tutor and sparring partner. Let’s talk about it. This analysis of an innovative AI breakthrough is part of my ongoing Forbes column coverage on the latest in AI, including identifying and explaining

TED Reveals From OpenAI, Google, Meta Heads To Court, Selfie With MyselfTED Reveals From OpenAI, Google, Meta Heads To Court, Selfie With MyselfApr 14, 2025 am 11:22 AM

The ​TED2025 Conference, held in Vancouver, wrapped its 36th edition yesterday, April 11. It featured 80 speakers from more than 60 countries, including Sam Altman, Eric Schmidt, and Palmer Luckey. TED’s theme, “humanity reimagined,” was tailor made

Joseph Stiglitz Warns Of The Looming Inequality Amid AI Monopoly PowerJoseph Stiglitz Warns Of The Looming Inequality Amid AI Monopoly PowerApr 14, 2025 am 11:21 AM

Joseph Stiglitz is renowned economist and recipient of the Nobel Prize in Economics in 2001. Stiglitz posits that AI can worsen existing inequalities and consolidated power in the hands of a few dominant corporations, ultimately undermining economic

What is Graph Database?What is Graph Database?Apr 14, 2025 am 11:19 AM

Graph Databases: Revolutionizing Data Management Through Relationships As data expands and its characteristics evolve across various fields, graph databases are emerging as transformative solutions for managing interconnected data. Unlike traditional

LLM Routing: Strategies, Techniques, and Python ImplementationLLM Routing: Strategies, Techniques, and Python ImplementationApr 14, 2025 am 11:14 AM

Large Language Model (LLM) Routing: Optimizing Performance Through Intelligent Task Distribution The rapidly evolving landscape of LLMs presents a diverse range of models, each with unique strengths and weaknesses. Some excel at creative content gen

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. Best Graphic Settings
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
R.E.P.O. How to Fix Audio if You Can't Hear Anyone
4 weeks agoBy尊渡假赌尊渡假赌尊渡假赌
WWE 2K25: How To Unlock Everything In MyRise
1 months agoBy尊渡假赌尊渡假赌尊渡假赌

Hot Tools

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

SecLists

SecLists

SecLists is the ultimate security tester's companion. It is a collection of various types of lists that are frequently used during security assessments, all in one place. SecLists helps make security testing more efficient and productive by conveniently providing all the lists a security tester might need. List types include usernames, passwords, URLs, fuzzing payloads, sensitive data patterns, web shells, and more. The tester can simply pull this repository onto a new test machine and he will have access to every type of list he needs.

Dreamweaver Mac version

Dreamweaver Mac version

Visual web development tools

PhpStorm Mac version

PhpStorm Mac version

The latest (2018.2.1) professional PHP integrated development tool