Categories: Technology Facts

These virtual robot arms get smarter by training each otherWill Heavenon January 22, 2021 at 12:50 pm

A virtual robot arm has learned to solve a wide range of different puzzles—stacking blocks, setting the table, arranging chess pieces—without having to be retrained for each task. It did this by playing against a second robot arm that was trained to give it harder and harder challenges.

Self play: Developed by researchers at OpenAI, the identical robot arms—Alice and Bob—learn by playing a game against each other in a simulation, without human input. The robots use reinforcement learning, a technique in which AIs are trained by trial and error what actions to take in different situations to achieve certain goals. The game involves moving objects around on a virtual tabletop. By arranging objects in specific ways, Alice tries to set puzzles that are hard for Bob to solve. Bob tries to solve Alice’s puzzles. As they learn, Alice sets more complex puzzles and Bob gets better at solving them.

After training on block puzzles set by Alice, Bob can generalize to a range of tasks, including setting a table and arranging chess pieces.

Multitasking: Deep-learning models typically have to be retrained between tasks. For example, AlphaZero (which also learns by playing games against itself) uses a single algorithm to teach itself to play chess, shogi and Go—but only one game at a time. The chess-playing AlphaZero cannot play Go and the Go-playing one cannot play shogi. Building machines that really can multitask is a big unsolved problem on the road to more general AI.

AI dojo: One issue is that training an AI to multitask requires a vast number of examples. OpenAI avoids this by training Alice to generate the examples for Bob, using one AI to train another. Alice learned to set goals such as building a tower of blocks, then picking up it up and balancing it. Bob learned to use properties of the (virtual) environment, such as friction, to grasp and rotate objects.

Virtual reality: So far the approach has only been tested in a simulation but researchers at OpenAI and elsewhere are getting better at transferring models trained in virtual environments to physical ones. A simulation lets AIs churn through large datasets in a short amount of time, before being fine-tuned for real-world settings.

Overall ambition: The researchers say that their ultimate aim is to train a robot to solve any task that a person might ask it to. Like GPT-3, a language model that can use language in a wide variety of different ways, these robot arms are part of OpenAI’s overall ambition to build a multitasking AI. Using one AI to train another could be a key part of that.

Next The Interesting History of American Football »

Previous « Forget Dieting. Here’s What Really Works to Lose Weighton January 22, 2021 at 10:00 am

Building Products That Rely on Engineered Resins

Engineered resins power durable building products, from laminates to advanced composites. See how material choices…

5 days ago

Uncategorized

Understanding the Key Causes of Compressed Air Leaks

Understand the common causes of compressed air leaks, how they impact system performance, and ways…

6 days ago

Uncategorized

Top 5 Ways To Improve the Patient Experience in 2026

Explore five practical ways to elevate the patient experience in 2026. Boost comfort, communication, and…

6 days ago

Uncategorized

Solvent Waste Liability: Save Money by Reducing Risk

Cut solvent waste liability and disposal costs with practical steps that lower volume, improve handling,…

1 week ago

Uncategorized

Renovations to Increase Your Home’s Value

Smart renovations can boost your home’s value and appeal. From kitchen updates to basement finishing,…

1 week ago

Uncategorized

Who is Tahir Garaev: Professional Profile and Academic Contributions

Tahir Garaev is a Georgian historian and researcher whose professional standing derives from specialized expertise…

1 week ago

These virtual robot arms get smarter by training each otherWill Heavenon January 22, 2021 at 12:50 pm

Related Post

Recent Posts

Building Products That Rely on Engineered Resins

Understanding the Key Causes of Compressed Air Leaks

Top 5 Ways To Improve the Patient Experience in 2026

Solvent Waste Liability: Save Money by Reducing Risk

Renovations to Increase Your Home’s Value

Who is Tahir Garaev: Professional Profile and Academic Contributions