Meet the humanoid robot that learns from natural language, mimics human emotions

The University of Tokyo created a humanoid robot, Alter3, that can mimic human actions such as taking selfies, throwing a ball or playing air guitar.

Imagine having a robot friend that can take selfies, toss a ball, eat popcorn and play air guitar.

Well, you might not have to wait too long.

Researchers at the University of Tokyo have created a robot that can do all that and more, thanks to GPT-4, OpenAI's large language model (LLM).

Alter3 is the third generation of the Alter series of humanoid robots, which began in 2016 as a platform for exploring the concept of life in artificial systems. It has a realistic appearance and can move its upper body, head and face using 43 axes driven by air actuators. It also has a camera in each eye that allows it to see and interact with humans and the environment.

But what makes Alter3 really special is that it can now use GPT-4, a deep learning model that generates natural language text from a given prompt, to control its movements and behaviors. This means that instead of having to program every single action for the robot, the researchers can simply give it instructions in plain language and let GPT-4 generate the corresponding Python code, which runs on the robot's android engine.
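To make the idea concrete, here is a minimal sketch of that instruction-to-code loop. Everything in it is an assumption made for illustration: the article does not describe the robot's programming interface, so `MockRobot`, `set_axis`, the 0 to 255 value range and the `fake_llm` placeholder (which stands in for a real GPT-4 call and returns canned code) are all hypothetical.

```python
from typing import Callable, Dict

NUM_AXES = 43  # the article says Alter3 has 43 air-actuated axes


class MockRobot:
    """Stand-in for the real robot: records axis values instead of moving."""

    def __init__(self) -> None:
        self.axes: Dict[int, int] = {}

    def set_axis(self, axis: int, value: int) -> None:
        # Hypothetical command. The 1..43 indexing and 0..255 range are assumed.
        if not 1 <= axis <= NUM_AXES:
            raise ValueError(f"axis {axis} out of range")
        if not 0 <= value <= 255:
            raise ValueError(f"value {value} out of range")
        self.axes[axis] = value


def fake_llm(instruction: str) -> str:
    """Placeholder for a GPT-4 call: ignores the instruction, returns canned code."""
    return "set_axis(1, 200)\nset_axis(16, 128)\n"


def run_instruction(
    instruction: str,
    robot: MockRobot,
    llm: Callable[[str], str] = fake_llm,
) -> None:
    """Ask the model for motion code, then run it against the robot."""
    code = llm(instruction)
    # Only set_axis is exposed to the generated code. This is NOT a real
    # sandbox; running model output with exec is unsafe outside a demo.
    exec(code, {"__builtins__": {}}, {"set_axis": robot.set_axis})


robot = MockRobot()
run_instruction("Raise the right hand high.", robot)
print(robot.axes)
```

The point of the sketch is the shape of the loop: plain-language text goes in, short motion code comes out, and the robot only has to expose a small set of commands for that code to call.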

For example, to make Alter3 take a selfie, the researchers can say something like:

"Create a big, joyful smile and widen your eyes to show excitement. Swiftly turn the upper body slightly to the left, adopting a dynamic posture. Raise the right hand high, simulating a phone. Flex the right elbow, bringing the phone closer to the face. Tilt the head slightly to the right, giving a playful vibe."

And GPT-4 will produce the code that makes Alter3 do exactly that.
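One way to picture how a long instruction like this could be handled is to break it into one step per sentence before any code is generated. The sketch below does only that splitting. It is an illustrative assumption, not a description of the researchers' actual pipeline, and the helper name `split_steps` is hypothetical.

```python
import re
from typing import List

SELFIE = (
    "Create a big, joyful smile and widen your eyes to show excitement. "
    "Swiftly turn the upper body slightly to the left, adopting a dynamic posture. "
    "Raise the right hand high, simulating a phone. "
    "Flex the right elbow, bringing the phone closer to the face. "
    "Tilt the head slightly to the right, giving a playful vibe."
)


def split_steps(text: str) -> List[str]:
    """Split an instruction into sentence-level steps (naive punctuation split)."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p.strip() for p in parts if p.strip()]


steps = split_steps(SELFIE)
for number, step in enumerate(steps, start=1):
    print(f"{number}. {step}")
```

Each step could then be turned into motion code on its own, which keeps every request to the language model small and easy to check.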

The researchers have tested Alter3 with GPT-4 in various scenarios, such as tossing a ball, eating popcorn, and playing air guitar. They have also experimented with different types of feedback, such as linguistic, visual, and emotional, to improve the robot’s performance and adaptability.

One of the most interesting aspects of Alter3’s behavior is that it can learn from its own memory and from human responses. For instance, if the robot does something that makes a human laugh or smile, it will remember that and try to repeat it in the future. This is similar to how newborn babies imitate their parents’ expressions and gestures.
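The article does not say how this memory is implemented, so the following is only a toy sketch of the idea: keep a tally of which actions drew a positive human reaction and prefer the one with the highest count. The class `ActionMemory` and its methods are hypothetical names chosen for this example.

```python
from typing import Dict, Optional


class ActionMemory:
    """Toy memory: counts positive human reactions per action."""

    def __init__(self) -> None:
        self.scores: Dict[str, int] = {}

    def record(self, action: str, positive_reaction: bool) -> None:
        # Every action is remembered; only positive reactions raise its score.
        self.scores[action] = self.scores.get(action, 0) + (1 if positive_reaction else 0)

    def favorite(self) -> Optional[str]:
        """Return the action with the most positive reactions, if any exist."""
        if not self.scores:
            return None
        return max(self.scores, key=lambda action: self.scores[action])


memory = ActionMemory()
memory.record("wave", True)
memory.record("air guitar", True)
memory.record("air guitar", True)
memory.record("shrug", False)
print(memory.favorite())
```

A real system would need something richer than a counter, such as storing the generated motion code alongside the reaction, but the counter is enough to show the "remember what made people smile" loop.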

The researchers have also added some humor and personality to Alter3’s actions. In one case, the robot pretends to eat a bag of popcorn, only to realize that it belongs to the person sitting next to it. It then shows a surprised and embarrassed expression and gestures an apology with its arms.

The research team behind Alter3 believes that this is a breakthrough in the field of robotics and artificial intelligence, as it shows how large language models can be used to bridge the gap between natural language and robot control. This opens up new possibilities for human-robot collaboration and communication, as well as for creating more intelligent, adaptable, and personable robotic entities.

The paper, titled "From Text to Motion: Grounding GPT-4 in a Humanoid Robot ‘Alter3,’" was written by Takahide Yoshida, Atsushi Masumori and Takashi Ikegami and is available on the preprint server arXiv. The authors hope that their work will inspire more research and development in this direction and that one day we might be able to have robot friends that can understand us and share our interests and emotions.

Alter3 is an example of how natural language processing and robotics can work together to create engaging interactions. By using GPT-4, the robot can perform a variety of tasks and behaviors based on verbal commands, without requiring extensive programming or manual control. This also allows the robot to learn from its own experience and from human feedback and to express some humor and personality. Alter3 demonstrates the potential of large language models to advance robotics and artificial intelligence and to bring us closer to having robot friends that can relate to us and entertain us.

What do you think of Alter3 and its abilities? Would you like to have a robot like that in your life? Let us know by writing us at Cyberguy.com/Contact.

Copyright 2024 CyberGuy.com. All rights reserved.
