Google AI researchers introduce PlaNet, an AI agent that can learn about the world using only images

2 min read

The Google AI team in collaboration with DeepMind announced a new and open source “Deep Planning” Network, called PlaNet, last week. PlaNet is an AI agent that learns a world model using only image inputs and further plans with these models to gain experiences.

PlaNet can easily solve a variety of image-based control tasks as well as compete with the advanced model-free agents. The Google AI team is also releasing the source code for the research community to further explore and build upon PlaNet.

How does PlaNet work?

PlaNet depends on a compact sequence of hidden or latent states. This is known called a latent dynamics model where instead of predicting directly from one image to the next image, the latent state forward is first predicted. “By compressing the images in this way, the agent can automatically learn more abstract representations, such as positions and velocities of objects, making it easier to predict forward without having to generate images along the way”, states the Google AI team.

In a latent dynamics model, the information of the input images gets integrated into the hidden states with the help of an encoder network. The hidden state then gets further projected forward to predict future images and rewards. For planning, past images are encoded into the current hidden state, and then the future rewards for multiple action sequences are predicted.

PlaNet agents trained on different image-based control tasks

PlaNet agents are trained across a variety of image-based control tasks. These tasks pose different challenges such as partial observability, sparse rewards for catching a ball, etc. Moreover, a single PlaNet agent is trained to solve all six tasks. Without any changes to the hyperparameters, this multi-task agent is able to achieve the same mean performance as individual agents.

“We advocate for further research that focuses on learning accurate dynamics models on tasks of even higher difficulty, such as 3D environments and real-world robotics tasks. We are excited about the possibilities that model-based reinforcement learning opens up”, states the Google AI team.

For more information, check out the official Google AI PlaNet announcement.

Top life hacks for prepping for your IT certification exam

I remember deciding to pursue my first IT certification, the CompTIA A+. I had signed…

3 years ago

Artificial Intelligence

Learn Transformers for Natural Language Processing with Denis Rothman

Key takeaways The transformer architecture has proved to be revolutionary in outperforming the classical RNN…

3 years ago

Servers

Learning Essential Linux Commands for Navigating the Shell Effectively

Once we learn how to deploy an Ubuntu server, how to manage users, and how…

3 years ago

Interviews

Clean Coding in Python with Mariano Anaya

Key-takeaways: Clean code isn’t just a nice thing to have or a luxury in software projects; it's a necessity. If we…

3 years ago

Front-End Web Development

Exploring Forms in Angular – types, benefits and differences   

While developing a web application, or setting dynamic pages and meta tags we need to deal with…

3 years ago

Featured

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Software architecture is one of the most discussed topics in the software industry today, and…

3 years ago

Google AI researchers introduce PlaNet, an AI agent that can learn about the world using only images

How does PlaNet work?

Read Next

Recent Posts

Top life hacks for prepping for your IT certification exam

Learn Transformers for Natural Language Processing with Denis Rothman

Learning Essential Linux Commands for Navigating the Shell Effectively

Clean Coding in Python with Mariano Anaya

Exploring Forms in Angular – types, benefits and differences

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Google AI researchers introduce PlaNet, an AI agent that can learn about the world using only images

How does PlaNet work?

Read Next

Related Post

Recent Posts

Top life hacks for prepping for your IT certification exam

Learn Transformers for Natural Language Processing with Denis Rothman

Learning Essential Linux Commands for Navigating the Shell Effectively

Clean Coding in Python with Mariano Anaya

Exploring Forms in Angular – types, benefits and differences

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Exploring Forms in Angular – types, benefits and differences