Home Data News What if AIs could collaborate using human-like values? DeepMind researchers propose a...

What if AIs could collaborate using human-like values? DeepMind researchers propose a Hanabi platform.

February 6, 2019 - 6:53 am

2622

3 min read

A paper with inputs from 15 researchers talks about artificial intelligence systems playing a game called Hanabi in a paper titled The Hanabi Challenge: A New Frontier for AI Research. The researchers propose an experimental framework called the Hanabi Learning Environment for the AI research community to test out and advance algorithms. This can help in assessing the performance of the current state-of-the-art algorithms and techniques.

What’s special about Hanabi?

Hanabi is a two to five player game involving cards with numbers on them. In Hanabi, you are playing against other participants but you also need to trust the imperfect information they provide and make deductions to advance your cards. Hanabi has an imperfect nature as players cannot see their own cards. This game is a test of collaboratively sharing information with discretion. The rules are specified in detail in the research paper.

Games have always been used to showcase or test the ability of artificial intelligence and machine learning, be it Go, Chess, Dota 2, or other games. So why would Hanabi be ‘A New Frontier for AI Research’? The difference is that Hanabi needs a bit of human touch to play. Factors like trust, imperfect information, and co-operation come into the picture, with this game which is why it is a good testing ground for AI applications.

What’s the paper about?

The idea is to test the collaboration of AI agents where the information is limited and only implicit communication is allowed. The researchers say that Hanabi increases reasoning of beliefs and intentions of other AI agents and makes them prominent. They believe that, developing techniques that instill agents with such theory will, in addition to succeeding at Hanabi, unlock ways how agents can collaborate efforts with human partners.

The researchers have even introduced an open-source ‘Hanabi Learning Environment’, which is an experimental framework for other researchers to assess their techniques in the environment.

To play Hanabi, the theory of mind is necessary, which revolves around human-like traits such as beliefs, intents, desires, etc The human approach of the theory of mind reasoning is important not just in how humans approach this game. It is also about how humans handle communication and interactions when multiple parties are involved..

Results and further work

State-of-the-art reinforcement learning algorithms using deep learning are evaluated in the paper. In self-play, they fall short of the hand-coded Hanabi playing bots. In case of collaborative play, they do not collaborate at all. This shows that there is a lot of room for advances in this area related to theory of mind.

The code for the Hanabi Learning Environment is being written in Python and C++ and will be available on DeepMind’s GitHub. Its interface is similar to OpenAI Gym.

For more details about the game and how the theory will help in testing AI agent interactions, check out the research paper.

Top 6 Cybersecurity Books from Packt to Accelerate Your Career

Your Quick Introduction to Extended Events in Analysis Services from Blog…

Logging the history of my past SQL Saturday presentations from Blog…

Storage savings with Table Compression from Blog Posts – SQLServerCentral

Daily Coping 31 Dec 2020 from Blog Posts – SQLServerCentral

Learning Essential Linux Commands for Navigating the Shell Effectively

Exploring the Strategy Behavioral Design Pattern in Node.js

How to integrate a Medium editor in Angular 8

Implementing memory management with Golang’s garbage collector

How to create sales analysis app in Qlik Sense using DAR…

What if AIs could collaborate using human-like values? DeepMind researchers propose a Hanabi platform.

What’s special about Hanabi?

What’s the paper about?

Results and further work

Read next

MobilePro

datapro

Programming

Subscribe to our newsletter