Categories: NewsData

AlphaZero: The genesis of machine intuition

2 min read

Give it four days to practice and you would have a chess master ready! 

This line stands true for Deepmind’s latest AI program, AlphaZero.

AlphaZero is an advanced version of AlphaGo Zero–the AI that recently won all games of Go against its precursor AlphaGo–relies simply on self-play without any example games. AlphaZero is an improvement to it as it shows that the same program can master three different types of board games, Chess, Shogi and Go namely. It uses reinforcement learning algorithm to achieve state-of-the-art results.

AlphaZero mastered the game of chess, without having prior domain knowledge of the game, except the game rules. Additionally, it also mastered Shogi, a Japanese board game, as showcased in a recent DeepMind research paper.

Demis Hassabis, founder, and CEO, DeepMind introduced some additional details of AlphaZero at the Neural Information Processing Systems (NIPS) conference in Long Beach, California. “It doesn’t play like a human, and it doesn’t play like a program, it plays in a third, almost alien, way,” said Hassabis. It only took four hours to self-play and create chess knowledge beyond any human or computer program. Surprisingly, it defeated Stockfish 8 (A world champion chess engine) in four hours without any external help or any prior empirical data (a database of archived chess games, or well-known chess strategies and openings).

The hyper-parameter of AlphaGo Zero’s search was tuned by using Bayesian optimization algorithm. AlphaZero reuses the same hyper-parameter for playing all the board games without performing any game-specific tuning. Similar to AlphaGo Zero, AlphaZero’s board state is encoded by spatial planes based on specifically the basic rules for each game.

While training AlphaZero, the same algorithmic settings, network architecture, and hyper-parameters were used in all three games. A separate instance of AlphaZero was trained for each game. The training initiated for 700,000 steps (mini-batches of size 4,096) starting from randomly initialized parameters, with 5,000 first-generation TPUs to generate self-play games and 64 second-generation TPUs to train the neural networks.

After comprehensive analysis, it was found that AlphaZero outperformed

  • Stockfish in Chess in 4 hours
  • Elmo in Shogi in less than 2 hrs
  • AlphaGo Lee in Go in 8 hours

The achievements by AlphaZero are impressive, to say the least. Researchers at DeepMind say that it still needs to play many more practice games than a human chess champion. Human learning is based on watching other people play and also by learning in different ways, which a machine cannot achieve. But it can go beyond human thinking by expanding the capabilities of its program. To know more about how AlphaZero masters chess and Shogi using Reinforcement algorithm, you can have a look at the research paper here or tune into the game series on Youtube to watch the video.

Savia Lobo

A Data science fanatic. Loves to be updated with the tech happenings around the globe. Loves singing and composing songs. Believes in putting the art in smart.

Share
Published by
Savia Lobo

Recent Posts

Top life hacks for prepping for your IT certification exam

I remember deciding to pursue my first IT certification, the CompTIA A+. I had signed…

3 years ago

Learn Transformers for Natural Language Processing with Denis Rothman

Key takeaways The transformer architecture has proved to be revolutionary in outperforming the classical RNN…

3 years ago

Learning Essential Linux Commands for Navigating the Shell Effectively

Once we learn how to deploy an Ubuntu server, how to manage users, and how…

3 years ago

Clean Coding in Python with Mariano Anaya

Key-takeaways:   Clean code isn’t just a nice thing to have or a luxury in software projects; it's a necessity. If we…

3 years ago

Exploring Forms in Angular – types, benefits and differences   

While developing a web application, or setting dynamic pages and meta tags we need to deal with…

3 years ago

Gain Practical Expertise with the Latest Edition of Software Architecture with C# 9 and .NET 5

Software architecture is one of the most discussed topics in the software industry today, and…

3 years ago