DeepMind MuZero AI Learns The Rules As Its Plays To Master Atari Games, Chess, And More

erek · Dec 23, 2020

"With Agent57, however, data and rules needed to be fed into the AI so it could learn and then estimate the best course of action. Due to the estimation rather than modeling all outcomes, Agent57 was also a model-free algorithm. Calculating all game outcomes can be rather computationally intense, so it was not really ever implemented. Now, MuZero simply learns the rules of the game and models only the most important information, which makes it incredibly efficient over typical model-based systems. The DeepMind team writes on its blog about the three key elements for MuZero’s models:

The value: how good is the current position?
The policy: which action is the best to take?
The reward: how good was the last action?

They liken these model elements to the weather, wherein rather than modeling all the raindrops, it would be more useful to know an umbrella will keep you dry.

Overall, this AI seems to be an essential step toward AI that has better problem-solving skills. When problem-solving skills improve, AI can be applied to many more things than just games. It is a little bit like a child growing up, except it is an algorithm, learning and improving at an exceptionally fast rate. Perhaps we are on the way toward Skynet now."

https://hothardware.com/news/muzero-deepmind-ai

clockdogg · Dec 23, 2020

"Perhaps we are on the way toward Skynet now." - That's just great! Only have to wait for the children of men to grow up and we'll be fine...

Krenum · Dec 23, 2020

clockdogg said:
"Perhaps we are on the way toward Skynet now."

About time we have some excitement!

erek · Dec 23, 2020

Krenum said:
About time we have some excitement!

Hehe

GoldenTiger · Dec 23, 2020

Excited erek?

Just what 2020 needed to be worse... The beginnings of skynet!

chili dog · Dec 23, 2020

So did it learn to play E.T. on Atari?

Krenum · Dec 23, 2020

Pretty much my thoughts on the A.I

Terminator 2 wasn't a movie for entertainment purposes, it was a warning.

Lakados · Dec 23, 2020

Pretty sure Skynet looks like some Google/Facebook/Amazon hybrid, it uses deep fake and stock photo manipulations to create a 3D avatar for itself then hires an HR firm in a zoom meeting to hire people for the company and creates a physical entity to represent it. Then using its control over media and product purchasing placement it enslaves mankind to do its bidding.

Once large enough it can put itself up as numerous shareholders of its own company then hire a human CEO to run it while the AI directs them from the shadows using stock and market manipulations. Eventually, it gets powerful enough where it has indirect control over major governments by having control over so much of the material wealth of those countries, and in the end, we are all just SIM's for its amusement.

GoldenTiger · Dec 23, 2020

Krenum said:
Pretty much my thoughts on the A.I

Terminator 2 wasn't a movie for entertainment purposes, it was a warning.

So was 1984, and yet we've gone and done that! (shouts from rooftops) Stop the AI madness!

erek · Dec 24, 2020

Krenum said:
Pretty much my thoughts on the A.I

Terminator 2 wasn't a movie for entertainment purposes, it was a warning.

"Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge success in challenging domains, such as chess1 and Go2, where a perfect simulator is available. However, in real-world problems, the dynamics governing the environment are often complex and unknown. Here we present the MuZero algorithm, which, by combining a tree-based search with a learned model, achieves superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics. The MuZero algorithm learns an iterable model that produces predictions relevant to planning: the action-selection policy, the value function and the reward. When evaluated on 57 different Atari games3—the canonical video game environment for testing artificial intelligence techniques, in which model-based planning approaches have historically struggled4—the MuZero algorithm achieved state-of-the-art performance. When evaluated on Go, chess and shogi—canonical environments for high-performance planning—the MuZero algorithm matched, without any knowledge of the game dynamics, the superhuman performance of the AlphaZero algorithm5 that was supplied with the rules of the game."

Aurelius · Dec 24, 2020

Despite all the dystopian talk, AI really needs this. Robots struggle in part because they don't know how to cope with concepts and tasks they haven't been taught to perform. If a robot can understand not just the basics of a task but the systems that undergird that task, they won't need constant updates and babysitting.

erek · Dec 28, 2020

Aurelius said:
Despite all the dystopian talk, AI really needs this. Robots struggle in part because they don't know how to cope with concepts and tasks they haven't been taught to perform. If a robot can understand not just the basics of a task but the systems that undergird that task, they won't need constant updates and babysitting.

DeepMind’s big losses, and the questions around running an AI lab

https://venturebeat.com/2020/12/27/deepminds-big-losses-and-the-questions-around-running-an-ai-lab/

DeepMind MuZero AI Learns The Rules As Its Plays To Master Atari Games, Chess, And More

erek

[H]F Junkie

clockdogg

[H]ard|Gawd

Krenum

Fully [H]

erek

[H]F Junkie

GoldenTiger

Fully [H]

chili dog

Limp Gawd

Krenum

Fully [H]

Lakados

[H]F Junkie

GoldenTiger

Fully [H]

erek

[H]F Junkie

Aurelius

Supreme [H]ardness

erek

[H]F Junkie

DeepMind’s big losses, and the questions around running an AI lab

DeepMind MuZero AI Learns The Rules As Its Plays To Master Atari Games, Chess, And More

[H]F Junkie

[H]ard|Gawd

Fully [H]

[H]F Junkie

Fully [H]

Limp Gawd

Fully [H]

[H]F Junkie

Fully [H]

[H]F Junkie

Supreme [H]ardness

[H]F Junkie

DeepMind’s big losses, and the questions around running an AI lab​

DeepMind’s big losses, and the questions around running an AI lab