erek
[H]F Junkie
- Joined
- Dec 19, 2005
- Messages
- 10,894
"With Agent57, however, data and rules needed to be fed into the AI so it could learn and then estimate the best course of action. Due to the estimation rather than modeling all outcomes, Agent57 was also a model-free algorithm. Calculating all game outcomes can be rather computationally intense, so it was not really ever implemented. Now, MuZero simply learns the rules of the game and models only the most important information, which makes it incredibly efficient over typical model-based systems. The DeepMind team writes on its blog about the three key elements for MuZero’s models:
Overall, this AI seems to be an essential step toward AI that has better problem-solving skills. When problem-solving skills improve, AI can be applied to many more things than just games. It is a little bit like a child growing up, except it is an algorithm, learning and improving at an exceptionally fast rate. Perhaps we are on the way toward Skynet now."
https://hothardware.com/news/muzero-deepmind-ai
They liken these model elements to the weather, wherein rather than modeling all the raindrops, it would be more useful to know an umbrella will keep you dry.The value: how good is the current position?
The policy: which action is the best to take?
The reward: how good was the last action?
Overall, this AI seems to be an essential step toward AI that has better problem-solving skills. When problem-solving skills improve, AI can be applied to many more things than just games. It is a little bit like a child growing up, except it is an algorithm, learning and improving at an exceptionally fast rate. Perhaps we are on the way toward Skynet now."
https://hothardware.com/news/muzero-deepmind-ai