Mastering Atari, Go, chess and shogi by planning with a learned model
Julian Schrittwieser1, Ioannis Antonoglou1,2, Thomas Hubert1, David Silver3,4, Karen Simonyan1, Laurent Sifre1, Simon Schmitt1, Arthur Guez1, Edward Lockhart1, Demis Hassabis1, Thore Graepel1,2, Timothy Lillicrap1.
Abstract
Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge success in challenging domains, such as chess1 and Go2, where a perfect simulator is available. However, in real-world problems, the dynamics governing the environment are often complex and unknown. Here we present the MuZero algorithm, which, by combining a tree-based search with a learned model, achieves superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics. The MuZero algorithm learns a model that, when applied iteratively, predicts the quantities most directly relevant to planning: the reward, the action-selection policy and the value function. When evaluated on 57 different Atari games3 (the canonical video game environment for testing artificial intelligence techniques, in which model-based planning approaches have historically struggled4), the MuZero algorithm achieved state-of-the-art performance. When evaluated on Go, chess and shogi (canonical environments for high-performance planning), the MuZero algorithm matched, without any knowledge of the game dynamics, the superhuman performance of the AlphaZero algorithm5 that was supplied with the rules of the game.
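The abstract's central idea, a learned model that produces only the quantities needed for planning (reward, policy, value) rather than reconstructing the environment, can be sketched as follows. This is a minimal toy illustration, not the paper's implementation: the class and function names (`ToyMuZeroModel`, `rollout_value`) are invented here, and the three learned networks (representation h, dynamics g, prediction f) are replaced by fixed scalar functions.

```python
from dataclasses import dataclass
from typing import List, Tuple


class ToyMuZeroModel:
    """Stand-in for MuZero's three learned functions: h (representation),
    g (dynamics) and f (prediction). Here they are fixed toy functions
    operating on a scalar hidden state; in the paper they are neural
    networks trained end-to-end. Names and shapes are illustrative only."""

    def represent(self, observation: float) -> float:
        # h: observation -> initial hidden state
        return observation * 0.5

    def dynamics(self, hidden: float, action: int) -> Tuple[float, float]:
        # g: (hidden state, action) -> (next hidden state, predicted reward)
        next_hidden = hidden + (action - 0.5)
        reward = 1.0 if action == 1 else 0.0
        return next_hidden, reward

    def predict(self, hidden: float) -> Tuple[List[float], float]:
        # f: hidden state -> (policy prior over actions, value estimate)
        policy = [0.4, 0.6]
        value = hidden
        return policy, value


def rollout_value(model: ToyMuZeroModel, observation: float,
                  actions: List[int], discount: float = 0.99) -> float:
    """Unroll the learned model along one action sequence and return the
    discounted sum of predicted rewards, bootstrapped with the final value
    estimate -- the quantity a tree search backs up along one search path.
    Note the real environment is never consulted after the first observation."""
    hidden = model.represent(observation)
    total, factor = 0.0, 1.0
    for a in actions:
        hidden, reward = model.dynamics(hidden, a)
        total += factor * reward
        factor *= discount
    _, value = model.predict(hidden)
    return total + factor * value
```

For example, `rollout_value(ToyMuZeroModel(), 1.0, [1, 0])` evaluates the path "action 1 then action 0" entirely inside the learned latent space, which is what allows MuZero's search to plan without knowing the true game rules.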
Year: 2020 PMID: 33361790 DOI: 10.1038/s41586-020-03051-4
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962