Literature DB >> 33361790

Mastering Atari, Go, chess and shogi by planning with a learned model.

Julian Schrittwieser1, Ioannis Antonoglou1,2, Thomas Hubert1, David Silver3,4, Karen Simonyan1, Laurent Sifre1, Simon Schmitt1, Arthur Guez1, Edward Lockhart1, Demis Hassabis1, Thore Graepel1,2, Timothy Lillicrap1.   

Abstract

Constructing agents with planning capabilities has long been one of the main challenges in the pursuit of artificial intelligence. Tree-based planning methods have enjoyed huge success in challenging domains, such as chess1 and Go2, where a perfect simulator is available. However, in real-world problems, the dynamics governing the environment are often complex and unknown. Here we present the MuZero algorithm, which, by combining a tree-based search with a learned model, achieves superhuman performance in a range of challenging and visually complex domains, without any knowledge of their underlying dynamics. The MuZero algorithm learns an iterable model that produces predictions relevant to planning: the action-selection policy, the value function and the reward. When evaluated on 57 different Atari games3-the canonical video game environment for testing artificial intelligence techniques, in which model-based planning approaches have historically struggled4-the MuZero algorithm achieved state-of-the-art performance. When evaluated on Go, chess and shogi-canonical environments for high-performance planning-the MuZero algorithm matched, without any knowledge of the game dynamics, the superhuman performance of the AlphaZero algorithm5 that was supplied with the rules of the game.

Entities:  

Year:  2020        PMID: 33361790     DOI: 10.1038/s41586-020-03051-4

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  31 in total

Review 1.  Artificial intelligence unifies knowledge and actions in drug repositioning.

Authors:  Zheng Yin; Stephen T C Wong
Journal:  Emerg Top Life Sci       Date:  2021-12-21

Review 2.  Structure-based protein design with deep learning.

Authors:  Sergey Ovchinnikov; Po-Ssu Huang
Journal:  Curr Opin Chem Biol       Date:  2021-09-20       Impact factor: 8.822

Review 3.  How learning unfolds in the brain: toward an optimization view.

Authors:  Jay A Hennig; Emily R Oby; Darby M Losey; Aaron P Batista; Byron M Yu; Steven M Chase
Journal:  Neuron       Date:  2021-10-13       Impact factor: 17.173

4.  Learning cortical representations through perturbed and adversarial dreaming.

Authors:  Walter Senn; Jakob Jordan; Nicolas Deperrois; Mihai A Petrovici
Journal:  Elife       Date:  2022-04-06       Impact factor: 8.713

5.  Towards intellectual freedom in an AI Ethics Global Community.

Authors:  Christoph Ebell; Ricardo Baeza-Yates; Richard Benjamins; Hengjin Cai; Mark Coeckelbergh; Tania Duarte; Merve Hickok; Aurelie Jacquet; Angela Kim; Joris Krijger; John MacIntyre; Piyush Madhamshettiwar; Lauren Maffeo; Jeanna Matthews; Larry Medsker; Peter Smith; Savannah Thais
Journal:  AI Ethics       Date:  2021-04-13

6.  The neuroecology of the water-to-land transition and the evolution of the vertebrate brain.

Authors:  Malcolm A MacIver; Barbara L Finlay
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-12-27       Impact factor: 6.237

Review 7.  Formalizing planning and information search in naturalistic decision-making.

Authors:  L T Hunt; N D Daw; P Kaanders; M A MacIver; U Mugan; E Procyk; A D Redish; E Russo; J Scholl; K Stachenfeld; C R E Wilson; N Kolling
Journal:  Nat Neurosci       Date:  2021-06-21       Impact factor: 28.771

Review 8.  Promises and challenges of human computational ethology.

Authors:  Dean Mobbs; Toby Wise; Nanthia Suthana; Noah Guzmán; Nikolaus Kriegeskorte; Joel Z Leibo
Journal:  Neuron       Date:  2021-06-17       Impact factor: 18.688

9.  Co-Evolution of Predator-Prey Ecosystems by Reinforcement Learning Agents.

Authors:  Jeongho Park; Juwon Lee; Taehwan Kim; Inkyung Ahn; Jooyoung Park
Journal:  Entropy (Basel)       Date:  2021-04-13       Impact factor: 2.524

Review 10.  Nobel Turing Challenge: creating the engine for scientific discovery.

Authors:  Hiroaki Kitano
Journal:  NPJ Syst Biol Appl       Date:  2021-06-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.