Literature DB >> 29052630

Mastering the game of Go without human knowledge.

David Silver1, Julian Schrittwieser1, Karen Simonyan1, Ioannis Antonoglou1, Aja Huang1, Arthur Guez1, Thomas Hubert1, Lucas Baker1, Matthew Lai1, Adrian Bolton1, Yutian Chen1, Timothy Lillicrap1, Fan Hui1, Laurent Sifre1, George van den Driessche1, Thore Graepel1, Demis Hassabis1.   

Abstract

A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo's own move selections and also the winner of AlphaGo's games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100-0 against the previously published, champion-defeating AlphaGo.

Entities:  

Mesh:

Year:  2017        PMID: 29052630     DOI: 10.1038/nature24270

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  6 in total

1.  Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit.

Authors:  R H Hahnloser; R Sarpeshkar; M A Mahowald; R J Douglas; H S Seung
Journal:  Nature       Date:  2000-06-22       Impact factor: 49.962

2.  Mastering the game of Go with deep neural networks and tree search.

Authors:  David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal:  Nature       Date:  2016-01-28       Impact factor: 49.962

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.

Authors:  Matej Moravčík; Martin Schmid; Neil Burch; Viliam Lisý; Dustin Morrill; Nolan Bard; Trevor Davis; Kevin Waugh; Michael Johanson; Michael Bowling
Journal:  Science       Date:  2017-03-02       Impact factor: 47.728

5.  Neocognitron: a self organizing neural network model for a mechanism of pattern recognition unaffected by shift in position.

Authors:  K Fukushima
Journal:  Biol Cybern       Date:  1980       Impact factor: 2.086

6.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

  6 in total
  253 in total

1.  Gamorithm.

Authors:  Moshe Sipper; Jason H Moore
Journal:  IEEE Trans Games       Date:  2018-08-29

Review 2.  Learning task-state representations.

Authors:  Yael Niv
Journal:  Nat Neurosci       Date:  2019-09-24       Impact factor: 24.884

3.  Clinical Personal Connectomics Using Hybrid PET/MRI.

Authors:  Dong Soo Lee
Journal:  Nucl Med Mol Imaging       Date:  2019-01-15

4.  Nuclear safety in the unexpected second nuclear era.

Authors:  Yican Wu; Zhibin Chen; Zhen Wang; Shanqi Chen; Daochuan Ge; Chao Chen; Jiangtao Jia; Yazhou Li; Ming Jin; Tao Zhou; Fang Wang; Liqin Hu
Journal:  Proc Natl Acad Sci U S A       Date:  2019-08-19       Impact factor: 11.205

5.  Artificial Neural Network with Composite Architectures for Prediction of Local Control in Radiotherapy.

Authors:  Sunan Cui; Yi Luo; Huan Hsin Tseng; Randall K Ten Haken; Issam El Naqa
Journal:  IEEE Trans Radiat Plasma Med Sci       Date:  2018-11-29

6.  Opinion: What does AI's success playing complex board games tell brain scientists?

Authors:  Dale Purves
Journal:  Proc Natl Acad Sci U S A       Date:  2019-07-23       Impact factor: 11.205

Review 7.  Machine behaviour.

Authors:  Iyad Rahwan; Manuel Cebrian; Nick Obradovich; Josh Bongard; Jean-François Bonnefon; Cynthia Breazeal; Jacob W Crandall; Nicholas A Christakis; Iain D Couzin; Matthew O Jackson; Nicholas R Jennings; Ece Kamar; Isabel M Kloumann; Hugo Larochelle; David Lazer; Richard McElreath; Alan Mislove; David C Parkes; Alex 'Sandy' Pentland; Margaret E Roberts; Azim Shariff; Joshua B Tenenbaum; Michael Wellman
Journal:  Nature       Date:  2019-04-24       Impact factor: 49.962

8.  Artificial intelligence accelerated by light.

Authors:  Huaqiang Wu; Qionghai Dai
Journal:  Nature       Date:  2021-01       Impact factor: 49.962

9.  Universal approximation with quadratic deep networks.

Authors:  Fenglei Fan; Jinjun Xiong; Ge Wang
Journal:  Neural Netw       Date:  2020-01-18

Review 10.  Machine learning in human movement biomechanics: Best practices, common pitfalls, and new opportunities.

Authors:  Eni Halilaj; Apoorva Rajagopal; Madalina Fiterau; Jennifer L Hicks; Trevor J Hastie; Scott L Delp
Journal:  J Biomech       Date:  2018-09-13       Impact factor: 2.712

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.