Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Mastering the game of Go without human knowledge.

Literature DB >> 29052630

Mastering the game of Go without human knowledge.

David Silver¹, Julian Schrittwieser¹, Karen Simonyan¹, Ioannis Antonoglou¹, Aja Huang¹, Arthur Guez¹, Thomas Hubert¹, Lucas Baker¹, Matthew Lai¹, Adrian Bolton¹, Yutian Chen¹, Timothy Lillicrap¹, Fan Hui¹, Laurent Sifre¹, George van den Driessche¹, Thore Graepel¹, Demis Hassabis¹.

Abstract

A long-standing goal of artificial intelligence is an algorithm that learns, tabula rasa, superhuman proficiency in challenging domains. Recently, AlphaGo became the first program to defeat a world champion in the game of Go. The tree search in AlphaGo evaluated positions and selected moves using deep neural networks. These neural networks were trained by supervised learning from human expert moves, and by reinforcement learning from self-play. Here we introduce an algorithm based solely on reinforcement learning, without human data, guidance or domain knowledge beyond game rules. AlphaGo becomes its own teacher: a neural network is trained to predict AlphaGo's own move selections and also the winner of AlphaGo's games. This neural network improves the strength of the tree search, resulting in higher quality move selection and stronger self-play in the next iteration. Starting tabula rasa, our new program AlphaGo Zero achieved superhuman performance, winning 100-0 against the previously published, champion-defeating AlphaGo.

Entities: Gene Species

Mesh：

Year: 2017 PMID： 29052630 DOI： 10.1038/nature24270

Source DB: PubMed Journal: Nature ISSN： 0028-0836 Impact factor: 49.962

6 in total

1. Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit.

Authors: R H Hahnloser; R Sarpeshkar; M A Mahowald; R J Douglas; H S Seung
Journal: Nature Date: 2000-06-22 Impact factor: 49.962

2. Mastering the game of Go with deep neural networks and tree search.

Authors: David Silver; Aja Huang; Chris J Maddison; Arthur Guez; Laurent Sifre; George van den Driessche; Julian Schrittwieser; Ioannis Antonoglou; Veda Panneershelvam; Marc Lanctot; Sander Dieleman; Dominik Grewe; John Nham; Nal Kalchbrenner; Ilya Sutskever; Timothy Lillicrap; Madeleine Leach; Koray Kavukcuoglu; Thore Graepel; Demis Hassabis
Journal: Nature Date: 2016-01-28 Impact factor: 49.962

Review 3. Deep learning.

Authors: Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal: Nature Date: 2015-05-28 Impact factor: 49.962

4. DeepStack: Expert-level artificial intelligence in heads-up no-limit poker.

Authors: Matej Moravčík; Martin Schmid; Neil Burch; Viliam Lisý; Dustin Morrill; Nolan Bard; Trevor Davis; Kevin Waugh; Michael Johanson; Michael Bowling
Journal: Science Date: 2017-03-02 Impact factor: 47.728

5. Neocognitron: a self organizing neural network model for a mechanism of pattern recognition unaffected by shift in position.

Authors: K Fukushima
Journal: Biol Cybern Date: 1980 Impact factor: 2.086

6. Human-level control through deep reinforcement learning.

Authors: Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal: Nature Date: 2015-02-26 Impact factor: 49.962

6 in total

253 in total

1. Gamorithm.

Authors: Moshe Sipper; Jason H Moore
Journal: IEEE Trans Games Date: 2018-08-29

Review 2. Learning task-state representations.

Authors: Yael Niv
Journal: Nat Neurosci Date: 2019-09-24 Impact factor: 24.884

3. Clinical Personal Connectomics Using Hybrid PET/MRI.

Authors: Dong Soo Lee
Journal: Nucl Med Mol Imaging Date: 2019-01-15

4. Nuclear safety in the unexpected second nuclear era.

Authors: Yican Wu; Zhibin Chen; Zhen Wang; Shanqi Chen; Daochuan Ge; Chao Chen; Jiangtao Jia; Yazhou Li; Ming Jin; Tao Zhou; Fang Wang; Liqin Hu
Journal: Proc Natl Acad Sci U S A Date: 2019-08-19 Impact factor: 11.205

5. Artificial Neural Network with Composite Architectures for Prediction of Local Control in Radiotherapy.

Authors: Sunan Cui; Yi Luo; Huan Hsin Tseng; Randall K Ten Haken; Issam El Naqa
Journal: IEEE Trans Radiat Plasma Med Sci Date: 2018-11-29

6. Opinion: What does AI's success playing complex board games tell brain scientists?

Authors: Dale Purves
Journal: Proc Natl Acad Sci U S A Date: 2019-07-23 Impact factor: 11.205

Review 7. Machine behaviour.

Authors: Iyad Rahwan; Manuel Cebrian; Nick Obradovich; Josh Bongard; Jean-François Bonnefon; Cynthia Breazeal; Jacob W Crandall; Nicholas A Christakis; Iain D Couzin; Matthew O Jackson; Nicholas R Jennings; Ece Kamar; Isabel M Kloumann; Hugo Larochelle; David Lazer; Richard McElreath; Alan Mislove; David C Parkes; Alex 'Sandy' Pentland; Margaret E Roberts; Azim Shariff; Joshua B Tenenbaum; Michael Wellman
Journal: Nature Date: 2019-04-24 Impact factor: 49.962