Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Mastering the game of Go with deep neural networks and tree search.

Literature DB >> 26819042

Mastering the game of Go with deep neural networks and tree search.

David Silver¹, Aja Huang¹, Chris J Maddison¹, Arthur Guez¹, Laurent Sifre¹, George van den Driessche¹, Julian Schrittwieser¹, Ioannis Antonoglou¹, Veda Panneershelvam¹, Marc Lanctot¹, Sander Dieleman¹, Dominik Grewe¹, John Nham², Nal Kalchbrenner¹, Ilya Sutskever², Timothy Lillicrap¹, Madeleine Leach¹, Koray Kavukcuoglu¹, Thore Graepel¹, Demis Hassabis¹.

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

Entities: Species

Mesh：

Year: 2016 PMID： 26819042 DOI： 10.1038/nature16961

Source DB: PubMed Journal: Nature ISSN： 0028-0836 Impact factor: 49.962

Keyword Cloud
Cited

504 in total

1. Google AI algorithm masters ancient game of Go.

Authors: Elizabeth Gibney
Journal: Nature Date: 2016-01-28 Impact factor: 49.962

2. DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors: Hamid Reza Hassanzadeh; May D Wang
Journal: Proceedings (IEEE Int Conf Bioinformatics Biomed) Date: 2017-01-19

Review 3. Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors: Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal: Chem Rev Date: 2020-06-10 Impact factor: 60.622

4. What Caused What? A Quantitative Account of Actual Causation Using Dynamical Causal Networks.

Authors: Larissa Albantakis; William Marshall; Erik Hoel; Giulio Tononi
Journal: Entropy (Basel) Date: 2019-05-02 Impact factor: 2.524

Review 5. Reinventing polysomnography in the age of precision medicine.

Authors: Diane C Lim; Diego R Mazzotti; Kate Sutherland; Jesse W Mindel; Jinyoung Kim; Peter A Cistulli; Ulysses J Magalang; Allan I Pack; Philip de Chazal; Thomas Penzel
Journal: Sleep Med Rev Date: 2020-03-20 Impact factor: 11.609

6. Convergent Temperature Representations in Artificial and Biological Neural Networks.

Authors: Martin Haesemeyer; Alexander F Schier; Florian Engert
Journal: Neuron Date: 2019-07-31 Impact factor: 17.173

7. Visible Machine Learning for Biomedicine.

Authors: Michael K Yu; Jianzhu Ma; Jasmin Fisher; Jason F Kreisberg; Benjamin J Raphael; Trey Ideker
Journal: Cell Date: 2018-06-14 Impact factor: 41.582

8. Can the artificial intelligence technique of reinforcement learning use continuously-monitored digital data to optimize treatment for weight loss?

Authors: Evan M Forman; Stephanie G Kerrigan; Meghan L Butryn; Adrienne S Juarascio; Stephanie M Manasse; Santiago Ontañón; Diane H Dallal; Rebecca J Crochiere; Danielle Moskow
Journal: J Behav Med Date: 2018-08-25

9. Blending computational and experimental neuroscience.

Authors: Patricia S Churchland; Terrence J Sejnowski
Journal: Nat Rev Neurosci Date: 2016-09-09 Impact factor: 34.870

10. Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

Authors: Ying Liu; Brent Logan; Ning Liu; Zhiyuan Xu; Jian Tang; Yanzhi Wang
Journal: Healthc Inform Date: 2017-08