Literature DB >> 26819042

Mastering the game of Go with deep neural networks and tree search.

David Silver1, Aja Huang1, Chris J Maddison1, Arthur Guez1, Laurent Sifre1, George van den Driessche1, Julian Schrittwieser1, Ioannis Antonoglou1, Veda Panneershelvam1, Marc Lanctot1, Sander Dieleman1, Dominik Grewe1, John Nham2, Nal Kalchbrenner1, Ilya Sutskever2, Timothy Lillicrap1, Madeleine Leach1, Koray Kavukcuoglu1, Thore Graepel1, Demis Hassabis1.   

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and 'policy networks' to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

Entities:  

Mesh:

Year:  2016        PMID: 26819042     DOI: 10.1038/nature16961

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  504 in total

1.  Google AI algorithm masters ancient game of Go.

Authors:  Elizabeth Gibney
Journal:  Nature       Date:  2016-01-28       Impact factor: 49.962

2.  DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins.

Authors:  Hamid Reza Hassanzadeh; May D Wang
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2017-01-19

Review 3.  Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors:  Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal:  Chem Rev       Date:  2020-06-10       Impact factor: 60.622

4.  What Caused What? A Quantitative Account of Actual Causation Using Dynamical Causal Networks.

Authors:  Larissa Albantakis; William Marshall; Erik Hoel; Giulio Tononi
Journal:  Entropy (Basel)       Date:  2019-05-02       Impact factor: 2.524

Review 5.  Reinventing polysomnography in the age of precision medicine.

Authors:  Diane C Lim; Diego R Mazzotti; Kate Sutherland; Jesse W Mindel; Jinyoung Kim; Peter A Cistulli; Ulysses J Magalang; Allan I Pack; Philip de Chazal; Thomas Penzel
Journal:  Sleep Med Rev       Date:  2020-03-20       Impact factor: 11.609

6.  Convergent Temperature Representations in Artificial and Biological Neural Networks.

Authors:  Martin Haesemeyer; Alexander F Schier; Florian Engert
Journal:  Neuron       Date:  2019-07-31       Impact factor: 17.173

7.  Visible Machine Learning for Biomedicine.

Authors:  Michael K Yu; Jianzhu Ma; Jasmin Fisher; Jason F Kreisberg; Benjamin J Raphael; Trey Ideker
Journal:  Cell       Date:  2018-06-14       Impact factor: 41.582

8.  Can the artificial intelligence technique of reinforcement learning use continuously-monitored digital data to optimize treatment for weight loss?

Authors:  Evan M Forman; Stephanie G Kerrigan; Meghan L Butryn; Adrienne S Juarascio; Stephanie M Manasse; Santiago Ontañón; Diane H Dallal; Rebecca J Crochiere; Danielle Moskow
Journal:  J Behav Med       Date:  2018-08-25

9.  Blending computational and experimental neuroscience.

Authors:  Patricia S Churchland; Terrence J Sejnowski
Journal:  Nat Rev Neurosci       Date:  2016-09-09       Impact factor: 34.870

10.  Deep Reinforcement Learning for Dynamic Treatment Regimes on Medical Registry Data.

Authors:  Ying Liu; Brent Logan; Ning Liu; Zhiyuan Xu; Jian Tang; Yanzhi Wang
Journal:  Healthc Inform       Date:  2017-08
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.