Literature DB >> 31147514

Human-level performance in 3D multiplayer games with population-based reinforcement learning.

Max Jaderberg1, Wojciech M Czarnecki1, Iain Dunning2, Luke Marris2, Guy Lever2, Antonio Garcia Castañeda2, Charles Beattie2, Neil C Rabinowitz2, Ari S Morcos2, Avraham Ruderman2, Nicolas Sonnerat2, Tim Green2, Louise Deason2, Joel Z Leibo2, David Silver2, Demis Hassabis2, Koray Kavukcuoglu2, Thore Graepel2.   

Abstract

Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, Quake III Arena in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents are trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research.
Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.

Entities:  

Year:  2019        PMID: 31147514     DOI: 10.1126/science.aau6249

Source DB:  PubMed          Journal:  Science        ISSN: 0036-8075            Impact factor:   47.728


  14 in total

1.  Constructing and Forgetting Temporal Context in the Human Cerebral Cortex.

Authors:  Hsiang-Yun Sherry Chien; Christopher J Honey
Journal:  Neuron       Date:  2020-03-11       Impact factor: 17.173

2.  Deep learning for cardiovascular medicine: a practical primer.

Authors:  Chayakrit Krittanawong; Kipp W Johnson; Robert S Rosenson; Zhen Wang; Mehmet Aydar; Usman Baber; James K Min; W H Wilson Tang; Jonathan L Halperin; Sanjiv M Narayan
Journal:  Eur Heart J       Date:  2019-07-01       Impact factor: 29.983

Review 3.  Direct Fit to Nature: An Evolutionary Perspective on Biological and Artificial Neural Networks.

Authors:  Uri Hasson; Samuel A Nastase; Ariel Goldstein
Journal:  Neuron       Date:  2020-02-05       Impact factor: 17.173

4.  Machine Learned Cellular Phenotypes in Cardiomyopathy Predict Sudden Death.

Authors:  Albert J Rogers; Anojan Selvalingam; Mahmood I Alhusseini; David E Krummen; Cesare Corrado; Firas Abuzaid; Tina Baykaner; Christian Meyer; Paul Clopton; Wayne Giles; Peter Bailis; Steven Niederer; Paul J Wang; Wouter-Jan Rappel; Matei Zaharia; Sanjiv M Narayan
Journal:  Circ Res       Date:  2020-11-10       Impact factor: 17.367

Review 5.  Promises and challenges of human computational ethology.

Authors:  Dean Mobbs; Toby Wise; Nanthia Suthana; Noah Guzmán; Nikolaus Kriegeskorte; Joel Z Leibo
Journal:  Neuron       Date:  2021-06-17       Impact factor: 18.688

6.  Modeling the formation of social conventions from embodied real-time interactions.

Authors:  Ismael T Freire; Clement Moulin-Frier; Marti Sanchez-Fibla; Xerxes D Arsiwalla; Paul F M J Verschure
Journal:  PLoS One       Date:  2020-06-22       Impact factor: 3.240

7.  Voxel-Based State Space Modeling Recovers Task-Related Cognitive States in Naturalistic fMRI Experiments.

Authors:  Tianjiao Zhang; James S Gao; Tolga Çukur; Jack L Gallant
Journal:  Front Neurosci       Date:  2021-05-06       Impact factor: 4.677

8.  A community-powered search of machine learning strategy space to find NMR property prediction models.

Authors:  Lars A Bratholm; Will Gerrard; Brandon Anderson; Shaojie Bai; Sunghwan Choi; Lam Dang; Pavel Hanchar; Addison Howard; Sanghoon Kim; Zico Kolter; Risi Kondor; Mordechai Kornbluth; Youhan Lee; Youngsoo Lee; Jonathan P Mailoa; Thanh Tu Nguyen; Milos Popovic; Goran Rakocevic; Walter Reade; Wonho Song; Luka Stojanovic; Erik H Thiede; Nebojsa Tijanic; Andres Torrubia; Devin Willmott; Craig P Butts; David R Glowacki
Journal:  PLoS One       Date:  2021-07-20       Impact factor: 3.240

9.  Multi-agent reinforcement learning with approximate model learning for competitive games.

Authors:  Young Joon Park; Yoon Sang Cho; Seoung Bum Kim
Journal:  PLoS One       Date:  2019-09-11       Impact factor: 3.240

Review 10.  Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning.

Authors:  Gennady M Verkhivker; Steve Agajanian; Guang Hu; Peng Tao
Journal:  Front Mol Biosci       Date:  2020-07-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.