Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Human-level performance in 3D multiplayer games with population-based reinforcement learning.

Literature DB >> 31147514

Human-level performance in 3D multiplayer games with population-based reinforcement learning.

Max Jaderberg¹, Wojciech M Czarnecki¹, Iain Dunning², Luke Marris², Guy Lever², Antonio Garcia Castañeda², Charles Beattie², Neil C Rabinowitz², Ari S Morcos², Avraham Ruderman², Nicolas Sonnerat², Tim Green², Louise Deason², Joel Z Leibo², David Silver², Demis Hassabis², Koray Kavukcuoglu², Thore Graepel².

Abstract

Reinforcement learning (RL) has shown great success in increasingly complex single-agent environments and two-player turn-based games. However, the real world contains multiple agents, each learning and acting independently to cooperate and compete with other agents. We used a tournament-style evaluation to demonstrate that an agent can achieve human-level performance in a three-dimensional multiplayer first-person video game, Quake III Arena in Capture the Flag mode, using only pixels and game points scored as input. We used a two-tier optimization process in which a population of independent RL agents are trained concurrently from thousands of parallel matches on randomly generated environments. Each agent learns its own internal reward signal and rich representation of the world. These results indicate the great potential of multiagent reinforcement learning for artificial intelligence research.

Entities: Species

Year: 2019 PMID： 31147514 DOI： 10.1126/science.aau6249

Source DB: PubMed Journal: Science ISSN： 0036-8075 Impact factor: 47.728

Keyword Cloud
Cited

14 in total

1. Constructing and Forgetting Temporal Context in the Human Cerebral Cortex.

Authors: Hsiang-Yun Sherry Chien; Christopher J Honey
Journal: Neuron Date: 2020-03-11 Impact factor: 17.173

2. Deep learning for cardiovascular medicine: a practical primer.

Authors: Chayakrit Krittanawong; Kipp W Johnson; Robert S Rosenson; Zhen Wang; Mehmet Aydar; Usman Baber; James K Min; W H Wilson Tang; Jonathan L Halperin; Sanjiv M Narayan
Journal: Eur Heart J Date: 2019-07-01 Impact factor: 29.983

Review 3. Direct Fit to Nature: An Evolutionary Perspective on Biological and Artificial Neural Networks.

Authors: Uri Hasson; Samuel A Nastase; Ariel Goldstein
Journal: Neuron Date: 2020-02-05 Impact factor: 17.173

4. Machine Learned Cellular Phenotypes in Cardiomyopathy Predict Sudden Death.

Authors: Albert J Rogers; Anojan Selvalingam; Mahmood I Alhusseini; David E Krummen; Cesare Corrado; Firas Abuzaid; Tina Baykaner; Christian Meyer; Paul Clopton; Wayne Giles; Peter Bailis; Steven Niederer; Paul J Wang; Wouter-Jan Rappel; Matei Zaharia; Sanjiv M Narayan
Journal: Circ Res Date: 2020-11-10 Impact factor: 17.367

Review 5. Promises and challenges of human computational ethology.

Authors: Dean Mobbs; Toby Wise; Nanthia Suthana; Noah Guzmán; Nikolaus Kriegeskorte; Joel Z Leibo
Journal: Neuron Date: 2021-06-17 Impact factor: 18.688

6. Modeling the formation of social conventions from embodied real-time interactions.

Authors: Ismael T Freire; Clement Moulin-Frier; Marti Sanchez-Fibla; Xerxes D Arsiwalla; Paul F M J Verschure
Journal: PLoS One Date: 2020-06-22 Impact factor: 3.240

7. Voxel-Based State Space Modeling Recovers Task-Related Cognitive States in Naturalistic fMRI Experiments.

Authors: Tianjiao Zhang; James S Gao; Tolga Çukur; Jack L Gallant
Journal: Front Neurosci Date: 2021-05-06 Impact factor: 4.677

8. A community-powered search of machine learning strategy space to find NMR property prediction models.

Authors: Lars A Bratholm; Will Gerrard; Brandon Anderson; Shaojie Bai; Sunghwan Choi; Lam Dang; Pavel Hanchar; Addison Howard; Sanghoon Kim; Zico Kolter; Risi Kondor; Mordechai Kornbluth; Youhan Lee; Youngsoo Lee; Jonathan P Mailoa; Thanh Tu Nguyen; Milos Popovic; Goran Rakocevic; Walter Reade; Wonho Song; Luka Stojanovic; Erik H Thiede; Nebojsa Tijanic; Andres Torrubia; Devin Willmott; Craig P Butts; David R Glowacki
Journal: PLoS One Date: 2021-07-20 Impact factor: 3.240

9. Multi-agent reinforcement learning with approximate model learning for competitive games.

Authors: Young Joon Park; Yoon Sang Cho; Seoung Bum Kim
Journal: PLoS One Date: 2019-09-11 Impact factor: 3.240

Review 10. Allosteric Regulation at the Crossroads of New Technologies: Multiscale Modeling, Networks, and Machine Learning.

Authors: Gennady M Verkhivker; Steve Agajanian; Guang Hu; Peng Tao
Journal: Front Mol Biosci Date: 2020-07-09