Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Literature DB >> 31666705

Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Oriol Vinyals¹, Igor Babuschkin², Wojciech M Czarnecki², Michaël Mathieu², Andrew Dudzik², Junyoung Chung², David H Choi², Richard Powell², Timo Ewalds², Petko Georgiev², Junhyuk Oh², Dan Horgan², Manuel Kroiss², Ivo Danihelka², Aja Huang², Laurent Sifre², Trevor Cai², John P Agapiou², Chris Apps², David Silver³, Max Jaderberg², Alexander S Vezhnevets², Rémi Leblond², Tobias Pohlen², Valentin Dalibard², David Budden², Yury Sulsky², James Molloy², Tom L Paine², Caglar Gulcehre², Ziyu Wang², Tobias Pfaff², Yuhuai Wu², Roman Ring², Dani Yogatama², Dario Wünsch⁴, Katrina McKinney², Oliver Smith², Tom Schaul², Timothy Lillicrap², Koray Kavukcuoglu², Demis Hassabis².

Abstract

Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has emerged as an important challenge for artificial intelligence research, owing to its iconic and enduring status among the most difficult professional esports and its relevance to the real world in terms of its raw complexity and multi-agent challenges. Over the course of a decade and numerous competitions1-3, the strongest agents have simplified important aspects of the game, utilized superhuman capabilities, or employed hand-crafted sub-systems4. Despite these advantages, no previous agent has come close to matching the overall skill of top StarCraft players. We chose to address the challenge of StarCraft using general-purpose learning methods that are in principle applicable to other complex domains: a multi-agent reinforcement learning algorithm that uses data from both human and agent games within a diverse league of continually adapting strategies and counter-strategies, each represented by deep neural networks5,6. We evaluated our agent, AlphaStar, in the full game of StarCraft II, through a series of online games against human players. AlphaStar was rated at Grandmaster level for all three StarCraft races and above 99.8% of officially ranked human players.

Entities: Chemical

Mesh：

Year: 2019 PMID： 31666705 DOI： 10.1038/s41586-019-1724-z

Source DB: PubMed Journal: Nature ISSN： 0028-0836 Impact factor: 49.962

Keyword Cloud
Cited

50 in total

1. Human locomotion with reinforcement learning using bioinspired reward reshaping strategies.

Authors: Katharine Nowakowski; Philippe Carvalho; Jean-Baptiste Six; Yann Maillet; Anh Tu Nguyen; Ismail Seghiri; Loick M'Pemba; Theo Marcille; Sy Toan Ngo; Tien-Tuan Dao
Journal: Med Biol Eng Comput Date: 2021-01-08 Impact factor: 2.602

2. The scientific events that shaped the decade.

Authors:
Journal: Nature Date: 2019-12 Impact factor: 49.962

3. Transforming task representations to perform novel tasks.

Authors: Andrew K Lampinen; James L McClelland
Journal: Proc Natl Acad Sci U S A Date: 2020-12-10 Impact factor: 11.205

4. What do Reinforcement Learning Models Measure? Interpreting Model Parameters in Cognition and Neuroscience.

Authors: Maria K Eckstein; Linda Wilbrecht; Anne G E Collins
Journal: Curr Opin Behav Sci Date: 2021-07-03

Review 5. Future Challenges in Plant Systems Biology.

Authors: Mikaël Lucas
Journal: Methods Mol Biol Date: 2022

6. Inferring learning rules from animal decision-making.

Authors: Zoe C Ashwood; Nicholas A Roy; Ji Hyun Bak; Jonathan W Pillow
Journal: Adv Neural Inf Process Syst Date: 2020

7. Continuous decisions.

Authors: Seng Bum Michael Yoo; Benjamin Yost Hayden; John M Pearson
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2021-01-11 Impact factor: 6.237

Review 8. Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms.

Authors: Mohammed AlQuraishi; Peter K Sorger
Journal: Nat Methods Date: 2021-10-04 Impact factor: 28.547

Review 9. Promises and challenges of human computational ethology.

Authors: Dean Mobbs; Toby Wise; Nanthia Suthana; Noah Guzmán; Nikolaus Kriegeskorte; Joel Z Leibo
Journal: Neuron Date: 2021-06-17 Impact factor: 18.688

10. Learning Macromanagement in Starcraft by Deep Reinforcement Learning.

Authors: Wenzhen Huang; Qiyue Yin; Junge Zhang; Kaiqi Huang
Journal: Sensors (Basel) Date: 2021-05-11 Impact factor: 3.576