Grandmaster level in StarCraft II using multi-agent reinforcement learning.

Oriol Vinyals, Igor Babuschkin, Wojciech M Czarnecki, Michaël Mathieu, Andrew Dudzik, Junyoung Chung, David H Choi, Richard Powell, Timo Ewalds, Petko Georgiev, Junhyuk Oh, Dan Horgan, Manuel Kroiss, Ivo Danihelka, Aja Huang, Laurent Sifre, Trevor Cai, John P Agapiou, Chris Apps, David Silver, Max Jaderberg, Alexander S Vezhnevets, Rémi Leblond, Tobias Pohlen, Valentin Dalibard, David Budden, Yury Sulsky, James Molloy, Tom L Paine, Caglar Gulcehre, Ziyu Wang, Tobias Pfaff, Yuhuai Wu, Roman Ring, Dani Yogatama, Dario Wünsch, Katrina McKinney, Oliver Smith, Tom Schaul, Timothy Lillicrap, Koray Kavukcuoglu, Demis Hassabis.

Abstract

Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has emerged as an important challenge for artificial intelligence research, owing to its iconic and enduring status among the most difficult professional esports and its relevance to the real world in terms of its raw complexity and multi-agent challenges. Over the course of a decade and numerous competitions [1-3], the strongest agents have simplified important aspects of the game, utilized superhuman capabilities, or employed hand-crafted sub-systems [4]. Despite these advantages, no previous agent has come close to matching the overall skill of top StarCraft players. We chose to address the challenge of StarCraft using general-purpose learning methods that are in principle applicable to other complex domains: a multi-agent reinforcement learning algorithm that uses data from both human and agent games within a diverse league of continually adapting strategies and counter-strategies, each represented by deep neural networks [5,6]. We evaluated our agent, AlphaStar, in the full game of StarCraft II, through a series of online games against human players. AlphaStar was rated at Grandmaster level for all three StarCraft races and above 99.8% of officially ranked human players.

Year:  2019        PMID: 31666705     DOI: 10.1038/s41586-019-1724-z

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  50 in total

1.  Human locomotion with reinforcement learning using bioinspired reward reshaping strategies.

Authors:  Katharine Nowakowski; Philippe Carvalho; Jean-Baptiste Six; Yann Maillet; Anh Tu Nguyen; Ismail Seghiri; Loick M'Pemba; Theo Marcille; Sy Toan Ngo; Tien-Tuan Dao
Journal:  Med Biol Eng Comput       Date:  2021-01-08       Impact factor: 2.602

2.  The scientific events that shaped the decade.

Authors: 
Journal:  Nature       Date:  2019-12       Impact factor: 49.962

3.  Transforming task representations to perform novel tasks.

Authors:  Andrew K Lampinen; James L McClelland
Journal:  Proc Natl Acad Sci U S A       Date:  2020-12-10       Impact factor: 11.205

4.  What do Reinforcement Learning Models Measure? Interpreting Model Parameters in Cognition and Neuroscience.

Authors:  Maria K Eckstein; Linda Wilbrecht; Anne G E Collins
Journal:  Curr Opin Behav Sci       Date:  2021-07-03

5.  Future Challenges in Plant Systems Biology. (Review)

Authors:  Mikaël Lucas
Journal:  Methods Mol Biol       Date:  2022

6.  Inferring learning rules from animal decision-making.

Authors:  Zoe C Ashwood; Nicholas A Roy; Ji Hyun Bak; Jonathan W Pillow
Journal:  Adv Neural Inf Process Syst       Date:  2020

7.  Continuous decisions.

Authors:  Seng Bum Michael Yoo; Benjamin Yost Hayden; John M Pearson
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-01-11       Impact factor: 6.237

8.  Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms. (Review)

Authors:  Mohammed AlQuraishi; Peter K Sorger
Journal:  Nat Methods       Date:  2021-10-04       Impact factor: 28.547

9.  Promises and challenges of human computational ethology. (Review)

Authors:  Dean Mobbs; Toby Wise; Nanthia Suthana; Noah Guzmán; Nikolaus Kriegeskorte; Joel Z Leibo
Journal:  Neuron       Date:  2021-06-17       Impact factor: 18.688

10.  Learning Macromanagement in Starcraft by Deep Reinforcement Learning.

Authors:  Wenzhen Huang; Qiyue Yin; Junge Zhang; Kaiqi Huang
Journal:  Sensors (Basel)       Date:  2021-05-11       Impact factor: 3.576
