| Literature DB >> 31666705 |
Oriol Vinyals1, Igor Babuschkin2, Wojciech M Czarnecki2, Michaël Mathieu2, Andrew Dudzik2, Junyoung Chung2, David H Choi2, Richard Powell2, Timo Ewalds2, Petko Georgiev2, Junhyuk Oh2, Dan Horgan2, Manuel Kroiss2, Ivo Danihelka2, Aja Huang2, Laurent Sifre2, Trevor Cai2, John P Agapiou2, Chris Apps2, David Silver3, Max Jaderberg2, Alexander S Vezhnevets2, Rémi Leblond2, Tobias Pohlen2, Valentin Dalibard2, David Budden2, Yury Sulsky2, James Molloy2, Tom L Paine2, Caglar Gulcehre2, Ziyu Wang2, Tobias Pfaff2, Yuhuai Wu2, Roman Ring2, Dani Yogatama2, Dario Wünsch4, Katrina McKinney2, Oliver Smith2, Tom Schaul2, Timothy Lillicrap2, Koray Kavukcuoglu2, Demis Hassabis2.
Abstract
Many real-world applications require artificial agents to compete and coordinate with other agents in complex environments. As a stepping stone to this goal, the domain of StarCraft has emerged as an important challenge for artificial intelligence research, owing to its iconic and enduring status among the most difficult professional esports and its relevance to the real world in terms of its raw complexity and multi-agent challenges. Over the course of a decade and numerous competitions1-3, the strongest agents have simplified important aspects of the game, utilized superhuman capabilities, or employed hand-crafted sub-systems4. Despite these advantages, no previous agent has come close to matching the overall skill of top StarCraft players. We chose to address the challenge of StarCraft using general-purpose learning methods that are in principle applicable to other complex domains: a multi-agent reinforcement learning algorithm that uses data from both human and agent games within a diverse league of continually adapting strategies and counter-strategies, each represented by deep neural networks5,6. We evaluated our agent, AlphaStar, in the full game of StarCraft II, through a series of online games against human players. AlphaStar was rated at Grandmaster level for all three StarCraft races and above 99.8% of officially ranked human players.Entities:
Mesh:
Year: 2019 PMID: 31666705 DOI: 10.1038/s41586-019-1724-z
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962