Literature DB >> 33500936

Space Debris Removal: Learning to Cooperate and the Price of Anarchy.

Richard Klima1, Daan Bloembergen2, Rahul Savani1, Karl Tuyls1, Alexander Wittig3, Andrei Sapera4, Dario Izzo4.   

Abstract

In this paper we study space debris removal from a game-theoretic perspective. In particular we focus on the question whether and how self-interested agents can cooperate in this dilemma, which resembles a tragedy of the commons scenario. We compare centralised and decentralised solutions and the corresponding price of anarchy, which measures the extent to which competition approximates cooperation. In addition we investigate whether agents can learn optimal strategies by reinforcement learning. To this end, we improve on an existing high fidelity orbital simulator, and use this simulator to obtain a computationally efficient surrogate model that can be used for our subsequent game-theoretic analysis. We study both single- and multi-agent approaches using stochastic (Markov) games and reinforcement learning. The main finding is that the cost of a decentralised, competitive solution can be significant, which should be taken into consideration when forming debris removal strategies.
Copyright © 2018 Klima, Bloembergen, Savani, Tuyls, Wittig, Sapera and Izzo.

Entities:  

Keywords:  active debris removal; markov decision process; price of anarchy; space debris; tragedy of the commons

Year:  2018        PMID: 33500936      PMCID: PMC7806007          DOI: 10.3389/frobt.2018.00054

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  4 in total

1.  Planetary science. Risks in space from orbiting debris.

Authors:  J C Liou; N L Johnson
Journal:  Science       Date:  2006-01-20       Impact factor: 47.728

2.  The tragedy of the commons. The population problem has no technical solution; it requires a fundamental extension in morality.

Authors:  G Hardin
Journal:  Science       Date:  1968-12-13       Impact factor: 47.728

3.  Stochastic Games.

Authors:  L S Shapley
Journal:  Proc Natl Acad Sci U S A       Date:  1953-10       Impact factor: 11.205

4.  Directional learning and the provisioning of public goods.

Authors:  Heinrich H Nax; Matjaž Perc
Journal:  Sci Rep       Date:  2015-01-26       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.