| Literature DB >> 33057134 |
Nikoleta E Glynatsi1,2, Vincent A Knight3.
Abstract
Memory-one strategies are a set of Iterated Prisoner's Dilemma strategies that have been praised for their mathematical tractability and performance against single opponents. This manuscript investigates best response memory-one strategies with a theory of mind for their opponents. The results add to the literature that has shown that extortionate play is not always optimal by showing that optimal play is often not extortionate. They also provide evidence that memory-one strategies suffer from their limited memory in multi agent interactions and can be out performed by optimised strategies with longer memory. We have developed a theory that has allowed to explore the entire space of memory-one strategies. The framework presented is suitable to study memory-one strategies in the Prisoner's Dilemma, but also in evolutionary processes such as the Moran process. Furthermore, results on the stability of defection in populations of memory-one strategies are also obtained.Entities:
Mesh:
Year: 2020 PMID: 33057134 PMCID: PMC7560663 DOI: 10.1038/s41598-020-74181-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1SEE distribution for best response in tournaments with .
SSE of best response memory-one when .
| Mean | Std | 5% | 50% | 95% | Max | Median | Skew | Kurt |
|---|---|---|---|---|---|---|---|---|
| 0.34 | 0.40 | 0.028 | 0.17 | 1.05 | 2.47 | 0.17 | 1.87 | 3.60 |
Figure 2The ratio between the utilities of Gambler and best response memory-one strategy for 152 different pair of opponents.
Figure 3Results for the best response player in a dynamic Moran process. The ratio is taken as the ratio of of the dynamically updating player to the fixation probability of a best response player that does not update as the population density changes.