| Literature DB >> 27476604 |
V Sasidevan1, Sitabhra Sinha1.
Abstract
Strategies incorporating direct reciprocity, e.g., Tit-for-Tat and Pavlov, have been shown to be successful for playing the Iterated Prisoners Dilemma (IPD), a paradigmatic problem for studying the evolution of cooperation among non-kin individuals. However it is an open question whether such reciprocal strategies can emerge as the rational outcome of repeated interactions between selfish agents. Here we show that adopting a co-action perspective, which takes into account the symmetry between agents - a relevant consideration in biological and social contexts - naturally leads to such a strategy. For a 2-player IPD, we show that the co-action solution corresponds to the Pavlov strategy, thereby providing a rational basis for it. For an IPD involving many players, an instance of the Public Goods game where cooperation is generally considered to be harder to achieve, we show that the cooperators always outnumber defectors in the co-action equilibrium. This can be seen as a generalization of Pavlov to contests involving many players. In general, repeated interactions allow rational agents to become aware of the inherent symmetry of their situation, enabling them to achieve robust cooperation through co-action strategies - which, in the case of IPD, is a reciprocal Pavlovian one.Entities:
Year: 2016 PMID: 27476604 PMCID: PMC4967866 DOI: 10.1038/srep30831
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) A generic representation of the payoff matrix for a two-person PD (T > R > P > 0, with the sucker’s payoff S assumed to be zero for convenience). At every round, each agent can choose one of two possible actions, cooperate (C) or defect (D). For each pair of actions, the first entry in each payoff pair belongs to Agent A while the second belongs to Agent B. (b) Representation of the Pavlov strategy corresponding to the co-action solution of the two-agent IPD. The arrows connect the optimal actions of Agents A and B (in order) in the present round to information about their actions in the previous round.
Figure 2Representation of possible payoffs in an N-player IPD under the co-action framework, when i agents cooperated and N−i agents defected in the previous round.