| Literature DB >> 20949101 |
Dimitris Iliopoulos1, Arend Hintze, Christoph Adami.
Abstract
The observed cooperation on the level of genes, cells, tissues, and individuals has been the object of intense study by evolutionary biologists, mainly because cooperation often flourishes in biological systems in apparent contradiction to the selfish goal of survival inherent in Darwinian evolution. In order to resolve this paradox, evolutionary game theory has focused on the Prisoner's Dilemma (PD), which incorporates the essence of this conflict. Here, we encode strategies for the iterated Prisoner's Dilemma (IPD) in terms of conditional probabilities that represent the response of decision pathways given previous plays. We find that if these stochastic strategies are encoded as genes that undergo Darwinian evolution, the environmental conditions that the strategies are adapting to determine the fixed point of the evolutionary trajectory, which could be either cooperation or defection. A transition between cooperative and defective attractors occurs as a function of different parameters such as mutation rate, replacement rate, and memory, all of which affect a player's ability to predict an opponent's behavior. These results imply that in populations of players that can use previous decisions to plan future ones, cooperation depends critically on whether the players can rely on facing the same strategies that they have adapted to. Defection, on the other hand, is the optimal adaptive response in environments that change so quickly that the information gathered from previous plays cannot usefully be integrated for a response.Entities:
Mesh:
Year: 2010 PMID: 20949101 PMCID: PMC2951343 DOI: 10.1371/journal.pcbi.1000948
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Figure 1Evolutionary trajectories and attractors.
All trajectories start at the same point (START), and move towards the strategy marked by ‘END’. Several well-known strategies provide landmarks in strategy space: TFT: (P, P, P, P) = (1,0,1,0), ALLC = (1,1,1,1), ALLD = (0,0,0,0), WSLS = (1,0,0,1), GTFT = (1,0.333,1,0), START = (0.5,0.5,0.5,0.5). All experiments shown are run in a spatially-structured environment at replacement rate r = 1%. Trajectories for well-mixed populations are shown in Fig. S3. (A) Evolution of the average LOD for μ = 0.5%. RC marks the consensus genotype (see Methods) of this trajectory, while RAND marks the consensus genotype at μ = 50%, when the population drifts neutrally. This attractor is not the same as ‘END’ because that genotype lies past the most recent common ancestor of the population. (B) Trajectory for μ = 2.5%, close to the critical mutation rate. (C) Trajectory for μ = 5%. ‘RD’ marks the consensus genotype for these parameters. (D) Trajectories emanating from 16 deterministic strategies (at μ = 0.5%) suggest that the fixed point is unique. Blue symbols: start, red dots: end points. Symbols: ⧫: TFT, ▪: ALLC, ★: ALLD, ▴: WSLS. A–D use principal components of trajectory shown in A.
Figure 2Transitions in strategy space.
(A) The order parameter m defined in Eq. [1] as a function of the mutation rate for a spatially-structured and a well-mixed population, obtained from play statistics averaged over 80 independent runs each (see Text S2). Errors are two standard errors. (B) Qualitative phase diagram as a function of μ and r for spatially-structured populations, where light grey indicates cooperation and black indicates defection. (C) Phase diagram for well-mixed populations. Both phase diagrams with quantitative levels of cooperation are shown in Fig. S4.
Figure 3Evolution of consensus genotypes.
Mean of probabilities of the consensus genotype as a function of mutation rate (r = 1%). Colored areas represent the variance of the probability distribution, and reflect the strength of selection. (P is omitted because it drifts neutrally, see Methods). Vertical lines drawn at the critical mutation rate. (A) Spatially-structured. (B) Well-mixed.
Figure 4Strategy evolution under changing mutation rates.
Order parameter m as a function of update time for an experiment with five changes in mutation rate, starting with a type adapted to a high mutation rate of 5% (defection regime). We show the order parameter for the average LOD of 80 runs with the same regime of mutation rate changes. The population reacts to a changed mutation rate quickly, and settles around the fixed point appropriate for that mutation rate, indicated in the figure.
Figure 5Order parameter in different environments for spatially-structured populations.
(A) Phase transition for populations playing with memories of different size as a function of genomic mutation rate μL, where L = 5 for memory-one strategies (D1, blue line) and L = 21 for memory-two strategies (D2, pink line). (B) Phase transition for environments with different resolutions of strategy space, from 15 bits per gene to 1 bit per gene (deterministic strategies). Colors as in legend.