Literature DB >> 21397610

Numerical analysis of a reinforcement learning model with the dynamic aspiration level in the iterated Prisoner's dilemma.

Naoki Masuda1, Mitsuhiro Nakamura.   

Abstract

Humans and other animals can adapt their social behavior in response to environmental cues including the feedback obtained through experience. Nevertheless, the effects of the experience-based learning of players in evolution and maintenance of cooperation in social dilemma games remain relatively unclear. Some previous literature showed that mutual cooperation of learning players is difficult or requires a sophisticated learning model. In the context of the iterated Prisoner's dilemma, we numerically examine the performance of a reinforcement learning model. Our model modifies those of Karandikar et al. (1998), Posch et al. (1999), and Macy and Flache (2002) in which players satisfy if the obtained payoff is larger than a dynamic threshold. We show that players obeying the modified learning mutually cooperate with high probability if the dynamics of threshold is not too fast and the association between the reinforcement signal and the action in the next round is sufficiently strong. The learning players also perform efficiently against the reactive strategy. In evolutionary dynamics, they can invade a population of players adopting simpler but competitive strategies. Our version of the reinforcement learning model does not complicate the previous model and is sufficiently simple yet flexible. It may serve to explore the relationships between learning and evolution in social dilemma situations.
Copyright © 2011 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2011        PMID: 21397610     DOI: 10.1016/j.jtbi.2011.03.005

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  3 in total

1.  Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin.

Authors:  Takahiro Ezaki; Yutaka Horita; Masanori Takezawa; Naoki Masuda
Journal:  PLoS Comput Biol       Date:  2016-07-20       Impact factor: 4.475

2.  Reinforcement learning accounts for moody conditional cooperation behavior: experimental results.

Authors:  Yutaka Horita; Masanori Takezawa; Keigo Inukai; Toshimasa Kita; Naoki Masuda
Journal:  Sci Rep       Date:  2017-01-10       Impact factor: 4.379

3.  Reinforcement learning account of network reciprocity.

Authors:  Takahiro Ezaki; Naoki Masuda
Journal:  PLoS One       Date:  2017-12-08       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.