| Literature DB >> 25899069 |
Maarten Speekenbrink1, Emmanouil Konstantinidis.
Abstract
Decision making in noisy and changing environments requires a fine balance between exploiting knowledge about good courses of action and exploring the environment in order to improve upon this knowledge. We present an experiment on a restless bandit task in which participants made repeated choices between options for which the average rewards changed over time. Comparing a number of computational models of participants' behavior in this task, we find evidence that a substantial number of them balanced exploration and exploitation by considering the probability that an option offers the maximum reward out of all the available options.Entities:
Keywords: Dynamic decision making; Exploration-exploitation trade-off; Restless multi-armed bandit task; Uncertainty; Volatility
Mesh:
Year: 2015 PMID: 25899069 DOI: 10.1111/tops.12145
Source DB: PubMed Journal: Top Cogn Sci ISSN: 1756-8757