Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit.

Literature DB >> 19596631

Simple artificial neural networks that match probability and exploit and explore when confronting a multiarmed bandit.

Michael R W Dawson¹, Brian Dupuis, Marcia L Spetch, Debbie M Kelly.

Abstract

The matching law (Herrnstein 1961) states that response rates become proportional to reinforcement rates; this is related to the empirical phenomenon called probability matching (Vulkan 2000). Here, we show that a simple artificial neural network generates responses consistent with probability matching. This behavior was then used to create an operant procedure for network learning. We use the multiarmed bandit (Gittins 1989), a classic problem of choice behavior, to illustrate that operant training balances exploiting the bandit arm expected to pay off most frequently with exploring other arms. Perceptrons provide a medium for relating results from neural networks, genetic algorithms, animal learning, contingency theory, reinforcement learning, and theories of choice.

Entities: Disease

Mesh：

Year: 2009 PMID： 19596631 DOI： 10.1109/TNN.2009.2025588

Source DB: PubMed Journal: IEEE Trans Neural Netw ISSN： 1045-9227

Keyword Cloud
Cited

2 in total

1. Get out of the corner: Inhibition and the effect of location type and number on perceptron and human reorientation.

Authors: Brian Dupuis; Michael R W Dawson
Journal: Learn Behav Date: 2013-12 Impact factor: 1.986

2. Probability matching in perceptrons: Effects of conditional dependence and linear nonseparability.

Authors: Michael R W Dawson; Maya Gupta
Journal: PLoS One Date: 2017-02-17 Impact factor: 3.240

2 in total