Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity.

Yonatan Loewenstein, H Sebastian Seung.

Abstract

The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms are still unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.
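
The link between a covariance-driven update and matching can be illustrated with a toy simulation. The sketch below is not the authors' network model: it assumes a single parameter `w` that sets the choice probability through a sigmoid, a "baited" two-alternative reward schedule (a reward, once armed, waits until it is collected), and an update of the form reward times (activity minus expected activity), whose average is proportional to the covariance between reward and activity. The function name `simulate_matching` and all parameter values are hypothetical choices made for illustration.

```python
import math
import random


def simulate_matching(bait_probs=(0.3, 0.1), eta=0.01, trials=200_000, seed=0):
    """Toy two-alternative task with a covariance-style learning rule.

    Returns (choice_fraction, income_fraction) for alternative 0,
    measured over the second half of the run.
    """
    rng = random.Random(seed)
    w = 0.0                    # assumed single plastic parameter
    baited = [False, False]    # whether a reward is waiting at each alternative
    choices = [0, 0]
    rewards = [0.0, 0.0]
    burn_in = trials // 2      # discard the initial learning transient

    for t in range(trials):
        # Arm each empty alternative with its baiting probability;
        # an armed reward stays until that alternative is chosen.
        for i in (0, 1):
            if not baited[i] and rng.random() < bait_probs[i]:
                baited[i] = True

        # Probability of choosing alternative 0 is sigmoid(w).
        p0 = 1.0 / (1.0 + math.exp(-w))
        choice = 0 if rng.random() < p0 else 1

        reward = 1.0 if baited[choice] else 0.0
        baited[choice] = False

        # "Neural activity" N is 1 when alternative 0 is chosen, so E[N] = p0.
        # The mean of reward * (N - E[N]) equals Cov(reward, N).
        activity = 1.0 if choice == 0 else 0.0
        w += eta * reward * (activity - p0)

        if t >= burn_in:
            choices[choice] += 1
            rewards[choice] += reward

    choice_fraction = choices[0] / sum(choices)
    income_fraction = rewards[0] / sum(rewards)
    return choice_fraction, income_fraction


if __name__ == "__main__":
    choice_frac, income_frac = simulate_matching()
    print(f"fraction of choices to alternative 0: {choice_frac:.3f}")
    print(f"fraction of reward from alternative 0: {income_frac:.3f}")
```

Under these assumptions the update stops drifting when the covariance between reward and activity vanishes, which happens when both alternatives yield the same average reward per choice. At that point the fraction of choices allocated to an alternative should approximately equal the fraction of total reward obtained from it, which is the matching law described in the abstract.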

Year: 2006 PMID: 17008410 PMCID: PMC1622804 DOI: 10.1073/pnas.0505220103

Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205

21 in total

1. Matching behavior and the representation of value in the parietal cortex.

Authors: Leo P Sugrue; Greg S Corrado; William T Newsome
Journal: Science Date: 2004-06-18 Impact factor: 47.728

Review 2. Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology.

Authors: Wolfram Schultz
Journal: Curr Opin Neurobiol Date: 2004-04 Impact factor: 6.627

Review 3. Indeterminacy in brain and behavior.

Authors: Paul W Glimcher
Journal: Annu Rev Psychol Date: 2005 Impact factor: 24.137

10. Operant generalization in quail neonates after intradimensional training: Distinguishing positive and negative reinforcement.

Authors: Susan M Schneider; Robert Lickliter
Journal: Behav Processes Date: 2009-08-25 Impact factor: 1.777

1. Matching behavior and the representation of value in the parietal cortex.

Review 2. Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology.

Review 3. Indeterminacy in brain and behavior.

4. Midbrain dopamine neurons encode a quantitative reward prediction error signal.

5. Linear-Nonlinear-Poisson models of primate choice dynamics.

6. Dynamic response-by-response models of matching behavior in rhesus monkeys.

Review 7. A framework for mesencephalic dopamine systems based on predictive Hebbian learning.

8. Operant conditioning of cortical unit activity.

9. Activity in posterior parietal cortex is correlated with the relative subjective desirability of action.

Review 10. Predictive reward signal of dopamine neurons.

1. A symbolic/subsymbolic interface protocol for cognitive modeling.

2. A neural circuit model of flexible sensorimotor mapping: learning and forgetting on multiple timescales.

3. Learning reward timing in cortex through reward dependent expression of synaptic plasticity.

4. Spatial generalization in operant learning: lessons from professional basketball.

5. Optimal decision making and matching are tied through diminishing returns.

6. Striatal action-value neurons reconsidered.

7. Dynamical regimes in neural network models of matching behavior.

8. Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.

9. Synaptic theory of replicator-like melioration.

10. Operant generalization in quail neonates after intradimensional training: Distinguishing positive and negative reinforcement.