Literature DB >> 17008410

Operant matching is a generic outcome of synaptic plasticity based on the covariance between reward and neural activity.

Yonatan Loewenstein1, H Sebastian Seung.   

Abstract

The probability of choosing an alternative in a long sequence of repeated choices is proportional to the total reward derived from that alternative, a phenomenon known as Herrnstein's matching law. This behavior is remarkably conserved across species and experimental conditions, but its underlying neural mechanisms still are unknown. Here, we propose a neural explanation of this empirical law of behavior. We hypothesize that there are forms of synaptic plasticity driven by the covariance between reward and neural activity and prove mathematically that matching is a generic outcome of such plasticity. Two hypothetical types of synaptic plasticity, embedded in decision-making neural network models, are shown to yield matching behavior in numerical simulations, in accord with our general theorem. We show how this class of models can be tested experimentally by making reward not only contingent on the choices of the subject but also directly contingent on fluctuations in neural activity. Maximization is shown to be a generic outcome of synaptic plasticity driven by the sum of the covariances between reward and all past neural activities.

Mesh:

Year:  2006        PMID: 17008410      PMCID: PMC1622804          DOI: 10.1073/pnas.0505220103

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  21 in total

1.  Matching behavior and the representation of value in the parietal cortex.

Authors:  Leo P Sugrue; Greg S Corrado; William T Newsome
Journal:  Science       Date:  2004-06-18       Impact factor: 47.728

Review 2.  Neural coding of basic reward terms of animal learning theory, game theory, microeconomics and behavioural ecology.

Authors:  Wolfram Schultz
Journal:  Curr Opin Neurobiol       Date:  2004-04       Impact factor: 6.627

Review 3.  Indeterminacy in brain and behavior.

Authors:  Paul W Glimcher
Journal:  Annu Rev Psychol       Date:  2005       Impact factor: 24.137

4.  Midbrain dopamine neurons encode a quantitative reward prediction error signal.

Authors:  Hannah M Bayer; Paul W Glimcher
Journal:  Neuron       Date:  2005-07-07       Impact factor: 17.173

5.  Linear-Nonlinear-Poisson models of primate choice dynamics.

Authors:  Greg S Corrado; Leo P Sugrue; H Sebastian Seung; William T Newsome
Journal:  J Exp Anal Behav       Date:  2005-11       Impact factor: 2.468

6.  Dynamic response-by-response models of matching behavior in rhesus monkeys.

Authors:  Brian Lau; Paul W Glimcher
Journal:  J Exp Anal Behav       Date:  2005-11       Impact factor: 2.468

Review 7.  A framework for mesencephalic dopamine systems based on predictive Hebbian learning.

Authors:  P R Montague; P Dayan; T J Sejnowski
Journal:  J Neurosci       Date:  1996-03-01       Impact factor: 6.167

8.  Operant conditioning of cortical unit activity.

Authors:  E E Fetz
Journal:  Science       Date:  1969-02-28       Impact factor: 47.728

9.  Activity in posterior parietal cortex is correlated with the relative subjective desirability of action.

Authors:  Michael C Dorris; Paul W Glimcher
Journal:  Neuron       Date:  2004-10-14       Impact factor: 17.173

Review 10.  Predictive reward signal of dopamine neurons.

Authors:  W Schultz
Journal:  J Neurophysiol       Date:  1998-07       Impact factor: 2.714

View more
  33 in total

1.  A symbolic/subsymbolic interface protocol for cognitive modeling.

Authors:  Patrick Simen; Thad Polk
Journal:  Log J IGPL       Date:  2010-10-01       Impact factor: 0.861

2.  A neural circuit model of flexible sensorimotor mapping: learning and forgetting on multiple timescales.

Authors:  Stefano Fusi; Wael F Asaad; Earl K Miller; Xiao-Jing Wang
Journal:  Neuron       Date:  2007-04-19       Impact factor: 17.173

3.  Learning reward timing in cortex through reward dependent expression of synaptic plasticity.

Authors:  Jeffrey P Gavornik; Marshall G Hussain Shuler; Yonatan Loewenstein; Mark F Bear; Harel Z Shouval
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-03       Impact factor: 11.205

4.  Spatial generalization in operant learning: lessons from professional basketball.

Authors:  Tal Neiman; Yonatan Loewenstein
Journal:  PLoS Comput Biol       Date:  2014-05-22       Impact factor: 4.475

5.  Optimal decision making and matching are tied through diminishing returns.

Authors:  Jan Kubanek
Journal:  Proc Natl Acad Sci U S A       Date:  2017-07-24       Impact factor: 11.205

6.  Striatal action-value neurons reconsidered.

Authors:  Lotem Elber-Dorozko; Yonatan Loewenstein
Journal:  Elife       Date:  2018-05-31       Impact factor: 8.140

7.  Dynamical regimes in neural network models of matching behavior.

Authors:  Kiyohito Iigaya; Stefano Fusi
Journal:  Neural Comput       Date:  2013-09-18       Impact factor: 2.026

8.  Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail.

Authors:  Eleni Vasilaki; Nicolas Frémaux; Robert Urbanczik; Walter Senn; Wulfram Gerstner
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

9.  Synaptic theory of replicator-like melioration.

Authors:  Yonatan Loewenstein
Journal:  Front Comput Neurosci       Date:  2010-06-17       Impact factor: 2.380

10.  Operant generalization in quail neonates after intradimensional training: Distinguishing positive and negative reinforcement.

Authors:  Susan M Schneider; Robert Lickliter
Journal:  Behav Processes       Date:  2009-08-25       Impact factor: 1.777

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.