Literature DB >> 18264774

A model of reward choice based on the theory of reinforcement learning.

I A Smirnitskaya1, A A Frolov, G Kh Merzhanova.   

Abstract

A model explaining behavioral "impulsivity" and "self-control" is proposed on the basis of the theory of reinforcement learning. The discount coefficient gamma, which in this theory accounts for the subjective reduction in the value of a delayed reinforcement, is identified with the overall level of dopaminergic neuron activity which, according to published data, also determines the behavioral variant. Computer modeling showed that high values of gamma are characteristic of predominantly "self-controlled" subjects, while smaller values of gamma are characteristic of "impulsive" subjects.

Mesh:

Substances:

Year:  2008        PMID: 18264774     DOI: 10.1007/s11055-008-0039-6

Source DB:  PubMed          Journal:  Neurosci Behav Physiol        ISSN: 0097-0549


  27 in total

Review 1.  The basal ganglia: a vertebrate solution to the selection problem?

Authors:  P Redgrave; T J Prescott; K Gurney
Journal:  Neuroscience       Date:  1999       Impact factor: 3.590

2.  Temporal difference model reproduces anticipatory neural activity.

Authors:  R E Suri; W Schultz
Journal:  Neural Comput       Date:  2001-04       Impact factor: 2.026

3.  Local and distributed neural networks and individuality.

Authors:  G Kh Merzhanova
Journal:  Neurosci Behav Physiol       Date:  2003-02

Review 4.  Computational roles for dopamine in behavioural control.

Authors:  P Read Montague; Steven E Hyman; Jonathan D Cohen
Journal:  Nature       Date:  2004-10-14       Impact factor: 49.962

5.  Choice between rewards differing in amount and delay: Toward a choice model of self control.

Authors:  L Green; M Snyderman
Journal:  J Exp Anal Behav       Date:  1980-09       Impact factor: 2.468

6.  Choice with delayed and probabilistic reinforcers: effects of prereinforcer and postreinforcer stimuli.

Authors:  J E Mazur
Journal:  J Exp Anal Behav       Date:  1998-11       Impact factor: 2.468

Review 7.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

8.  Waiting for rewards and punishments: effects of time and probability on choice.

Authors:  W Mischel; J Grusec
Journal:  J Pers Soc Psychol       Date:  1967-01

Review 9.  Predictive reward signal of dopamine neurons.

Authors:  W Schultz
Journal:  J Neurophysiol       Date:  1998-07       Impact factor: 2.714

Review 10.  Dopamine hypofunction possibly results from a defect in glutamate-stimulated release of dopamine in the nucleus accumbens shell of a rat model for attention deficit hyperactivity disorder--the spontaneously hypertensive rat.

Authors:  Vivienne Ann Russell
Journal:  Neurosci Biobehav Rev       Date:  2003-11       Impact factor: 8.989

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.