| Literature DB >> 31273400 |
Trevor W Robbins1,2, Rudolf N Cardinal3,4,5.
Abstract
RATIONALE: Psychopharmacology needs novel quantitative measures and theoretical approaches based on computational modelling that can be used to help translate behavioural findings from experimental animals to humans, including patients with neuropsychiatric disorders.Entities:
Keywords: Computer modelling; Depression; Dopamine; Reinforcement learning; Schizophrenia; Serotonin
Mesh:
Substances:
Year: 2019 PMID: 31273400 PMCID: PMC6695356 DOI: 10.1007/s00213-019-05302-3
Source DB: PubMed Journal: Psychopharmacology (Berl) ISSN: 0033-3158 Impact factor: 4.530
Fig. 1Typical visual stimuli and reinforcement contingencies employed for human studies of probabilistic learning and reversal in discrete trial procedures involving forced choices between option A and option B. Participants are instructed to obtain the most rewards as possible (best achieved here by choosing exclusively the 75% rewarded stimulus). Rewarding outcomes are denoted by brief immediate feedback from the happy face presentation, and punishing outcomes by the frowning face. Following attainment of a learning criterion over a suitable number of trials, the contingencies may be reversed without warning. Such paradigms can be employed to model reinforcement learning in humans and experimental animals. The actual probabilities of reinforcement may vary from study to study. Taken from Cools et al. (2002)
Fig. 2Effects of sulpiride on reinforcement learning model parameters in human volunteers. Effects were restricted to ‘temperature’ (β) rather than learning rate (α), were for gains only, and were exaggerated by higher plasma levels of sulpiride and in participants with the A1+ genotype of the Taq1A polymorphism for DA D2 receptors. Parameter estimates of the Q-learning model were derived across drug, serum value and genotype groups, separately for the gain and loss domain. a The temperature parameter βgain was significantly higher in the sulpiride group (57% increase compared to the placebo, P = 0.005), but the learning rate αgain was not affected, and there were no effects in the loss domain (αloss, βloss). b Higher sulpiride serum values selectively affected the temperature parameter gain (183% increase in high compared to low serum values, P = 0.001), with no effects on either αgain, αloss or βloss. c Pronounced sulpiride effects on βgain were observed in A1+ genotype carriers (211% increase following sulpiride compared to placebo administration, P < 0.001), but not in A1–genotype carriers. Reproduced from Eisenegger et al. (2014) with permission of the publishers
Effects of 5-HT manipulations on probabilistic spatial reversal in rats
Fig. 3Retrodicting actual behavioural data in probabilistic learning by marmoset monkeys by the ‘winning’ (best-fitting) reinforcement learning model (right). Open squares, sham-operated group. Filled circles, amygdala 5-HT depleted group. Filled triangles, orbitofrontal 5-HT depleted group. The amygdala 5-HT depleted group exhibited impairments in responding to both ‘misleading’ and ‘truthful’ feedback, which were accurately modelled as a deficit in reinforcement sensitivity. Reproduced from Rygula et al. (2014)