| Literature DB >> 31218428 |
Johan Alsiö1, Benjamin U Phillips2,3, Júlia Sala-Bayo2, Simon R O Nilsson2,4,5, Teresa C Calafat-Pla2, Arazo Rizwand2, Jessica M Plumbridge2, Laura López-Cruz2, Jeffrey W Dalley2,6, Rudolf N Cardinal2,6,7, Adam C Mar2,4,5, Trevor W Robbins2.
Abstract
RATIONALE: Dopamine D2-like receptors (D2R) are important drug targets in schizophrenia and Parkinson's disease, but D2R ligands also cause cognitive inflexibility such as poor reversal learning. The specific role of D2R in reversal learning remains unclear.Entities:
Keywords: Cognition; Cognitive flexibility; Computational modelling; Dopamine; Dopamine D1 receptor; Dopamine D2 receptor; Hierarchical Bayesian analysis; Rat; Reinforcement learning; Reversal learning
Mesh:
Substances:
Year: 2019 PMID: 31218428 PMCID: PMC6695374 DOI: 10.1007/s00213-019-05296-y
Source DB: PubMed Journal: Psychopharmacology (Berl) ISSN: 0033-3158 Impact factor: 4.530
Experimental cohorts and stimuli used in the visual tasks
Stimulus-outcome contingencies were counterbalanced across drug groups. A different probe stimulus (C50/50) exemplar was tested in some rats in the task development experiment (not shown). TSVR, touchscreen serial visual reversal task; VPVD, valence-probe visual discrimination task; PRL, probabilistic reversal learning task (spatial)
Fig. 1Trial structure and learning curves for the three trial types in the valence-probe visual discrimination (VPVD) reversal task. a Example trial sequence. b Regular two-choice trials during both visual discrimination (A+ > B−) and subsequent reversal learning (A− < B+) are interleaved with “probe” trials. During such trials, a third stimulus that is probabilistically linked to reward (50/50% chance of reward/no reward; C50/50) is presented with either the positive or the negative stimulus. c) Rats are below chance on the first day of reversal on all three trial types, indicating the influence of previously learned associations in the form of both stimulus perseveration (preference for previously rewarded stimulus, A−, over C50/50) and learned non-reward (avoiding previously non-rewarded stimulus, B+, when presented with C50/50). Choice behaviour on probe trials over the course of the reversal indicates how much the animals have learned from positive and negative feedback, respectively. Learning curves show mean ± standard error of the mean (SEM)
Latencies to respond at the screen and to collect sucrose pellets on rewarded trials
| Experiment | Response latency (ms) | Collection latency (ms) |
|---|---|---|
| Expt. 2 (VPVD) | ||
| Quinpirole | ||
| Vehicle | 1066 ± 38 | 1529 ± 50 |
| 0.01 mg/kg | 1231 ± 76 | 1786 ± 114 |
| 0.025 mg/kg | 1261 ± 119 | 1916 ± 133* |
| 0.1 mg/kg | 1179 ± 59 | 2257 ± 104*** |
| 0.25 mg/kg | 1183 ± 126 | 2559 ± 118*** |
| 0.5 mg/kg | 1213 ± 93 | 2556 ± 102*** |
| Expt. 3 (VPVD) | ||
| SKF81297 | ||
| Vehicle | 1122 ± 65 | 1379 ± 45 |
| 0.1 mg/kg | 1279 ± 98 | 1436 ± 50 |
| 0.25 mg/kg | 1064 ± 74 | 1612 ± 70** |
| Expt. 4 (TSVR) | ||
| SCH39166 | ||
| Vehicle | 1061 ± 71 | 970 ± 61 |
| 0.025 mg/kg | 1137 ± 69 | 1126 ± 68 |
| 0.05 mg/kg | 1167 ± 72 | 1227 ± 106 |
| 0.1 mg/kg | 1190 ± 69* | 1438 ± 122* |
| Raclopride | ||
| Vehicle | 1247 ± 91 | 968 ± 73 |
| 0.015 mg/kg | 1171 ± 78 | 1013 ± 68 |
| 0.03 mg/kg | 1300 ± 119 | 1168 ± 98* |
| 0.06 mg/kg | 1386 ± 118* | 1392 ± 134* |
| Expt. 4 (VPVD) | ||
| Vehicle | 1047 ± 39 | 1344 ± 45 |
| SCH39166 (0.05) | 1209 ± 72 | 1508 ± 76 |
| Raclopride (0.03) | 1094 ± 66 | 1424 ± 41 |
| Expt. 5 (PRL) | ||
| Quinpirole | ||
| Vehicle | 1155 ± 179 | 1706 ± 72 |
| 0.025 mg/kg | 2810 ± 467*** | 2038 ± 94** |
| 0.1 mg/kg | 4608 ± 639** | 3478 ± 905* |
| 0.25 mg/kg | 5321 ± 1171** | 2879 ± 243** |
VPVD, valence-probe visual discrimination; TSVR, touchscreen serial visual reversal task; PRL, probabilistic reversal learning task. Group data are mean ± standard error of the mean (SEM), collapsed across sessions and reversal phases (Early, Mid, Late) in the TSVR and VPVD task. *p < 0.05; **p < 0.01; ***p < 0.001 vs. vehicle-treated rats in each experiment
Fig. 2The dopamine D2-like receptor agonist quinpirole impaired visual reversal learning in the novel VPVD reversal task (n = 7–8 for each group). a Quinpirole at 0.25 mg/kg reduced correct responses on standard A− < B+ trials. b Quinpirole at 0.25 mg/kg increased the number of errors on standard A− < B+ trials across the learning phases (early: < 11 correct in any 30 trials; late: > 19 correct in any 30 trials, but before the criterion of 24 correct). c There was no effect on performance on the B+ > C50/50 trials, indicating intact learning from positive feedback. d Quinpirole dose-dependently impaired performance on the A− < C50/50 trials, indicating impaired learning from losses. Note that rats treated with quinpirole 0.25 mg/kg fail to improve over chance performance on negative probe trials across the 14 days of testing. Graphs show mean ± SEM for each dose and session
Fig. 3Lack of impact of the D1-receptor agonist SKF81297 on reversal learning in the VPVD task. SKF81297 did not affect learning overall at either 0.1 mg/kg (n = 16) or 0.25 mg/kg (n = 16) vs. vehicle (n = 16). a %Correct on standard A− < B+ trials across the 14 days of treatment. b Numbers of errors on standard A− < B+ trials (probe trials excluded) during early, mid, and late phases of the reversal. c No significant effect of SKF81297 on performance on positive probe trials. d) No effect of SKF81297 on choice behaviour on negative probe trials. Graphs show mean ± SEM for each dose and session
Fig. 4Performance on the VPVD reversal task after dopamine receptor antagonism. D2-like receptor antagonism (raclopride; 0.03 mg/kg; n = 13) and D1-like receptor antagonism (SCH39166; 0.05 mg/kg, n = 14) had no significant effect versus vehicle treatment (saline; n = 20). a %Correct over each of 10 sessions. b No effect on number of errors on standard A− < B+ trials committed during each of three learning phases. c, d Performance on probe trials. Raclopride and SCH39166 did not significantly affect learning overall on either positive (B+ > C50/50) or negative (A− < C50/50) trials. The graphs show mean ± SEM for each dose and session
Fig. 5The D2R agonist quinpirole impaired reversal learning in the PRL task. a Dose-dependent decrease in the number of reversals completed after quinpirole injections (mean ± SEM). b, c Quinpirole impaired both win-stay (b) and lose-shift (c) performance (mean ± SEM). d 95% highest posterior density intervals (HDI) for parameters estimated by hierarchical Bayesian analysis of trial-by-trial choice data from the PRL task. The best model was a reinforcement learning account with separate learning rates for wins (αwin) and losses (αloss), inverse temperature (β), and side stickiness (τ). Quinpirole 0.25 mg/kg impaired learning rate after losses (αloss) without affecting learning rate for wins (αwin). This dose also increased the inverse temperature. e–g Using the winning model, we simulated rats performing the reversal task in silico and updated their expected outcomes (Q values; see Supplementary Online Material) on a trial-by-trial basis using feedback such as probabilistically rewarded responses and reversals after 8 correct responses in a row. For each simulated group (n = 40/dose; graphs show mean ± SEM), parameter values were randomly drawn from the estimated distribution of the actual rats at the corresponding dose. e Dose-dependent decrease in the number of reversals in the simulation. f, g Win-stay and lose-shift analysis of choice data from the simulated rats reveals that the behaviour of the actual rats is recovered by the winning model. *p < 0.05 vs. vehicle; ***p < 0.001 vs vehicle; #the 95% HDI for the difference score (vs. vehicle) excluded zero, i.e. there is a > 95% probability that the drug effect was non-zero
Fig. 6Simulations reveal that the effects of quinpirole 0.25 mg/kg on the αloss parameter is sufficient and necessary to drive reversal learning impairment. a Test for sufficiency of the αloss parameter to drive reversal impairment. In simulated vehicle-treated rats (“All vehicle”; see Fig. 5), impaired reversal learning is observed when the αloss parameter is replaced by values drawn from the estimated distribution of quinpirole 0.25 mg/kg rats (“All veh.:Quinp. αloss”). In contrast, replacing the αloss parameter with β drawn from the distribution of quinpirole 0.25 mg/kg rats (“All veh.: Quinp. β”) actually improves simulated performance, as measured by reversals completed. b The same pattern is observed on win-stay probabilities, where “All veh.:Quinp. αloss” rats perform worse than both “All vehicle” and “All veh.: Quinp. β rats”. c On lose-shift probabilities, simulated vehicle-treated rats with either the αloss or the β drawn from the distribution of quinpirole 0.25 mg/kg rats display impaired performance on the virtual task. d Test for necessity of the αloss parameter to drive reversal impairment. Simulated 0.25 mg/kg quinpirole rats (“All quinpirole 0.25”) perform the virtual task poorly (cf. Figure 5). Replacing the αloss of these simulated rats with the values drawn from the distribution of vehicle-treated rats (“All quinp.:Veh. αloss”) restores performance on the virtual task. In contrast, replacing only the β with values drawn from vehicle rats (“All quinp.:Veh. β”) does not improve performance as measured by the number of reversals. e “All quinp.:Veh. αloss” rats outperform both “All quinpirole 0.25” and “All quinp.:Veh. β” on the win-stay probabilities. f On lose-shift probabilities, there were no differences between “All quinpirole 0.25” rats and the “All quinp.:Veh. αloss” and “All quinpirole 0.25” groups. Graphs show mean ± SEM; n = 40 for each condition