| Literature DB >> 34302222 |
Anni Richter1, Lieke de Boer2,3, Marc Guitart-Masip2,4, Gusalija Behnisch5, Constanze I Seidenbecher5,6, Björn H Schott5,6,7,8,9.
Abstract
Dopaminergic neurotransmission plays a pivotal role in appetitively motivated behavior in mammals, including humans. Notably, action and valence are not independent in motivated tasks, and it is particularly difficult for humans to learn the inhibition of an action to obtain a reward. We have previously observed that the carriers of the DRD2/ANKK1 TaqIA A1 allele, that has been associated with reduced striatal dopamine D2 receptor expression, showed a diminished learning performance when required to learn response inhibition to obtain rewards, a finding that was replicated in two independent cohorts. With our present study, we followed two aims: first, we aimed to replicate our finding on the DRD2/ANKK1 TaqIA polymorphism in a third independent cohort (N = 99) and to investigate the nature of the genetic effects more closely using trial-by-trial behavioral analysis and computational modeling in the combined dataset (N = 281). Second, we aimed to assess a potentially modulatory role of prefrontal dopamine availability, using the widely studied COMT Val108/158Met polymorphism as a proxy. We first report a replication of the above mentioned finding. Interestingly, after combining all three cohorts, exploratory analyses regarding the COMT Val108/158Met polymorphism suggest that homozygotes for the Met allele, which has been linked to higher prefrontal dopaminergic tone, show a lower learning bias. Our results corroborate the importance of genetic variability of the dopaminergic system in individual learning differences of action-valence interaction and, furthermore, suggest that motivational learning biases are differentially modulated by genetic determinants of striatal and prefrontal dopamine function.Entities:
Keywords: Action bias; COMT; Dopamine D2 receptor; Motivated learning; Reward learning; TaqIA
Mesh:
Substances:
Year: 2021 PMID: 34302222 PMCID: PMC8536632 DOI: 10.1007/s00702-021-02382-4
Source DB: PubMed Journal: J Neural Transm (Vienna) ISSN: 0300-9564 Impact factor: 3.575
Fig. 4A model of genetically driven contributions to the coupling of action and valence during learning. DA neurons signal positive reward prediction errors by phasic bursts and negative prediction errors by dips below baseline firing rate. While the first reinforces the direct pathway via activation of D1 receptors and thereby facilitates the future generation of go choices, the second reinforces the indirect pathway via reduced activation of D2 receptors and thus facilitates the future generation of no-go choices in comparable situations. A1 carriers would be assumed to have reduced D2 receptor-binding capacity decreasing autoinhibition of dopaminergic signaling after negative prediction errors in the indirect pathway and a shift to a more action-oriented behavioral pattern mediated by the direct pathway. COMT Val108/158Met Met carriers would be assumed to have higher frontal DA availability facilitating working memory and attentional processes. Moreover, indirect downstream effects on striatal DA regulation may add on improving performance under Pavlovian conflict in Met compared to Val homozygotes. The MNI template brain from MRIcroGL (“mni152”) was used in this illustration.
Figure adapted from Richter et al. (2014)
Fig. 1Experimental paradigm and participant performance. A Probabilistic monetary go/no-go task. Fractal cues indicate the condition—a combination of action (go or no-go) and valence (reward or punishment). On go trials, subjects press a button for the side of a circle. On no-go trials, they withhold a response. Arrows indicate rewards (upward) or punishments (downward). Horizontal bars symbolize the absence of a reward or punishment. ITI, intertrial interval. B The schematics represent for each condition the nomenclature (left), the possible outcomes and their probabilities after a go response (middle), and the possible outcomes and their probability after a no-go response (right). C Simulated choice data according to the model parameters of the winning model. Colored lines represent the simulated group mean probability of performing a go on each trial (green for go conditions, where go is the correct response; red for no-go conditions, where no-go is the correct response). Black lines indicate the group mean for participants’ actual go responses on each trial. In the plot area, each row represents one participant’s choice behavior for each trial (281 × 60 pixels). A white pixel reflects that a participant chose go on that trial; a gray pixel represents no-go. Participants made more go responses to win vs. avoid losing cues, reflecting the motivational bias. Overall, they successfully learned whether to make a go response or not (proportion of go responses increases for go cues and decreases for no-go cues). Figures (A) and (B) adapted from Richter et al. (2014)
Integrated Bayesian information criteria (iBIC) for tested models
| Model no | Model parameters | No. of parameters | Likelihood | Pseudo- | iBIC |
|---|---|---|---|---|---|
| 1 | 2 | − 23,463 | 0.498 | 46,970 | |
| 2 | 3 | − 23,314 | 0.501 | 46,695 | |
| 3 | 4 | − 21,798 | 0.534 | 43,685 | |
| 4 | 5 | − 21,334 | 0.544 | 42,779 | |
| 5 | 6 | − 21,137 | 0.548 | 42,406 | |
| − |
Boldface type: winning model statistics, ε: learning rate, ρ: weighting of reward on win trials, ρ: weighting of punishments on lose trials. ξ: irreducible noise, b: go bias, π: Pavlovian bias, iBIC: integrated Bayesian information criterion (smaller iBIC values indicate a better model fit). Descriptives for the parameters in the winning model (M ± SD): ε = 0.26 ± 0.15, ρ = 15.32 ± 13.30, ρ = 7.51 ± 4.03, ξ = 0.96 ± 0.06, b = 1.10 ± 0.74, π = 0.65 ± 0.57
Fig. 2Effects of DRD2/ANKK1 TaqIA genotype on choice performance. A and B Effects of DRD2/ANKK1 TaqIA genotype on choice performance in the third cohort (N = 99) and in the entire sample (N = 281). Compared to the A2 homozygotes, A1 carriers showed a diminished learning to withhold an action to receive a reward. Left panels: bar plots show mean differences between correct response rates (± SEM) during second half versus the first half of trials for each condition. This score represents the observed fourfold interaction of action × valence × time × genotype. Right panels: line charts show mean values of correct responses (± SEM) in the first and the second half of trials for all four conditions. Post hoc comparisons via t tests: *p < 0.05, ***p < 0.001. C Trial-by-trial proportions of go responses (± SEM) to go cues (solid lines) and no-go cues (dashed lines) across cue types. Win and avoid losing condition seperately and colors depict DRD2/ANKK1 TaqIA genotypes. TaqIA A1 carriers showed an enhanced effect of cue valence on go responding especially in the no-go to win condition with further progress of the experiment (lines are mostly separated). Adapted scripts of Swart et al. (2017) were used to generate figures
Descriptive data of the entire sample regarding DRD2/ANKK1 TaqIA and COMT Val108/158Met genotypes
| DRD2/ANKK1 TaqIA | A1 + | A1 − | A1 + > A1 − |
|---|---|---|---|
| Gender ( | 44/55 | 102/80 | |
| Age in years (M ± SD) | 25.1 ± 3.1 | 24.6 ± 2.6 | |
| Non-smokers/smokers ( | 70/29 | 143/39 | |
| COMT ( | 30/45/24 | 53/83/46 |
Demographic data are pooled across all three cohorts (cohort 1 and 2 from Richter et al. (2014), and the newly investigated cohort 3). N = number, M = mean, SD = standard deviation, VM: Val/Met heterozygotes, VV: Val homozygotes, A1 + : carriers of the A1 allele, A1 − : A2 homozygotes
Fig. 3Effects of COMT genotype on choice performance in the entire sample. Left panels: bar plots show mean differences between correct response rates (± SEM) during second half versus the first half of trials for each condition. This score represents the observed fourfold interaction of action × valence × time × genotype. Right panels: line charts show mean values of correct responses (± SEM) in the first and the second half of trials for all four conditions. Met homozygotes showed increased learning throughout the experiment in the no-go to win and go avoid losing condition relative to heterozygotes. Post hoc comparisons via t tests: *p < 0.05