| Literature DB >> 30563856 |
Lieke de Boer1, Jan Axelsson2,3, Rumana Chowdhury4, Katrine Riklund2,3, Raymond J Dolan5, Lars Nyberg2,3,6, Lars Bäckman7, Marc Guitart-Masip7,5.
Abstract
Learning to act to obtain reward and inhibit to avoid punishment is easier compared with learning the opposite contingencies. This coupling of action and valence is often thought of as a Pavlovian bias, although recent research has shown it may also emerge through instrumental mechanisms. We measured this learning bias with a rewarded go/no-go task in 60 adults of different ages. Using computational modeling, we characterized the bias as being instrumental. To assess the role of endogenous dopamine (DA) in the expression of this bias, we quantified DA D1 receptor availability using positron emission tomography (PET) with the radioligand [11C]SCH23390. Using principal-component analysis on the binding potentials in a number of cortical and striatal regions of interest, we demonstrated that cortical, dorsal striatal, and ventral striatal areas provide independent sources of variance in DA D1 receptor availability. Interindividual variation in the dorsal striatal component was related to the strength of the instrumental bias during learning. These data suggest at least three anatomical sources of variance in DA D1 receptor availability separable using PET in humans, and we provide evidence that human dorsal striatal DA D1 receptors are involved in the modulation of instrumental learning biases.Entities:
Keywords: Pavlovian bias; decision making; dopamine; instrumental learning; positron emission tomography
Mesh:
Substances:
Year: 2018 PMID: 30563856 PMCID: PMC6320523 DOI: 10.1073/pnas.1816704116
Source DB: PubMed Journal: Proc Natl Acad Sci U S A ISSN: 0027-8424 Impact factor: 11.205
Fig. 1.(A) Schematic representation of the rewarded go/no-go task. On each trial, participants were presented with one of four fractals. After a variable delay of 250–2,000 ms, they were presented with a target circle. A “go” was counted as a button press on the same side as the target within 1,500 ms of target presentation. After another delay of 1,500 ms, participants were presented with 80/20 probabilistic feedback. (B) Performance on each trial type. All participants are presented as individual data points in gray. The 95% confidence interval around the mean on each condition is presented in blue color. (C) Model parameters of the winning model were used to generate simulated choice data. The simulated group mean probability of performing a go on each trial is plotted in colored lines (green for go conditions, where go is the correct response; red for no-go conditions, where no-go is the correct response). The group mean for participants’ actual performance is plotted in black lines, reflecting the proportion of actual go responses on each trial. In the plot area, each row represents one participant’s choice behavior. Forty-five pixels, one per trial, make up each row. A white pixel reflects that a participant chose go on that trial; a gray pixel represents no-go.
Model comparison for the six models that were used to account for the behavioral data
| Model no. | Model parameters | No. of parameters | Likelihood | Pseudo- | iBIC |
| 1 | 5 | −3,484 | 0.32 | 7,057 | |
| 2 | 6 | −3,472 | 0.32 | 7,051 | |
| 4 | 6 | −3,464 | 0.32 | 7,042 | |
| 5 | 7 | −3,446 | 0.33 | 7,016 | |
| 6 | 7 | −3,482 | 0.32 | 7,089 |
The winning model statistics are presented in boldface type. Parameters: ε, learning rate; ρwin, weighting of reward on win trials; ρlose, weighting of punishments on lose trials; b, go bias; π, Pavlovian bias; ξ, irreducible noise; κ, instrumental learning bias. iBIC, integrated Bayesian information criterion.
Summary statistics for the parameters in the winning model
| Parameter | Mean | SD | Min | Q0.25 | Median | Q0.75 | Max | Mean old | Mean young | |
| 16.77 | 10.34 | 3.21 | 9.42 | 13.98 | 21.45 | 56.91 | 18.13 | 15.81 | 0.610 | |
| 7.17 | 5.42 | 1.51 | 2.59 | 6.22 | 10.18 | 23.58 | 3.95 | 9.46 | <0.001 | |
| 0.09 | 0.08 | 0.01 | 0.04 | 0.06 | 0.11 | 0.31 | 0.06 | 0.12 | 0.001 | |
| 0.89 | 0.10 | 0.59 | 0.88 | 0.92 | 0.94 | 0.97 | 0.85 | 0.91 | 0.006 | |
| 0.15 | 0.98 | −1.55 | −0.60 | 0.18 | 1.11 | 1.97 | −0.08 | 0.32 | 0.214 | |
| 0.76 | 0.71 | −0.52 | 0.27 | 0.77 | 1.12 | 3.01 | 0.90 | 0.66 | 0.610 |
Parameters: ρwin, weighting of reward on win trials; ρlose, weighting of punishments on lose trials; «, learning rate; ξ, irreducible noise; b, go bias; κ, instrumental learning bonus.
Correlation coefficients for bivariate correlations between age-corrected BPND values in different ROIs and measures of the behavioral bias coupling action and valence
| Measure of behavioral bias | Caudate | Putamen | NAcc | BAs 44, 45, 46, 9 | lOFC/vmPFC | BAs 4, 6 | IPL |
| Instrumental parameter | 0.48** | 0.52*** | 0.26 | 0.43** | 0.39* | 0.28’ | 0.34* |
| No-go to win, % correct | −0.40* | −0.46** | −0.26 | −0.40* | −0.27’ | −0.19 | −0.24 |
| Behavioral effect on win trials | 0.37* | 0.43** | 0.20 | 0.34* | 0.18 | 0.13 | 0.19 |
| Interaction score | 0.30’ | 0.44** | 0.09 | 0.32* | 0.15 | 0.14 | 0.23 |
‘P < 0.1, *P < 0.05, **P < 0.01, and ***P < 0.001.
Fig. 2.Correlation matrix that shows the correlations between age-corrected BPND values in the ROIs selected for analysis.
Component loadings for each ROI
| Region of interest | Component 1 | Component 2 | Component 3 |
| Caudate | 0.39 | 0.18 | |
| Putamen | 0.32 | 0.34 | |
| NAcc | 0.24 | 0.26 | |
| dlPFC/vlPFC: BAs 9, 44, 45, 46 | 0.48 | 0.22 | |
| Limbic PFC: lateral/medial OFC | 0.39 | 0.45 | |
| Premotor PFC: BAs 4, 6 | 0.19 | 0.17 | |
| Parietal cortex: IPL | 0.46 | 0.20 |
The component that each ROI loaded on most strongly is displayed in boldface type.
Fig. 3.Correlations between measures of behavioral bias coupling action and valence and component scores for component 2 (dorsal striatal DA D1 receptor availability). Statistics are displayed in Table 5.
Correlations coefficients and P values for correlations between all component scores and different indicators of a behavioral bias that couples action with valence
| Component 1 scores correlations | Component 2 scores correlations | Component 3 scores correlations | ||||
| Measure of behavioral bias | Correlation coefficient | Adjusted | Correlation coefficient | Adjusted | Correlation coefficient | Adjusted |
| Instrumental parameter | 0.167 | 0.485 | 0.083 | 0.767 | ||
| No-go to win performance | −0.082 | 0.769 | −0.097 | 0.726 | ||
| Behavioral effect on win trials | 0.121 | 0.918 | 0.011 | 0.867 | ||
| Interaction score | 0.065 | 0.813 | −0.091 | 0.994 | ||
Significant correlations are displayed in boldface type.