| Literature DB >> 34675294 |
Takeyuki Oba1, Kentaro Katahira2, Hideki Ohira2.
Abstract
People tend to avoid risk in the domain of gains but take risks in the domain of losses; this is called the reflection effect. Formal theories of decision-making have provided important perspectives on risk preferences, but how individuals acquire risk preferences through experiences remains unknown. In the present study, we used reinforcement learning (RL) models to examine the learning processes that can shape attitudes toward risk in both domains. In addition, relationships between learning parameters and personality traits were investigated. Fifty-one participants performed a learning task, and we examined learning parameters and risk preference in each domain. Our results revealed that an RL model that included a nonlinear subjective utility parameter and differential learning rates for positive and negative prediction errors exhibited better fit than other models and that these parameters independently predicted risk preferences and the reflection effect. Regarding personality traits, although the sample sizes may be too small to test personality traits, increased primary psychopathy scores could be linked with decreased learning rates for positive prediction error in loss conditions among participants who had low anxiety traits. The present findings not only contribute to understanding how decision-making in risky conditions is influenced by past experiences but also provide insights into certain psychiatric problems.Entities:
Year: 2021 PMID: 34675294 PMCID: PMC8531311 DOI: 10.1038/s41598-021-00358-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Learning task used in this experiment. A sample of a gain trial is depicted.
The iBIC values and the negative log marginal likelihoods for the models using all choice data for each participant.
| Models | Set of parameters | iBIC (-LML) |
|---|---|---|
| Standard model | 16,868.56 (8414.34) | |
| Separate learning rate model | 16,490.25 (8215.21) | |
| Subjective utility model | 16,570.56 (8255.37) | |
| Hybrid model | 16,150.33 (8035.28) | |
| Hybrid model (split β into the domains) | 16,131.21 (8015.74) | |
| Hybrid model (split α into the domains) | 15,934.95 (7907.64) | |
| Hybrid model (split κ into the domains) | 15,999.22 (7949.75) | |
| Hybrid model (split all parameters into the domains) | 15,697.01 (7768.73) |
Note α = learning rate, β = inverse temperature, κ = subjective utility parameter, -LML = negative log marginal likelihood. The subscripts represent the following: P = positive PE, N = negative PE, G = gain domain, L = loss domain.
Figure 2Average probabilities of choosing an advantageous option for each trial type and the model predictions. The solid lines indicate the proportions of participants choosing a better option in each trial across participants, and the dashed lines show the predictions of the winning model. The shaded areas represent the 95% confidence intervals of the model prediction.
Figure 3Correlations of the proportion of choosing the risky option with the contrast between learning rates for signed PEs and the subjective utility parameters. The left panels indicate the correlations with the subjective utility parameters, while the right panels show the correlations with the contrast between learning rates for positive and negative PEs.
The correlations between the learning parameters and the personality traits.
| Descriptive statistics of the personality traits | Gain | Loss | |||||||
|---|---|---|---|---|---|---|---|---|---|
| κ | κ | ||||||||
| PP | − 0.144 | 0.134 | 0.122 | − 0.033 | 0.012 | 0.043 | 0.069 | 0.073 | |
| SP | − 0.016 | 0.043 | − 0.016 | − 0.013 | 0.277* | − 0.013 | − 0.062 | − 0.026 | |
| TA | 0.012 | 0.228 | 0.282* | − 0.136 | 0.107 | 0.009 | 0.114 | 0.061 | |
Note PP = primary psychopathy, SP = secondary psychopathy, TA = trait anxiety, αP = learning rate for positive PE, αN = learning rate for negative PE, β = inverse temperature, κ = subjective utility parameter. *p < 0.05.
Results of hierarchical regression analyses.
| Step 1 | Step 2 | Δ | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PP | SP | TA | PP | SP | TA | PP × SP | PP × TA | SP × TA | ||||
| Standardized coefficients | Standardized coefficients | |||||||||||
| Gain | − 0.175 | 0.069 | − 0.013 | 0.024 | − 0.222 | − 0.008 | 0.019 | − 0.048 | 0.129 | 0.196 | 0.063 | |
| 0.212 | − 0.210 | 0.321† | 0.092 | 0.196 | − 0.230 | 0.385* | − 0.238 | 0.004 | − 0.095 | 0.071 | ||
| 0.256 | − 0.346† | 0.440** | 0.161* | 0.234 | − 0.394* | 0.490** | − 0.154 | 0.099 | − 0.014 | 0.015 | ||
| κ | − 0.074 | 0.112 | − 0.188 | 0.027 | − 0.104 | 0.156 | − 0.225 | 0.112 | − 0.302 | 0.293† | 0.083 | |
| Loss | − 0.157 | 0.384* | − 0.074 | 0.097 | − 0.183 | 0.279 | − 0.061 | 0.052 | 0.337† | 0.136 | 0.187* | |
| 0.069 | − 0.062 | 0.036 | 0.004 | 0.103 | 0.015 | − 0.019 | 0.143 | − 0.176 | − 0.046 | 0.027 | ||
| 0.170 | − 0.251 | 0.229 | 0.054 | 0.129 | − 0.376† | 0.354* | − 0.379* | 0.329† | − 0.115 | 0.108 | ||
| κ | 0.133 | − 0.149 | 0.128 | 0.021 | 0.086 | − 0.208 | 0.214 | − 0.283 | 0.037 | 0.031 | 0.060 | |
Note PP = primary psychopathy, SP = secondary psychopathy, TA = trait anxiety, αP = learning rate for positive PE, αN = learning rate for negative PE, β = inverse temperature, κ = subjective utility parameter. †p < 0.10, *p < 0.05, **p < 0.01.
Figure 4The relationship between the learning rate for positive PE in the domain of losses (αLP) and primary psychopathy scores interacted with anxiety traits.