| Literature DB >> 24391615 |
Inti A Brazil1, Laurence T Hunt2, Berend H Bulten3, Roy P C Kessels4, Ellen R A de Bruijn5, Rogier B Mars6.
Abstract
Psychopathy is often linked to disturbed reinforcement-guided adaptation of behavior in both clinical and non-clinical populations. Recent work suggests that these disturbances might be due to a deficit in actively using information to guide changes in behavior. However, how much information is actually used to guide behavior is difficult to observe directly. Therefore, we used a computational model to estimate the use of information during learning. Thirty-six female subjects were recruited based on their total scores on the Psychopathic Personality Inventory (PPI), a self-report psychopathy list, and performed a task involving simultaneous learning of reward-based and social information. A Bayesian reinforcement-learning model was used to parameterize the use of each source of information during learning. Subsequently, we used the subscales of the PPI to assess psychopathy-related traits, and the traits that were strongly related to the model's parameters were isolated through a formal variable selection procedure. Finally, we assessed how these covaried with model parameters. We succeeded in isolating key personality traits believed to be relevant for psychopathy that can be related to model-based descriptions of subject behavior. Use of reward-history information was negatively related to levels of trait anxiety and fearlessness, whereas use of social advice decreased as the perceived ability to manipulate others and lack of anxiety increased. These results corroborate previous findings suggesting that sub-optimal use of different types of information might be implicated in psychopathy. They also further highlight the importance of considering the potential of computational modeling to understand the role of latent variables, such as the weight people give to various sources of information during goal-directed behavior, when conducting research on psychopathy-related traits and in the field of forensic psychiatry.Entities:
Keywords: associative learning; computational modeling; individual differences; personality traits; psychopathic traits; psychopathy; reinforcement learning; social learning
Year: 2013 PMID: 24391615 PMCID: PMC3868018 DOI: 10.3389/fpsyg.2013.00952
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Mean total PPI score and subscale scores for the experimental sample (.
| Age | 22.8 (6.4) |
| Total PPI score | 336 (47.8) |
| Stress immunity | 28.9 (5.4) |
| Social potency | 56.6 (12.8) |
| Fearlessness | 41.3 (10.0) |
| Coldheartedness | 46.4 (7.3) |
| Blame externalization | 31.8 (6.9) |
| Carefree non-planfulness | 40.4 (5.7) |
| Machavellian egocentricity | 54.6 (12.0) |
| Impulsive non-conformity | 33.6 (6.3) |
Figure 1(A) Sequence of events and their timings during the experiment. (B) Probability of reward from choosing green card through the experiment. The line shows the probability estimated by the Bayesian reinforcement learning model. (C) The figure shows the model-derived probability of the confederate providing the correct answer through the experiment. Note that the model learns independently about both social and reward history information at the time feedback is received.
Figure 2Graphical depiction of the γ parameter in the model. (See equations 1 and 2, section Mathematical model description, for algebraic description). (A) Example transform between objective (RL model-derived) probability and subjective probability, parameterized by γ. As γ increases, small differences in the “objective” probability (tracked by the model) are amplified to have a greater influence on subject behavior. (B) Posterior probability of choosing green for varying levels of γreward history and γsocial, for one example trial, where reward history and advice are equally relevant, but suggest conflicting responses (reward history suggests blue choices, advice is to pick green). When γsocial = γreward history (diagonal), subject is equally likely to pick blue or green; when γsocial > γreward history, subject is more likely to pick green; when γsocial < γreward history, subject is more likely to pick blue. See section Modeling for details.
Figure 3Results of the variable selection procedure for γ. The maximum standardized sum of coefficients (SSC; x-axis) was set at 1.0, representing 100% of the contribution of the PPI scales to the corresponding γ parameter. Each sub-figure should be read from right (SSC = 1.0) to left (SSC = 0.0). The variable coefficients (y-axis) are displayed for different stages of shrinkage of the SSC. For each analysis, the variables included in the optimal model (i.e., the model with the lowest expected prediction error) are indicated with the vertical dashed line.
Figure 4Left: scatterplots for the correlations between γreward history and Stress Immunity (top left)/Fearlessness (bottom left). Right: scatterplots for the correlations between γsocial and Stress Immunity (top right)/Social Potency (bottom right).