| Literature DB >> 31824378 |
Leor M Hackel1, Jeffrey J Berg2, Björn R Lindström3, David M Amodio2,3.
Abstract
Do habits play a role in our social impressions? To investigate the contribution of habits to the formation of social attitudes, we examined the roles of model-free and model-based reinforcement learning in social interactions - computations linked in past work to habit and planning, respectively. Participants in this study learned about novel individuals in a sequential reinforcement learning paradigm, choosing financial advisors who led them to high- or low-paying stocks. Results indicated that participants relied on both model-based and model-free learning, such that each type of learning was expressed in both advisor choices and post-task self-reported liking of advisors. Specifically, participants preferred advisors who could provide large future rewards as well as advisors who had provided them with large rewards in the past. Although participants relied more heavily on model-based learning overall, they varied in their use of model-based and model-free learning strategies, and this individual difference influenced the way in which learning related to self-reported attitudes: among participants who relied more on model-free learning, model-free social learning related more to post-task attitudes. We discuss implications for attitudes, trait impressions, and social behavior, as well as the role of habits in a memory systems model of social cognition.Entities:
Keywords: attitude; cognition; computational; habit; learning; model-based; model-free; social
Year: 2019 PMID: 31824378 PMCID: PMC6881302 DOI: 10.3389/fpsyg.2019.02592
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
FIGURE 1Schematic of task design. In the first stage of each round, participants saw one of two sets of advisors and chose an advisor for that round. Participants then viewed the stock that advisor had chosen; after making a button press, participants saw feedback indicating the payout provided by the stock, ranging from zero to nine points. Within each pair of advisors, one advisor always led to the “Axiom” stock and the other always led to the “Zephyr” stock. This feature of the task rendered the two sets of advisors equivalent, such that a model-based learner could apply experiences with one set of advisors to choices involving the other set of advisors.
FIGURE 2Behavioral predictions and data. Plots depict the probability of staying with the same second-stage stock as on the previous trial, based on whether the set of advisors encountered at the first stage was the same as or different from that of the previous trial, and whether the previous trial delivered a high or low reward. (A) Simulated model-based predictions. (B) Simulated model-free predictions. (A,B) Produced from model simulations (see section Supplementary Material) with weighting parameter w specifying fully model-based (w = 1) and fully model-free learning (w = 0), respectively. (C) Observed data indicates the presence of both model-based and model-free learning. Error bars reflect standard error of the mean, adjusted for within-subjects comparisons (Morey, 2008).