Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Literature DB >> 35860954

Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Jaron T Colas^1,2,3, Neil M Dundon^1,4, Raphael T Gerraty^5,6,7, Natalie M Saragosa-Harris^8,9, Karol P Szymula¹⁰, Koranis Tanwisuth^2,11, J Michael Tyszka², Camilla van Geen^6,12, Harang Ju¹³, Arthur W Toga¹⁴, Joshua I Gold¹⁵, Dani S Bassett^{10,16,17,18,19,20}, Catherine A Hartley^8,21, Daphna Shohamy^5,6,22, Scott T Grafton¹, John P O'Doherty^2,3.

Abstract

The model-free algorithms of "reinforcement learning" (RL) have gained clout across disciplines, but so too have model-based alternatives. The present study emphasizes other dimensions of this model space in consideration of associative or discriminative generalization across states and actions. This "generalized reinforcement learning" (GRL) model, a frugal extension of RL, parsimoniously retains the single reward-prediction error (RPE), but the scope of learning goes beyond the experienced state and action. Instead, the generalized RPE is efficiently relayed for bidirectional counterfactual updating of value estimates for other representations. Aided by structural information but as an implicit rather than explicit cognitive map, GRL provided the most precise account of human behavior and individual differences in a reversal-learning task with hierarchical structure that encouraged inverse generalization across both states and actions. Reflecting inference that could be true, false (i.e., overgeneralization), or absent (i.e., undergeneralization), state generalization distinguished those who learned well more so than action generalization. With high-resolution high-field fMRI targeting the dopaminergic midbrain, the GRL model's RPE signals (alongside value and decision signals) were localized within not only the striatum but also the substantia nigra and the ventral tegmental area, including specific effects of generalization that also extend to the hippocampus. Factoring in generalization as a multidimensional process in value-based learning, these findings shed light on complexities that, while challenging classic RL, can still be resolved within the bounds of its core computations.

Entities: Chemical

Keywords: cognitive map; counterfactual learning; dopaminergic midbrain; generalization; hippocampus; individual differences; model-free and model-based; multifield fMRI; reinforcement learning; striatum

Mesh：

Year: 2022 PMID： 35860954 PMCID： PMC9491297 DOI： 10.1002/hbm.25988

Source DB: PubMed Journal: Hum Brain Mapp ISSN： 1065-9471 Impact factor: 5.399

256 in total

1. Generalized autocalibrating partially parallel acquisitions (GRAPPA).

Authors: Mark A Griswold; Peter M Jakob; Robin M Heidemann; Mathias Nittka; Vladimir Jellus; Jianmin Wang; Berthold Kiefer; Axel Haase
Journal: Magn Reson Med Date: 2002-06 Impact factor: 4.668

2. A direct demonstration of functional specialization in human visual cortex.

Authors: S Zeki; J D Watson; C J Lueck; K J Friston; C Kennard; R S Frackowiak
Journal: J Neurosci Date: 1991-03 Impact factor: 6.167

3. Dopamine-mediated reinforcement learning signals in the striatum and ventromedial prefrontal cortex underlie value-based choices.

Authors: Gerhard Jocham; Tilmann A Klein; Markus Ullsperger
Journal: J Neurosci Date: 2011-02-02 Impact factor: 6.167

4. A mechanistic account of value computation in the human brain.

Authors: Marios G Philiastides; Guido Biele; Hauke R Heekeren
Journal: Proc Natl Acad Sci U S A Date: 2010-05-03 Impact factor: 11.205

5. Area V5 of the human brain: evidence from a combined study using positron emission tomography and magnetic resonance imaging.

Authors: J D Watson; R Myers; R S Frackowiak; J V Hajnal; R P Woods; J C Mazziotta; S Shipp; S Zeki
Journal: Cereb Cortex Date: 1993 Mar-Apr Impact factor: 5.357

Review 6. Model-based predictions for dopamine.

Authors: Angela J Langdon; Melissa J Sharpe; Geoffrey Schoenbaum; Yael Niv
Journal: Curr Opin Neurobiol Date: 2017-10-31 Impact factor: 6.627

7. Generalization guides human exploration in vast decision spaces.

Authors: Charley M Wu; Eric Schulz; Maarten Speekenbrink; Jonathan D Nelson; Björn Meder
Journal: Nat Hum Behav Date: 2018-11-12

Review 8. Beyond dichotomies in reinforcement learning.

Authors: Anne G E Collins; Jeffrey Cockburn
Journal: Nat Rev Neurosci Date: 2020-09-01 Impact factor: 34.870

Review 9. Why and how the brain weights contributions from a mixture of experts.

Authors: John P O'Doherty; Sang Wan Lee; Reza Tadayonnejad; Jeff Cockburn; Kyo Iigaya; Caroline J Charpentier
Journal: Neurosci Biobehav Rev Date: 2021-01-11 Impact factor: 8.989

10. Brain tissue segmentation based on MP2RAGE multi-contrast images in 7 T MRI.

Authors: Uk-Su Choi; Hirokazu Kawaguchi; Yuichiro Matsuoka; Tobias Kober; Ikuhiro Kida
Journal: PLoS One Date: 2019-02-28 Impact factor: 3.240

1 in total

1. Reinforcement learning with associative or discriminative generalization across states and actions: fMRI at 3 T and 7 T.

Authors: Jaron T Colas; Neil M Dundon; Raphael T Gerraty; Natalie M Saragosa-Harris; Karol P Szymula; Koranis Tanwisuth; J Michael Tyszka; Camilla van Geen; Harang Ju; Arthur W Toga; Joshua I Gold; Dani S Bassett; Catherine A Hartley; Daphna Shohamy; Scott T Grafton; John P O'Doherty
Journal: Hum Brain Mapp Date: 2022-07-21 Impact factor: 5.399

1 in total