Literature DB >> 27870610

Neural Circuits Trained with Standard Reinforcement Learning Can Accumulate Probabilistic Information during Decision Making.

Nils Kurzawa1, Christopher Summerfield2, Rafal Bogacz3.   

Abstract

Much experimental evidence suggests that during decision making, neural circuits accumulate evidence supporting alternative options. A computational model well describing this accumulation for choices between two options assumes that the brain integrates the log ratios of the likelihoods of the sensory inputs given the two options. Several models have been proposed for how neural circuits can learn these log-likelihood ratios from experience, but all of these models introduced novel and specially dedicated synaptic plasticity rules. Here we show that for a certain wide class of tasks, the log-likelihood ratios are approximately linearly proportional to the expected rewards for selecting actions. Therefore, a simple model based on standard reinforcement learning rules is able to estimate the log-likelihood ratios from experience and on each trial accumulate the log-likelihood ratios associated with presented stimuli while selecting an action. The simulations of the model replicate experimental data on both behavior and neural activity in tasks requiring accumulation of probabilistic cues. Our results suggest that there is no need for the brain to support dedicated plasticity rules, as the standard mechanisms proposed to describe reinforcement learning can enable the neural circuits to perform efficient probabilistic inference.

Entities:  

Year:  2016        PMID: 27870610      PMCID: PMC5462093          DOI: 10.1162/NECO_a_00917

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  30 in total

1.  Regulation of parkinsonian motor behaviours by optogenetic control of basal ganglia circuitry.

Authors:  Alexxai V Kravitz; Benjamin S Freeze; Philip R L Parker; Kenneth Kay; Myo T Thwin; Karl Deisseroth; Anatol C Kreitzer
Journal:  Nature       Date:  2010-07-07       Impact factor: 49.962

2.  The basal ganglia and cortex implement optimal decision making between alternative actions.

Authors:  Rafal Bogacz; Kevin Gurney
Journal:  Neural Comput       Date:  2007-02       Impact factor: 2.026

3.  A neurobiological theory of automaticity in perceptual categorization.

Authors:  F Gregory Ashby; John M Ennis; Brian J Spiering
Journal:  Psychol Rev       Date:  2007-07       Impact factor: 8.934

Review 4.  A neural substrate of prediction and reward.

Authors:  W Schultz; P Dayan; P R Montague
Journal:  Science       Date:  1997-03-14       Impact factor: 47.728

5.  A cellular mechanism of reward-related learning.

Authors:  J N Reynolds; B I Hyland; J R Wickens
Journal:  Nature       Date:  2001-09-06       Impact factor: 49.962

6.  Caudate encodes multiple computations for perceptual decisions.

Authors:  Long Ding; Joshua I Gold
Journal:  J Neurosci       Date:  2010-11-24       Impact factor: 6.167

7.  Opponent actor learning (OpAL): modeling interactive effects of striatal dopamine on reinforcement learning and choice incentive.

Authors:  Anne G E Collins; Michael J Frank
Journal:  Psychol Rev       Date:  2014-07       Impact factor: 8.934

Review 8.  Do humans make good decisions?

Authors:  Christopher Summerfield; Konstantinos Tsetsos
Journal:  Trends Cogn Sci       Date:  2014-12-06       Impact factor: 20.229

9.  Action selection performance of a reconfigurable basal ganglia inspired model with Hebbian-Bayesian Go-NoGo connectivity.

Authors:  Pierre Berthet; Jeanette Hellgren-Kotaleski; Anders Lansner
Journal:  Front Behav Neurosci       Date:  2012-10-02       Impact factor: 3.558

10.  Distinct roles of dopamine and subthalamic nucleus in learning and probabilistic decision making.

Authors:  Elizabeth J Coulthard; Rafal Bogacz; Shazia Javed; Lucy K Mooney; Gillian Murphy; Sophie Keeley; Alan L Whone
Journal:  Brain       Date:  2012-10-31       Impact factor: 13.501

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.