Literature DB >> 26377457

A Simple Network Architecture Accounts for Diverse Reward Time Responses in Primary Visual Cortex.

Marco A Huertas1, Marshall G Hussain Shuler2, Harel Z Shouval3.   

Abstract

Many actions performed by animals and humans depend on an ability to learn, estimate, and produce temporal intervals of behavioral relevance. Exemplifying such learning of cued expectancies is the observation of reward-timing activity in the primary visual cortex (V1) of rodents, wherein neural responses to visual cues come to predict the time of future reward as behaviorally experienced in the past. These reward-timing responses exhibit significant heterogeneity in at least three qualitatively distinct classes: sustained increase or sustained decrease in firing rate until the time of expected reward, and a class of cells that reach a peak in firing at the expected delay. We elaborate upon our existing model by including inhibitory and excitatory units while imposing simple connectivity rules to demonstrate what role these inhibitory elements and the simple architectures play in sculpting the response dynamics of the network. We find that simply adding inhibition is not sufficient for obtaining the different distinct response classes, and that a broad distribution of inhibitory projections is necessary for obtaining peak-type responses. Furthermore, although changes in connection strength that modulate the effects of inhibition onto excitatory units have a strong impact on the firing rate profile of these peaked responses, the network exhibits robustness in its overall ability to predict the expected time of reward. Finally, we demonstrate how the magnitude of expected reward can be encoded at the expected delay in the network and how peaked responses express this reward expectancy. SIGNIFICANCE STATEMENT: Heterogeneity in single-neuron responses is a common feature of neuronal systems, although sometimes, in theoretical approaches, it is treated as a nuisance and seldom considered as conveying a different aspect of a signal. In this study, we focus on the heterogeneous responses in the primary visual cortex of rodents trained with a predictable delayed reward time. We describe under what conditions this heterogeneity can arise by self-organization, and what information it can convey. This study, while focusing on a specific system, provides insight onto how heterogeneity can arise in general while also shedding light onto mechanisms of reinforcement learning using realistic biological assumptions.
Copyright © 2015 the authors 0270-6474/15/3512659-14$15.00/0.

Entities:  

Keywords:  reinforcement learning; reward; synaptic plasticity; timing; visual cortex

Mesh:

Year:  2015        PMID: 26377457      PMCID: PMC4571602          DOI: 10.1523/JNEUROSCI.0871-15.2015

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  41 in total

1.  A neural correlate of working memory in the monkey primary visual cortex.

Authors:  H Supèr; H Spekreijse; V A Lamme
Journal:  Science       Date:  2001-07-06       Impact factor: 47.728

Review 2.  The neural basis of temporal processing.

Authors:  Michael D Mauk; Dean V Buonomano
Journal:  Annu Rev Neurosci       Date:  2004       Impact factor: 12.449

3.  Conditional modulation of spike-timing-dependent plasticity for olfactory learning.

Authors:  Stijn Cassenaer; Gilles Laurent
Journal:  Nature       Date:  2012-01-25       Impact factor: 49.962

Review 4.  What makes us tick? Functional and neural mechanisms of interval timing.

Authors:  Catalin V Buhusi; Warren H Meck
Journal:  Nat Rev Neurosci       Date:  2005-10       Impact factor: 34.870

5.  Top-down control of motor cortex ensembles by dorsomedial prefrontal cortex.

Authors:  Nandakumar S Narayanan; Mark Laubach
Journal:  Neuron       Date:  2006-12-07       Impact factor: 17.173

6.  Responses of neurons in primary auditory cortex (A1) to pure tones in the halothane-anesthetized cat.

Authors:  Dina Moshitch; Liora Las; Nachum Ulanovsky; Omer Bar-Yosef; Israel Nelken
Journal:  J Neurophysiol       Date:  2006-03-22       Impact factor: 2.714

7.  A synaptic memory trace for cortical receptive field plasticity.

Authors:  Robert C Froemke; Michael M Merzenich; Christoph E Schreiner
Journal:  Nature       Date:  2007-11-15       Impact factor: 49.962

8.  Learning reward timing in cortex through reward dependent expression of synaptic plasticity.

Authors:  Jeffrey P Gavornik; Marshall G Hussain Shuler; Yonatan Loewenstein; Mark F Bear; Harel Z Shouval
Journal:  Proc Natl Acad Sci U S A       Date:  2009-04-03       Impact factor: 11.205

Review 9.  Neural basis of the perception and estimation of time.

Authors:  Hugo Merchant; Deborah L Harrington; Warren H Meck
Journal:  Annu Rev Neurosci       Date:  2013-05-29       Impact factor: 12.449

10.  Functional, but not anatomical, separation of "what" and "when" in prefrontal cortex.

Authors:  Christian K Machens; Ranulfo Romo; Carlos D Brody
Journal:  J Neurosci       Date:  2010-01-06       Impact factor: 6.167

View more
  12 in total

1.  Neurocomputational Models of Interval and Pattern Timing.

Authors:  Nicholas F Hardy; Dean V Buonomano
Journal:  Curr Opin Behav Sci       Date:  2016-02-12

2.  Learning precise spatiotemporal sequences via biophysically realistic learning rules in a modular, spiking network.

Authors:  Ian Cone; Harel Z Shouval
Journal:  Elife       Date:  2021-03-18       Impact factor: 8.140

3.  Timing in the visual cortex and its investigation.

Authors:  Marshall G Hussain Shuler
Journal:  Curr Opin Behav Sci       Date:  2016-04

4.  Reward Timing and Its Expression by Inhibitory Interneurons in the Mouse Primary Visual Cortex.

Authors:  Kevin J Monk; Simon Allard; Marshall G Hussain Shuler
Journal:  Cereb Cortex       Date:  2020-06-30       Impact factor: 5.357

Review 5.  Optogenetic Dissection of the Basal Forebrain Neuromodulatory Control of Cortical Activation, Plasticity, and Cognition.

Authors:  Shih-Chieh Lin; Ritchie E Brown; Marshall G Hussain Shuler; Carl C H Petersen; Adam Kepecs
Journal:  J Neurosci       Date:  2015-10-14       Impact factor: 6.167

6.  Differential Excitability of PV and SST Neurons Results in Distinct Functional Roles in Inhibition Stabilization of Up States.

Authors:  Juan L Romero-Sosa; Helen Motanis; Dean V Buonomano
Journal:  J Neurosci       Date:  2021-07-12       Impact factor: 6.167

7.  Subthreshold basis for reward-predictive persistent activity in mouse prefrontal cortex.

Authors:  Eunyoung Kim; Bilal A Bari; Jeremiah Y Cohen
Journal:  Cell Rep       Date:  2021-05-04       Impact factor: 9.423

8.  Active intrinsic conductances in recurrent networks allow for long-lasting transients and sustained activity with realistic firing rates as well as robust plasticity.

Authors:  Tuba Aksoy; Harel Z Shouval
Journal:  J Comput Neurosci       Date:  2021-10-03       Impact factor: 1.621

9.  The Role of Multiple Neuromodulators in Reinforcement Learning That Is Based on Competition between Eligibility Traces.

Authors:  Marco A Huertas; Sarah E Schwettmann; Harel Z Shouval
Journal:  Front Synaptic Neurosci       Date:  2016-12-15

10.  Prospective Coding by Spiking Neurons.

Authors:  Johanni Brea; Alexisz Tamás Gaál; Robert Urbanczik; Walter Senn
Journal:  PLoS Comput Biol       Date:  2016-06-24       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.