Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning.

Literature DB >> 18446432

Path-finding in real and simulated rats: assessing the influence of path characteristics on navigation learning.

Minija Tamosiunaite¹, James Ainge, Tomas Kulvicius, Bernd Porr, Paul Dudchenko, Florentin Wörgötter.

Abstract

A large body of experimental evidence suggests that the hippocampal place field system is involved in reward based navigation learning in rodents. Reinforcement learning (RL) mechanisms have been used to model this, associating the state space in an RL-algorithm to the place-field map in a rat. The convergence properties of RL-algorithms are affected by the exploration patterns of the learner. Therefore, we first analyzed the path characteristics of freely exploring rats in a test arena. We found that straight path segments with mean length 23 cm up to a maximal length of 80 cm take up a significant proportion of the total paths. Thus, rat paths are biased as compared to random exploration. Next we designed a RL system that reproduces these specific path characteristics. Our model arena is covered by overlapping, probabilistically firing place fields (PF) of realistic size and coverage. Because convergence of RL-algorithms is also influenced by the state space characteristics, different PF-sizes and densities, leading to a different degree of overlap, were also investigated. The model rat learns finding a reward opposite to its starting point. We observed that the combination of biased straight exploration, overlapping coverage and probabilistic firing will strongly impair the convergence of learning. When the degree of randomness in the exploration is increased, convergence improves, but the distribution of straight path segments becomes unrealistic and paths become 'wiggly'. To mend this situation without affecting the path characteristic two additional mechanisms are implemented: a gradual drop of the learned weights (weight decay) and path length limitation, which prevents learning if the reward is not found after some expected time. Both mechanisms limit the memory of the system and thereby counteract effects of getting trapped on a wrong path. When using these strategies individually divergent cases get substantially reduced and for some parameter settings no divergence was found anymore at all. Using weight decay and path length limitation at the same time, convergence is not much improved but instead time to convergence increases as the memory limiting effect is getting too strong. The degree of improvement relies also on the size and degree of overlap (coverage density) in the place field system. The used combination of these two parameters leads to a trade-off between convergence and speed to convergence. Thus, this study suggests that the role of the PF-system in navigation learning cannot be considered independently from the animals' exploration pattern.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2008 PMID： 18446432 PMCID： PMC3085791 DOI： 10.1007/s10827-008-0094-6

Source DB: PubMed Journal: J Comput Neurosci ISSN： 0929-5313 Impact factor: 1.621

44 in total

1. How the basal ganglia use parallel excitatory and inhibitory learning pathways to selectively respond to unexpected rewarding cues.

Authors: J Brown; D Bullock; S Grossberg
Journal: J Neurosci Date: 1999-12-01 Impact factor: 6.167

2. A predictive reinforcement model of dopamine neurons for learning approach behavior.

Authors: J L Contreras-Vidal; W Schultz
Journal: J Comput Neurosci Date: 1999 May-Jun Impact factor: 1.621

3. A model of hippocampally dependent navigation, using the temporal difference learning rule.

Authors: D J Foster; R G Morris; P Dayan
Journal: Hippocampus Date: 2000 Impact factor: 3.899

4. Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity.

Authors: A Arleo; W Gerstner
Journal: Biol Cybern Date: 2000-09 Impact factor: 2.086

5. Modeling place fields in terms of the cortical inputs to the hippocampus.

Authors: T Hartley; N Burgess; C Lever; F Cacucci; J O'Keefe
Journal: Hippocampus Date: 2000 Impact factor: 3.899

6. Contribution of multiple sensory information to place field stability in hippocampal place cells.

Authors: E Save; L Nerad; B Poucet
Journal: Hippocampus Date: 2000 Impact factor: 3.899

7. Home base behavior of rats (Rattus norvegicus) exploring a novel environment.

Authors: D Eilam; I Golani
Journal: Behav Brain Res Date: 1989-09-01 Impact factor: 3.332

8. The point of entry contributes to the organization of exploratory behavior of rats on an open field: an example of spontaneous episodic memory.

Authors: Farshad Nemati; Ian Q Whishaw
Journal: Behav Brain Res Date: 2007-05-21 Impact factor: 3.332

9. Odor supported place cell model and goal navigation in rodents.

Authors: Tomas Kulvicius; Minija Tamosiunaite; James Ainge; Paul Dudchenko; Florentin Wörgötter
Journal: J Comput Neurosci Date: 2008-04-23 Impact factor: 1.621

10. Hippocampal CA1 place cells encode intended destination on a maze with multiple choice points.

Authors: James A Ainge; Minija Tamosiunaite; Florentin Woergoetter; Paul A Dudchenko
Journal: J Neurosci Date: 2007-09-05 Impact factor: 6.167

2 in total

1. Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions.

Authors: Minija Tamosiunaite; Tamim Asfour; Florentin Wörgötter
Journal: Biol Cybern Date: 2009-02-20 Impact factor: 2.086

2. Forgetting in Reinforcement Learning Links Sustained Dopamine Signals to Motivation.

Authors: Ayaka Kato; Kenji Morita
Journal: PLoS Comput Biol Date: 2016-10-13 Impact factor: 4.475

2 in total