| Literature DB >> 35669385 |
Sharlen Moore1, Kishore V Kuchibhotla1,2,3.
Abstract
Learning is fundamental to animal survival. Animals must learn to link sensory cues in the environment to actions that lead to reward or avoid punishment. Rapid learning can then be highly adaptive and the difference between life or death. To explore the neural dynamics and circuits that underlie learning, however, has typically required the use of laboratory paradigms with tight control of stimuli, action sets, and outcomes. Learning curves in such reward-based tasks are reported as slow and gradual, with animals often taking hundreds to thousands of trials to reach expert performance. The slow, highly variable, and incremental learning curve remains the largely unchallenged belief in modern systems neuroscience. Here, we provide historical and contemporary evidence that instrumental forms of reward-learning can be dissociated into two parallel processes: knowledge acquisition which is rapid with step-like improvements, and behavioral expression which is slower and more variable. We further propose that this conceptual distinction may allow us to isolate the associative (knowledge-related) and non-associative (performance-related) components that influence learning. We then discuss the implications that this revised understanding of the learning curve has for systems neuroscience.Entities:
Keywords: Acquisition; Behavior; Big data; Goal-directed learning; Instrumental learning; Large-scale recordings; Learning; Stimulus-response; Systems neuroscience; circuit
Year: 2022 PMID: 35669385 PMCID: PMC9163689 DOI: 10.1016/j.ibneur.2022.05.006
Source DB: PubMed Journal: IBRO Neurosci Rep ISSN: 2667-2421
Fig. 1Methodological drivers of a slow learning curve. A) The effect of group averaging across animals. Left, schematic of individual animal learning curves (gray lines), defined learning criterion (dotted line), and threshold crossings (red circles). Middle, averaging individual learning curves aligned to the start of training creates the appearance of a slow and gradual process. Right, aligning learning curves to a defined learning criterion identifies a more rapid, and shared, dynamic across animals (within the red dotted box) and may provide better group averaging for use in neural data analysis. B) The effect of session averaging within an animal. Schematic of learning curve across training sessions shows a smooth gradual increase in performance. Early (left inset) and late (right inset) in learning, the session averaged performance provides a reasonable description of the behavior. At the ‘slope’ of the learning curve, however, the within day change (middle inset) can be dramatic with fast transitions in performance that are obscured by session-based averaging. C) The effect of motivation on within day performance. Expert performance can be influenced by an animals’ internal state. Motivation can change over the course of an expert session, driving errors typically ascribed to perceptual judgements. Early in the session (1), over motivation might be the driver of a high false alarm rate, while by the end, satiety might drive an animal to miss. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Behavioral dissociation of acquisition and expression. Mice were trained on an auditory go/no-go task in which they learn to lick to tone for a water reward (S+) and withhold licking to another tone to avoid a timeout (S-). Performance during learning in a reinforced context (top) has classically been equated to the ‘acquisition’ of task contingencies. In our data, we observe similar gradual acquisition trajectories in the reinforced context (top). We unmasked a more rapid acquisition trajectory by removing access to reinforcement in a few trials (bottom), and argue for a second dissociable process, ‘expression’, which reveals learned discriminations.