| Literature DB >> 26379524 |
Christopher J Hasson1, Julia Manczurowsky1, Sheng-Che Yen2.
Abstract
Many gait training programs are based on supervised learning principles: an individual is guided towards a desired gait pattern with directional error feedback. While this results in rapid adaptation, improvements quickly disappear. This study tested the hypothesis that a reinforcement learning approach improves retention and transfer of a new gait pattern. The results of a pilot study and larger experiment are presented. Healthy subjects were randomly assigned to either a supervised group, who received explicit instructions and directional error feedback while they learned a new gait pattern on a treadmill, or a reinforcement group, who was only shown whether they were close to or far from the desired gait. Subjects practiced for 10 min, followed by immediate and overnight retention and over-ground transfer tests. The pilot study showed that subjects could learn a new gait pattern under a reinforcement learning paradigm. The larger experiment, which had twice as many subjects (16 in each group) showed that the reinforcement group had better overnight retention than the supervised group (a 32% vs. 120% error increase, respectively), but there were no differences for over-ground transfer. These results suggest that encouraging participants to find rewarding actions through self-guided exploration is beneficial for retention.Entities:
Keywords: adaptation; gait; motor learning; rehabilitation; reinforcement learning; retention; reward; supervised learning
Year: 2015 PMID: 26379524 PMCID: PMC4550775 DOI: 10.3389/fnhum.2015.00459
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Figure 1Visual feedback provided to the supervision and reinforcement groups.
Figure 2Experimental protocol. After practicing the gait task, retention and transfer were tested immediately and after a night of rest.
Figure 3Descriptive results of pilot study (Experiment 1), showing that subjects could learn a new gait pattern under a reinforcement learning paradigm. Performance was characterized by the logarithm of the absolute angular error is with respect to the goal ankle position over practice and on retention and transfer tests. Due to the small sample size and high within- and between-subject variability, only a descriptive analysis of this data was performed. At the start of practice the averages are shown for 5-trial bins, while the rest of practice (through late practice) shows 10-trial bins. Shading and error bars indicate the standard error.
Figure 4Results of Experiment 2, showing that the reinforcement group had better overnight retention compared to the supervised group. As in Experiment 1, performance was characterized by the logarithm of the absolute angular error is with respect to the goal ankle position over practice and on retention and transfer tests. In this experiment the sample size was doubled, an improved measurement system was used, and minor changes to the task were made (the target was reduced from 10° to 5°). At the start of practice the averages are shown for 5-trial bins, while the rest of practice (through late practice) shows 10-trial bins. Shading and error bars indicate the standard error. *Supervised group had a greater increase in error from immediate to overnight retention compared to the reinforcement group (Bonferroni-adjusted p = 0.04).