Literature DB >> 28475063

Training an Actor-Critic Reinforcement Learning Controller for Arm Movement Using Human-Generated Rewards.

Kathleen M Jagodnik, Philip S Thomas, Antonie J van den Bogert, Michael S Branicky, Robert F Kirsch.   

Abstract

Functional Electrical Stimulation (FES) employs neuroprostheses to apply electrical current to the nerves and muscles of individuals paralyzed by spinal cord injury to restore voluntary movement. Neuroprosthesis controllers calculate stimulation patterns to produce desired actions. To date, no existing controller is able to efficiently adapt its control strategy to the wide range of possible physiological arm characteristics, reaching movements, and user preferences that vary over time. Reinforcement learning (RL) is a control strategy that can incorporate human reward signals as inputs to allow human users to shape controller behavior. In this paper, ten neurologically intact human participants assigned subjective numerical rewards to train RL controllers, evaluating animations of goal-oriented reaching tasks performed using a planar musculoskeletal human arm simulation. The RL controller learning achieved using human trainers was compared with learning accomplished using human-like rewards generated by an algorithm; metrics included success at reaching the specified target; time required to reach the target; and target overshoot. Both sets of controllers learned efficiently and with minimal differences, significantly outperforming standard controllers. Reward positivity and consistency were found to be unrelated to learning success. These results suggest that human rewards can be used effectively to train RL-based FES controllers.

Entities:  

Mesh:

Year:  2017        PMID: 28475063      PMCID: PMC7523734          DOI: 10.1109/TNSRE.2017.2700395

Source DB:  PubMed          Journal:  IEEE Trans Neural Syst Rehabil Eng        ISSN: 1534-4320            Impact factor:   3.802


  21 in total

1.  Reinforcement learning in continuous time and space.

Authors:  K Doya
Journal:  Neural Comput       Date:  2000-01       Impact factor: 2.026

2.  Restoration of elbow extension via functional electrical stimulation in individuals with tetraplegia.

Authors:  William D Memberg; Patrick E Crago; Michael W Keith
Journal:  J Rehabil Res Dev       Date:  2003 Nov-Dec

3.  Closed-loop wrist stabilization in C4 and C5 tetraplegia.

Authors:  M A Lemay; P E Crago
Journal:  IEEE Trans Rehabil Eng       Date:  1997-09

4.  Analysis of an optimal control model of multi-joint arm movements.

Authors:  N Lan
Journal:  Biol Cybern       Date:  1997-02       Impact factor: 2.086

5.  Application of the Actor-Critic Architecture to Functional Electrical Stimulation Control of a Human Arm.

Authors:  Philip Thomas; Michael Branicky; Antonie van den Bogert; Kathleen Jagodnik
Journal:  Proc Innov Appl Artif Intell Conf       Date:  2009

6.  Creating a Reinforcement Learning Controller for Functional Electrical Stimulation of a Human Arm.

Authors:  Philip S Thomas; Michael Branicky; Antonie van den Bogert; Kathleen Jagodnik
Journal:  Yale Workshop Adapt Learn Syst       Date:  2008

7.  Muscle weakness, paralysis, and atrophy after human cervical spinal cord injury.

Authors:  C K Thomas; E Y Zaidner; B Calancie; J G Broton; B R Bigland-Ritchie
Journal:  Exp Neurol       Date:  1997-12       Impact factor: 5.330

8.  Feedback regulation of hand grasp opening and contact force during stimulation of paralyzed muscle.

Authors:  P E Crago; R J Nakai; H J Chizeck
Journal:  IEEE Trans Biomed Eng       Date:  1991-01       Impact factor: 4.538

9.  Influence of complete spinal cord injury on skeletal muscle within 6 mo of injury.

Authors:  M J Castro; D F Apple; R S Staron; G E Campos; G A Dudley
Journal:  J Appl Physiol (1985)       Date:  1999-01

10.  Development and validation of a 3-D model to predict knee joint loading during dynamic movement.

Authors:  S G McLean; A Su; A J van den Bogert
Journal:  J Biomech Eng       Date:  2003-12       Impact factor: 2.097

View more
  4 in total

1.  Holding Static Arm Configurations With Functional Electrical Stimulation: A Case Study.

Authors:  Derek N Wolf; Eric M Schearer
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2018-08-20       Impact factor: 3.802

2.  Sub-optimally Solving Actuator Redundancy in a Hybrid Neuroprosthetic System with a Multi-layer Neural Network Structure.

Authors:  Xuefeng Bao; Zhi-Hong Mao; Paul Munro; Ziyue Sun; Nitin Sharma
Journal:  Int J Intell Robot Appl       Date:  2019-08-14

3.  Improving the Learning Rate, Accuracy, and Workspace of Reinforcement Learning Controllers for a Musculoskeletal Model of the Human Arm.

Authors:  Douglas C Crowder; Jessica Abreu; Robert F Kirsch
Journal:  IEEE Trans Neural Syst Rehabil Eng       Date:  2022-01-28       Impact factor: 3.802

4.  Reinforcement Learning-Based End-to-End Parking for Automatic Parking System.

Authors:  Peizhi Zhang; Lu Xiong; Zhuoping Yu; Peiyuan Fang; Senwei Yan; Jie Yao; Yi Zhou
Journal:  Sensors (Basel)       Date:  2019-09-16       Impact factor: 3.576

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.