| Literature DB >> 32719595 |
Zhinan Peng1, Rui Luo1, Rui Huang1, Tengbo Yu2, Jiangping Hu1, Kecheng Shi1, Hong Cheng1.
Abstract
More recently, lower limb exoskeletons (LLE) have gained considerable interests in strength augmentation, rehabilitation, and walking assistance scenarios. For walking assistance, the LLE is expected to control the affected leg to track the unaffected leg's motion naturally. A critical issue in this scenario is that the exoskeleton system needs to deal with unpredictable disturbance from the patient, and the controller has the ability to adapt to different wearers. To this end, a novel data-driven optimal control (DDOC) strategy is proposed to adapt different hemiplegic patients with unpredictable disturbances. The interaction relation between two lower limbs of LLE and the leg of patient's unaffected side are modeled in the context of leader-follower framework. Then, the walking assistance control problem is transformed into an optimal control problem. A policy iteration (PI) algorithm is utilized to obtain the optimal controller. To improve the online adaptation to different patients, an actor-critic neural network (AC/NN) structure of the reinforcement learning (RL) is employed to learn the optimal controller on the basis of PI algorithm. Finally, experiments both on a simulation environment and a real LLE system are conducted to verify the effectiveness of the proposed walking assistance control method.Entities:
Keywords: actor-critic neural network; hemiplegic patients; leader-follower multi-agent system; lower limb exoskeleton; reinforcement learning; walking assistance control
Year: 2020 PMID: 32719595 PMCID: PMC7347968 DOI: 10.3389/fnbot.2020.00037
Source DB: PubMed Journal: Front Neurorobot ISSN: 1662-5218 Impact factor: 2.650
Figure 1The modeling of Leader-Follower Multi-Agent System. (A) The schematic diagram of the LFMAS (The notations 0, 1, and 2 denote leader, follower 1, and follower 2, respectively). (B) Communication topology network of the LFMAS structure.
Figure 2Convergence of the AC/NN weights on 2-DOF simulation platform. (A) Actor network. (B) Critic network.
Figure 3The trajectories tracking performance of joint angle of follower 1 on 2-DOF simulation platform. (A) Link 1. (B) Link 2.
Figure 4The LLE system called AIDER for hemiplegic patient. 1. The subject/wearer; 2. Smart shoes with plantar pressure sensors inside; 3. The load backpack with embedded computer, IMU and power unit; 4. Active joints with node controllers (hip joints and knee joints).
Figure 5The trajectories of the AC/NN weights for AIDER with subject 1 in the experiment: (A) Actor weights. (B) Critic weights.
Figure 6The tracking control performance performance of the proposed DDOC strategy on AIDER with subject 1 in the experiment: (A) Hip joint's angle. (B) Knee joint's angle.
Optimal Walking Assistance Control Algorithm.
| 1: |
| 2: Initialize the values of critic weight ŵ |
| 3: Set the learning rates of the critic network and actor network to be ρ |
| 4: Choose a sufficiently small computation precision ϵ; |
| 5: Let |
| 6: |
| 7: Calculate the actor network to estimate the control strategy û |
| 8: Calculate the critic network to estimate the cost function |
| 9: According to the available system data |
| 10: Calculate the objective function |
| 11: Update the weights in the critic NNs using ŵ |
| 12: Calculate the objective function |
| 13: Update the weights in the actor NNs using ŵ |
| 14: |