Human locomotion with reinforcement learning using bioinspired reward reshaping strategies.

Katharine Nowakowski, Philippe Carvalho, Jean-Baptiste Six, Yann Maillet, Anh Tu Nguyen, Ismail Seghiri, Loick M'Pemba, Theo Marcille, Sy Toan Ngo, Tien-Tuan Dao

Abstract

Recent learning strategies such as reinforcement learning (RL) have favored the transition from applied artificial intelligence to general artificial intelligence. One of the current challenges of RL in healthcare is the development of a controller that teaches a musculoskeletal model to perform dynamic movements. Several solutions have been proposed. However, few investigations have explored the muscle control problem from a biomechanical point of view. Moreover, no studies that use biological knowledge to develop plausible motor control models for pathophysiological conditions make use of reward reshaping. Consequently, the objective of the present work was to design and evaluate specific bioinspired reward function strategies for human locomotion learning within an RL framework. The deep deterministic policy gradient (DDPG) method for a single-agent RL problem was applied. A 3D musculoskeletal model (8 DoF and 22 muscles) of a healthy adult was used. A virtual interactive environment was developed and simulated using the opensim-rl library. Three reward functions were defined, for walking, forward falls, and side falls. The training process was performed with Google Cloud Compute Engine. The obtained outcomes were compared to the NIPS 2017 challenge outcomes, experimental observations, and literature data. Regarding learning to walk, the simulated musculoskeletal models were able to walk from 18 to 20.5 m for the best solutions. A compensation strategy of muscle activations was revealed. The soleus, tibialis anterior, and vasti muscles were the main actors in the simple forward fall. A higher intensity of muscle activations was also noted after the fall. All kinematics and muscle patterns were consistent with experimental observations and literature data. Regarding the side fall, intense muscle activation on the expected fall side, serving to unbalance the body, was noted.
The obtained outcomes suggest that computational and human resources as well as biomechanical knowledge are needed together to develop and evaluate an efficient and robust RL solution. In future work, the current solutions will be extended to a larger parameter space in 3D. Furthermore, a stochastic reinforcement learning model that accounts for the uncertainties of the musculoskeletal model and its associated environment will be investigated to provide a general artificial intelligence solution for human locomotion learning.
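
To make the idea of reward reshaping concrete, the following is a minimal sketch of what a shaped walking reward could look like. It is not the reward used in the study, which the abstract does not specify. The `StepState` fields, the effort weight, the pelvis-height threshold, and the fall penalty are all hypothetical values chosen for illustration, and the sketch does not call the simulation library.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class StepState:
    """Hypothetical per-step observation; field names are illustrative only."""
    pelvis_velocity_x: float      # forward pelvis velocity (m/s)
    pelvis_height: float          # pelvis height above the ground (m)
    activations: Sequence[float]  # muscle activations, each in [0, 1]


def shaped_walking_reward(
    state: StepState,
    effort_weight: float = 0.05,      # assumed trade-off weight
    min_pelvis_height: float = 0.6,   # assumed fall threshold (m)
    fall_penalty: float = 10.0,       # assumed penalty for falling
) -> float:
    """Reward forward progress, penalize muscle effort and falling."""
    # Base term: forward velocity rewards distance covered per step.
    progress = state.pelvis_velocity_x
    # Shaping term: mean squared activation as a simple effort proxy.
    n = max(len(state.activations), 1)
    effort = sum(a * a for a in state.activations) / n
    reward = progress - effort_weight * effort
    # Shaping term: a low pelvis is treated as a fall.
    if state.pelvis_height < min_pelvis_height:
        reward -= fall_penalty
    return reward


# Example with 22 muscles, matching the model size stated in the abstract.
upright = StepState(pelvis_velocity_x=1.0, pelvis_height=0.9, activations=[0.5] * 22)
fallen = StepState(pelvis_velocity_x=1.0, pelvis_height=0.4, activations=[0.5] * 22)
print(shaped_walking_reward(upright), shaped_walking_reward(fallen))
```

Under these assumptions, a reward for a deliberate fall could reverse the sign of the height term so that lowering the pelvis in the intended direction is rewarded rather than penalized.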

Keywords:  Bioinspired reward reshaping; Fall; Human locomotion learning; Modern artificial intelligence (AI); Reinforcement learning; Walking

Year:  2021        PMID: 33417125     DOI: 10.1007/s11517-020-02309-3

Source DB:  PubMed          Journal:  Med Biol Eng Comput        ISSN: 0140-0118            Impact factor:   2.602


