Literature DB >> 33501265

Efficacy of Modern Neuro-Evolutionary Strategies for Continuous Control Optimization.

Paolo Pagliuca1, Nicola Milano1, Stefano Nolfi1,2.   

Abstract

We analyze the efficacy of modern neuro-evolutionary strategies for continuous control optimization. Overall, the results collected on a wide variety of qualitatively different benchmark problems indicate that these methods are generally effective and scale well with respect to the number of parameters and the complexity of the problem. Moreover, they are relatively robust with respect to the setting of hyper-parameters. The comparison of the most promising methods indicates that the OpenAI-ES algorithm outperforms or equals the other algorithms on all considered problems. Moreover, we demonstrate how the reward functions optimized for reinforcement learning methods are not necessarily effective for evolutionary strategies and vice versa. This finding can lead to reconsideration of the relative efficacy of the two classes of algorithm since it implies that the comparisons performed to date are biased toward one or the other class.
Copyright © 2020 Pagliuca, Milano and Nolfi.

Entities:  

Keywords:  continuous control optimization; evolutionary strategies; fitness function design; natural evolutionary strategies; reinforcement learning

Year:  2020        PMID: 33501265      PMCID: PMC7805676          DOI: 10.3389/frobt.2020.00098

Source DB:  PubMed          Journal:  Front Robot AI        ISSN: 2296-9144


  5 in total

Review 1.  Completely derandomized self-adaptation in evolution strategies.

Authors:  N Hansen; A Ostermeier
Journal:  Evol Comput       Date:  2001       Impact factor: 3.277

2.  Human-level control through deep reinforcement learning.

Authors:  Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Andrei A Rusu; Joel Veness; Marc G Bellemare; Alex Graves; Martin Riedmiller; Andreas K Fidjeland; Georg Ostrovski; Stig Petersen; Charles Beattie; Amir Sadik; Ioannis Antonoglou; Helen King; Dharshan Kumaran; Daan Wierstra; Shane Legg; Demis Hassabis
Journal:  Nature       Date:  2015-02-26       Impact factor: 49.962

3.  Reinforcement learning of motor skills with policy gradients.

Authors:  Jan Peters; Stefan Schaal
Journal:  Neural Netw       Date:  2008-04-26

4.  Maximizing adaptive power in neuroevolution.

Authors:  Paolo Pagliuca; Nicola Milano; Stefano Nolfi
Journal:  PLoS One       Date:  2018-07-18       Impact factor: 3.240

5.  Robust optimization through neuroevolution.

Authors:  Paolo Pagliuca; Stefano Nolfi
Journal:  PLoS One       Date:  2019-03-01       Impact factor: 3.240

  5 in total
  2 in total

1.  Autonomous learning of features for control: Experiments with embodied and situated agents.

Authors:  Nicola Milano; Stefano Nolfi
Journal:  PLoS One       Date:  2021-04-15       Impact factor: 3.240

2.  Policy search with rare significant events: Choosing the right partner to cooperate with.

Authors:  Paul Ecoffet; Nicolas Fontbonne; Jean-Baptiste André; Nicolas Bredeche
Journal:  PLoS One       Date:  2022-04-26       Impact factor: 3.752

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.