Literature DB >> 22162982

Neuroevolutionary reinforcement learning for generalized control of simulated helicopters.

Rogier Koppejan1, Shimon Whiteson.   

Abstract

This article presents an extended case study in the application of neuroevolution to generalized simulated helicopter hovering, an important challenge problem for reinforcement learning. While neuroevolution is well suited to coping with the domain's complex transition dynamics and high-dimensional state and action spaces, the need to explore efficiently and learn on-line poses unusual challenges. We propose and evaluate several methods for three increasingly challenging variations of the task, including the method that won first place in the 2008 Reinforcement Learning Competition. The results demonstrate that (1) neuroevolution can be effective for complex on-line reinforcement learning tasks such as generalized helicopter hovering, (2) neuroevolution excels at finding effective helicopter hovering policies but not at learning helicopter models, (3) due to the difficulty of learning reliable models, model-based approaches to helicopter hovering are feasible only when domain expertise is available to aid the design of a suitable model representation and (4) recent advances in efficient resampling can enable neuroevolution to tackle more aggressively generalized reinforcement learning tasks.

Entities:  

Year:  2011        PMID: 22162982      PMCID: PMC3214260          DOI: 10.1007/s12065-011-0066-z

Source DB:  PubMed          Journal:  Evol Intell        ISSN: 1864-5909


  8 in total

1.  The gambler's ruin problem, genetic algorithms, and the sizing of populations.

Authors:  G Harik; E Cantu-Paz; D E Goldberg; B L Miller
Journal:  Evol Comput       Date:  1999       Impact factor: 3.277

2.  Evolving neural networks through augmenting topologies.

Authors:  Kenneth O Stanley; Risto Miikkulainen
Journal:  Evol Comput       Date:  2002       Impact factor: 3.277

3.  Reducing the time complexity of the derandomized evolution strategy with covariance matrix adaptation (CMA-ES).

Authors:  Nikolaus Hansen; Sibylle D Müller; Petros Koumoutsakos
Journal:  Evol Comput       Date:  2003       Impact factor: 3.277

4.  Autonomous evolution of topographic regularities in artificial neural networks.

Authors:  Jason Gauci; Kenneth O Stanley
Journal:  Neural Comput       Date:  2010-07       Impact factor: 2.026

5.  A neural learning classifier system with self-adaptive constructivism for mobile robot control.

Authors:  Jacob Hurst; Larry Bull
Journal:  Artif Life       Date:  2006       Impact factor: 0.667

6.  Evolution of homing navigation in a real mobile robot.

Authors:  D Floreano; F Mondada
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  1996

7.  Combined genetic algorithm optimization and regularized orthogonal least squares learning for radial basis function networks.

Authors:  S Chen; Y Wu; B L Luk
Journal:  IEEE Trans Neural Netw       Date:  1999

8.  A hypercube-based encoding for evolving large-scale neural networks.

Authors:  Kenneth O Stanley; David B D'Ambrosio; Jason Gauci
Journal:  Artif Life       Date:  2009       Impact factor: 0.667

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.