Literature DB >> 23465562

First experiments with POWERPLAY.

Rupesh Kumar Srivastava1, Bas R Steunebrink, Jürgen Schmidhuber.   

Abstract

Like a scientist or a playing child, POWERPLAY (Schmidhuber, 2011) not only learns new skills to solve given problems, but also invents new interesting problems by itself. By design, it continually comes up with the fastest to find, initially novel, but eventually solvable tasks. It also continually simplifies or compresses or speeds up solutions to previous tasks. Here we describe first experiments with POWERPLAY. A self-delimiting recurrent neural network SLIM RNN (Schmidhuber, 2012) is used as a general computational problem solving architecture. Its connection weights can encode arbitrary, self-delimiting, halting or non-halting programs affecting both environment (through effectors) and internal states encoding abstractions of event sequences. Our POWERPLAY-driven SLIM RNN learns to become an increasingly general solver of self-invented problems, continually adding new problem solving procedures to its growing skill repertoire. Extending a recent conference paper (Srivastava, Steunebrink, Stollenga, & Schmidhuber, 2012), we identify interesting, emerging, developmental stages of our open-ended system. We also show how it automatically self-modularizes, frequently re-using code for previously invented skills, always trying to invent novel tasks that can be quickly validated because they do not require too many weight changes affecting too many previous tasks.
Copyright © 2013 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2013        PMID: 23465562     DOI: 10.1016/j.neunet.2013.01.022

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  6 in total

Review 1.  Information-seeking, curiosity, and attention: computational and neural mechanisms.

Authors:  Jacqueline Gottlieb; Pierre-Yves Oudeyer; Manuel Lopes; Adrien Baranes
Journal:  Trends Cogn Sci       Date:  2013-10-12       Impact factor: 20.229

2.  Self-organization of early vocal development in infants and machines: the role of intrinsic motivation.

Authors:  Clément Moulin-Frier; Sao M Nguyen; Pierre-Yves Oudeyer
Journal:  Front Psychol       Date:  2014-01-16

3.  An intrinsic value system for developing multiple invariant representations with incremental slowness learning.

Authors:  Matthew Luciw; Varun Kompella; Sohrob Kazerounian; Juergen Schmidhuber
Journal:  Front Neurorobot       Date:  2013-05-30       Impact factor: 2.650

4.  PowerPlay: Training an Increasingly General Problem Solver by Continually Searching for the Simplest Still Unsolvable Problem.

Authors:  Jürgen Schmidhuber
Journal:  Front Psychol       Date:  2013-06-07

5.  Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots.

Authors:  Hung Ngo; Matthew Luciw; Alexander Förster; Jürgen Schmidhuber
Journal:  Front Psychol       Date:  2013-11-26

6.  Curiosity driven reinforcement learning for motion planning on humanoids.

Authors:  Mikhail Frank; Jürgen Leitner; Marijn Stollenga; Alexander Förster; Jürgen Schmidhuber
Journal:  Front Neurorobot       Date:  2014-01-06       Impact factor: 2.650

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.