Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 First experiments with POWERPLAY.

Literature DB >> 23465562

First experiments with POWERPLAY.

Rupesh Kumar Srivastava¹, Bas R Steunebrink, Jürgen Schmidhuber.

Abstract

Like a scientist or a playing child, POWERPLAY (Schmidhuber, 2011) not only learns new skills to solve given problems, but also invents new interesting problems by itself. By design, it continually comes up with the fastest to find, initially novel, but eventually solvable tasks. It also continually simplifies or compresses or speeds up solutions to previous tasks. Here we describe first experiments with POWERPLAY. A self-delimiting recurrent neural network SLIM RNN (Schmidhuber, 2012) is used as a general computational problem solving architecture. Its connection weights can encode arbitrary, self-delimiting, halting or non-halting programs affecting both environment (through effectors) and internal states encoding abstractions of event sequences. Our POWERPLAY-driven SLIM RNN learns to become an increasingly general solver of self-invented problems, continually adding new problem solving procedures to its growing skill repertoire. Extending a recent conference paper (Srivastava, Steunebrink, Stollenga, & Schmidhuber, 2012), we identify interesting, emerging, developmental stages of our open-ended system. We also show how it automatically self-modularizes, frequently re-using code for previously invented skills, always trying to invent novel tasks that can be quickly validated because they do not require too many weight changes affecting too many previous tasks.

Entities: Species

Mesh：

Year: 2013 PMID： 23465562 DOI： 10.1016/j.neunet.2013.01.022

Source DB: PubMed Journal: Neural Netw ISSN： 0893-6080

Keyword Cloud
Cited

6 in total

Review 1. Information-seeking, curiosity, and attention: computational and neural mechanisms.

Authors: Jacqueline Gottlieb; Pierre-Yves Oudeyer; Manuel Lopes; Adrien Baranes
Journal: Trends Cogn Sci Date: 2013-10-12 Impact factor: 20.229

2. Self-organization of early vocal development in infants and machines: the role of intrinsic motivation.

Authors: Clément Moulin-Frier; Sao M Nguyen; Pierre-Yves Oudeyer
Journal: Front Psychol Date: 2014-01-16

3. An intrinsic value system for developing multiple invariant representations with incremental slowness learning.

Authors: Matthew Luciw; Varun Kompella; Sohrob Kazerounian; Juergen Schmidhuber
Journal: Front Neurorobot Date: 2013-05-30 Impact factor: 2.650

4. PowerPlay: Training an Increasingly General Problem Solver by Continually Searching for the Simplest Still Unsolvable Problem.

Authors: Jürgen Schmidhuber
Journal: Front Psychol Date: 2013-06-07

5. Confidence-based progress-driven self-generated goals for skill acquisition in developmental robots.

Authors: Hung Ngo; Matthew Luciw; Alexander Förster; Jürgen Schmidhuber
Journal: Front Psychol Date: 2013-11-26

6. Curiosity driven reinforcement learning for motion planning on humanoids.

Authors: Mikhail Frank; Jürgen Leitner; Marijn Stollenga; Alexander Förster; Jürgen Schmidhuber
Journal: Front Neurorobot Date: 2014-01-06 Impact factor: 2.650

6 in total