| Literature DB >> 12416698 |
Abstract
This article is concerned with the representation and generalisation of continuous action spaces in reinforcement learning (RL) problems. A model is proposed based on the self-organising map (SOM) of Kohonen [Self Organisation and Associative Memory, 1987] which allows either the one-to-one, many-to-one or one-to-many structure of the desired state-action mapping to be captured. Although presented here for tasks involving immediate reward, the approach is easily extended to delayed reward. We conclude that the SOM is a useful tool for providing real-time, on-line generalisation in RL problems in which the latent dimensionalities of the state and action spaces are small. Scalability issues are also discussed.Mesh:
Year: 2002 PMID: 12416698 DOI: 10.1016/s0893-6080(02)00083-7
Source DB: PubMed Journal: Neural Netw ISSN: 0893-6080