Xiaolong Ma, Konstantin K Likharev.
Abstract
In this letter, we have found a more general formulation of the REward Increment = Nonnegative Factor × Offset Reinforcement × Characteristic Eligibility (REINFORCE) learning principle first suggested by Williams. The new formulation has enabled us to apply the principle to global reinforcement learning in networks with various sources of randomness, and to suggest several simple local rules for such networks. Numerical simulations have shown that, for simple classification and reinforcement learning tasks, at least one family of the new learning rules gives results comparable to those provided by the famous rules A(r-i) and A(r-p) for Boltzmann machines.
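For orientation, the product form spelled out in the acronym can be sketched for the simplest case: Williams's original rule applied to a single Bernoulli-logistic unit, where the weight change is (learning rate) × (reward minus baseline) × (derivative of the log-probability of the emitted output). This is only an illustrative sketch of that original rule, not the generalized formulation or the local rules proposed in the letter. The toy task, the function names, the learning rate, and the zero baseline are all assumptions made for the example.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def reinforce_step(w, x, target, rng, alpha=0.1, baseline=0.0):
    """One REINFORCE update for a single Bernoulli-logistic unit.

    alpha, baseline, and the 0/1 reward scheme are illustrative assumptions.
    """
    p = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))  # firing probability
    y = 1 if rng.random() < p else 0                   # stochastic output
    r = 1.0 if y == target else 0.0                    # scalar reward
    # Characteristic eligibility for this unit: d ln Pr(y) / d w_j = (y - p) * x_j
    new_w = [wi + alpha * (r - baseline) * (y - p) * xi
             for wi, xi in zip(w, x)]
    return new_w, r


def train(steps=5000, seed=0):
    """Hypothetical toy task: emit 1 when the first input is 1, else 0."""
    rng = random.Random(seed)
    # The second input component is a constant bias term.
    data = [([1.0, 1.0], 1), ([0.0, 1.0], 0)]
    w = [0.0, 0.0]
    for _ in range(steps):
        x, t = rng.choice(data)
        w, _ = reinforce_step(w, x, t, rng)
    return w


if __name__ == "__main__":
    w = train()
    print("P(y=1 | x1=1):", sigmoid(w[0] + w[1]))
    print("P(y=1 | x1=0):", sigmoid(w[1]))
```

The unit only ever sees the scalar reward, never the target itself, which is the sense in which the reinforcement signal is global.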
Year: 2007
PMID: 17385640
DOI: 10.1109/TNN.2006.888376
Source DB: PubMed
Journal: IEEE Trans Neural Netw
ISSN: 1045-9227