Literature DB >> 17385640

Global reinforcement learning in neural networks.

Xiaolong Ma, Konstantin K Likharev.   

Abstract

In this letter, we have found a more general formulation of the REward Increment = Nonnegative Factor x Offset Reinforcement x Characteristic Eligibility (REINFORCE) learning principle first suggested by Williams. The new formulation has enabled us to apply the principle to global reinforcement learning in networks with various sources of randomness, and to suggest several simple local rules for such networks. Numerical simulations have shown that for simple classification and reinforcement learning tasks, at least one family of the new learning rules gives results comparable to those provided by the famous Rules A(r-i) and A(r-p) for the Boltzmann machines.

Entities:  

Mesh:

Year:  2007        PMID: 17385640     DOI: 10.1109/TNN.2006.888376

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw        ISSN: 1045-9227


  3 in total

1.  Spike-timing-dependent plasticity learning of coincidence detection with passively integrated memristive circuits.

Authors:  M Prezioso; M R Mahmoodi; F Merrikh Bayat; H Nili; H Kim; A Vincent; D B Strukov
Journal:  Nat Commun       Date:  2018-12-14       Impact factor: 14.919

2.  Versatile stochastic dot product circuits based on nonvolatile memories for high performance neurocomputing and neurooptimization.

Authors:  M R Mahmoodi; M Prezioso; D B Strukov
Journal:  Nat Commun       Date:  2019-11-08       Impact factor: 14.919

3.  Segmentation of neuronal structures using SARSA (λ)-based boundary amendment with reinforced gradient-descent curve shape fitting.

Authors:  Fei Zhu; Quan Liu; Yuchen Fu; Bairong Shen
Journal:  PLoS One       Date:  2014-03-13       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.