Literature DB >> 24808076

Convergence analyses on on-line weight noise injection-based training algorithms for MLPs.

John Sum, Chi-Sing Leung, Kevin Ho.   

Abstract

Injecting weight noise during training is a simple technique that has been proposed for almost two decades. However, little is known about its convergence behavior. This paper studies the convergence of two weight noise injection-based training algorithms, multiplicative weight noise injection with weight decay and additive weight noise injection with weight decay. We consider that they are applied to multilayer perceptrons either with linear or sigmoid output nodes. Let w(t) be the weight vector, let V(w) be the corresponding objective function of the training algorithm, let α >; 0 be the weight decay constant, and let μ(t) be the step size. We show that if μ(t)→ 0, then with probability one E[||w(t)||2(2)] is bound and lim(t) → ∞ ||w(t)||2 exists. Based on these two properties, we show that if μ(t)→ 0, Σtμ(t)=∞, and Σtμ(t)(2) <; ∞, then with probability one these algorithms converge. Moreover, w(t) converges with probability one to a point where ∇wV(w)=0.

Entities:  

Year:  2012        PMID: 24808076     DOI: 10.1109/TNNLS.2012.2210243

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  1 in total

1.  Deterministic convergence of chaos injection-based gradient method for training feedforward neural networks.

Authors:  Huisheng Zhang; Ying Zhang; Dongpo Xu; Xiaodong Liu
Journal:  Cogn Neurodyn       Date:  2015-01-01       Impact factor: 5.082

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.