Literature DB >> 24291693

Batch gradient method with smoothing L1/2 regularization for training of feedforward neural networks.

Wei Wu1, Qinwei Fan2, Jacek M Zurada3, Jian Wang4, Dakun Yang1, Yan Liu5.   

Abstract

The aim of this paper is to develop a novel method to prune feedforward neural networks by introducing an L1/2 regularization term into the error function. This procedure forces weights to become smaller during the training and can eventually removed after the training. The usual L1/2 regularization term involves absolute values and is not differentiable at the origin, which typically causes oscillation of the gradient of the error function during the training. A key point of this paper is to modify the usual L1/2 regularization term by smoothing it at the origin. This approach offers the following three advantages: First, it removes the oscillation of the gradient value. Secondly, it gives better pruning, namely the final weights to be removed are smaller than those produced through the usual L1/2 regularization. Thirdly, it makes it possible to prove the convergence of the training. Supporting numerical examples are also provided.
Copyright © 2013 Elsevier Ltd. All rights reserved.

Keywords:  Batch gradient method; Convergence; Feedforward neural networks; Smoothing regularization

Mesh:

Year:  2013        PMID: 24291693     DOI: 10.1016/j.neunet.2013.11.006

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  1 in total

1.  Convergence of batch gradient learning with smoothing regularization and adaptive momentum for neural networks.

Authors:  Qinwei Fan; Wei Wu; Jacek M Zurada
Journal:  Springerplus       Date:  2016-03-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.