| Literature DB >> 27066332 |
Qinwei Fan1, Wei Wu2, Jacek M Zurada3.
Abstract
This paper presents new theoretical results on the backpropagation algorithm with smoothing [Formula: see text] regularization and adaptive momentum for feedforward neural networks with a single hidden layer, i.e., we show that the gradient of error function goes to zero and the weight sequence goes to a fixed point as n (n is iteration steps) tends to infinity, respectively. Also, our results are more general since we do not require the error function to be quadratic or uniformly convex, and neuronal activation functions are relaxed. Moreover, compared with existed algorithms, our novel algorithm can get more sparse network structure, namely it forces weights to become smaller during the training and can eventually removed after the training, which means that it can simply the network structure and lower operation time. Finally, two numerical experiments are presented to show the characteristics of the main results in detail.Entities:
Keywords: Adaptive momentum; Convergence; Feedforward neural networks; Smoothing [Formula: see text] regularization
Year: 2016 PMID: 27066332 PMCID: PMC4783325 DOI: 10.1186/s40064-016-1931-0
Source DB: PubMed Journal: Springerplus ISSN: 2193-1801
Fig. 1Feedforward neural network with one hidden layer and one output
3-bit parity problem
| Input | Output | Input | Output | ||||||
|---|---|---|---|---|---|---|---|---|---|
| 1 | 1 | 1 | −1 | 1 | 1 | −1 | −1 | −1 | 1 |
| 1 | 1 | −1 | −1 | 0 | −1 | 1 | 1 | −1 | 0 |
| 1 | −1 | 1 | −1 | 0 | −1 | −1 | 1 | −1 | 1 |
| −1 | −1 | −1 | −1 | 0 | −1 | 1 | −1 | −1 | 1 |
Fig. 2The curve of error function
Fig. 3The curve of norm of gradient
Fig. 4The curve of norm of weight