Literature DB >> 10085428

Parameter convergence and learning curves for neural networks.

T L Fine1, S Mukherjee.   

Abstract

We revisit the oft-studied asymptotic (in sample size) behavior of the parameter or weight estimate returned by any member of a large family of neural network training algorithms. By properly accounting for the characteristic property of neural networks that their empirical and generalization errors possess multiple minima, we rigorously establish conditions under which the parameter estimate converges strongly into the set of minima of the generalization error. Convergence of the parameter estimate to a particular value cannot be guaranteed under our assumptions. We then evaluate the asymptotic distribution of the distance between the parameter estimate and its nearest neighbor among the set of minima of the generalization error. Results on this question have appeared numerous times and generally assert asymptotic normality, the conclusion expected from familiar statistical arguments concerned with maximum likelihood estimators. These conclusions are usually reached on the basis of somewhat informal calculations, although we shall see that the situation is somewhat delicate. The preceding results then provide a derivation of learning curves for generalization and empirical errors that leads to bounds on rates of convergence.

Mesh:

Year:  1999        PMID: 10085428     DOI: 10.1162/089976699300016647

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  3 in total

1.  Generalization of learning by synchronous waves: from perceptual organization to invariant organization.

Authors:  David M Alexander; Chris Trengove; Phillip E Sheridan; Cees van Leeuwen
Journal:  Cogn Neurodyn       Date:  2010-12-10       Impact factor: 5.082

2.  Deterministic convergence of chaos injection-based gradient method for training feedforward neural networks.

Authors:  Huisheng Zhang; Ying Zhang; Dongpo Xu; Xiaodong Liu
Journal:  Cogn Neurodyn       Date:  2015-01-01       Impact factor: 5.082

3.  Analysis of the adsorption and retention models for Cd, Cr, Cu, Ni, Pb, and Zn through neural networks: selection of variables and competitive model.

Authors:  Juan J González-Costa; Manuel J Reigosa-Roger; José M Matías; Emma Fernández-Covelo
Journal:  Environ Sci Pollut Res Int       Date:  2018-06-29       Impact factor: 4.223

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.