Literature DB >> 31167336

Temporal Evolution of Generalization during Learning in Linear Networks.

Pierre Baldi1, Yves Chauvin2.   

Abstract

We study generalization in a simple framework of feedforward linear networks with n inputs and n outputs, trained from examples by gradient descent on the usual quadratic error function. We derive analytical results on the behavior of the validation function corresponding to the LMS error function calculated on a set of validation patterns. We show that the behavior of the validation function depends critically on the initial conditions and on the characteristics of the noise. Under certain simple assumptions, if the initial weights are sufficiently small, the validation function has a unique minimum corresponding to an optimal stopping time for training for which simple bounds can be calculated. There exists also situations where the validation function can have more complicated and somewhat unexpected behavior such as multiple local minima (at most n) of variable depth and long but finite plateau effects. Additional results and possible extensions are briefly discussed.

Entities:  

Year:  1991        PMID: 31167336     DOI: 10.1162/neco.1991.3.4.589

Source DB:  PubMed          Journal:  Neural Comput        ISSN: 0899-7667            Impact factor:   2.026


  1 in total

1.  High-dimensional dynamics of generalization error in neural networks.

Authors:  Madhu S Advani; Andrew M Saxe; Haim Sompolinsky
Journal:  Neural Netw       Date:  2020-09-05
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.