Literature DB >> 10937965

Local minima and plateaus in hierarchical structures of multilayer perceptrons.

K Fukumizu1, S Amari.   

Abstract

Local minima and plateaus pose a serious problem in learning of neural networks. We investigate the hierarchical geometric structure of the parameter space of three-layer perceptrons in order to show the existence of local minima and plateaus. It is proved that a critical point of the model with H - 1 hidden units always gives many critical points of the model with H hidden units. These critical points consist of many lines in the parameter space, which can cause plateaus in learning of neural networks. Based on this result, we prove that a point in the critical lines corresponding to the global minimum of the smaller model can be a local minimum or a saddle point of the larger model. We give a necessary and sufficient condition for this, and show that this kind of local minima exist as a line segment if any. The results are universal in the sense that they do not require special properties of the target, loss functions and activation functions, but only use the hierarchical structure of the model.

Mesh:

Year:  2000        PMID: 10937965     DOI: 10.1016/s0893-6080(00)00009-5

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  2 in total

1.  Learning, memory, and the role of neural network architecture.

Authors:  Ann M Hermundstad; Kevin S Brown; Danielle S Bassett; Jean M Carlson
Journal:  PLoS Comput Biol       Date:  2011-06-30       Impact factor: 4.475

2.  Decoding of Human Movements Based on Deep Brain Local Field Potentials Using Ensemble Neural Networks.

Authors:  Mohammad S Islam; Khondaker A Mamun; Hai Deng
Journal:  Comput Intell Neurosci       Date:  2017-10-19
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.