Literature DB >> 11681750

Algebraic geometrical methods for hierarchical learning machines.

S Watanabe1.   

Abstract

Hierarchical learning machines such as layered perceptrons, radial basis functions, Gaussian mixtures are non-identifiable learning machines, whose Fisher information matrices are not positive definite. This fact shows that conventional statistical asymptotic theory cannot be applied to neural network learning theory, for example either the Bayesian a posteriori probability distribution does not converge to the Gaussian distribution, or the generalization error is not in proportion to the number of parameters. The purpose of this paper is to overcome this problem and to clarify the relation between the learning curve of a hierarchical learning machine and the algebraic geometrical structure of the parameter space. We establish an algorithm to calculate the Bayesian stochastic complexity based on blowing-up technology in algebraic geometry and prove that the Bayesian generalization error of a hierarchical learning machine is smaller than that of a regular statistical model, even if the true distribution is not contained in the parametric model.

Mesh:

Year:  2001        PMID: 11681750     DOI: 10.1016/s0893-6080(01)00069-7

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  2 in total

1.  Learning Coefficient of Vandermonde Matrix-Type Singularities in Model Selection.

Authors:  Miki Aoyagi
Journal:  Entropy (Basel)       Date:  2019-06-04       Impact factor: 2.524

2.  Development of spectral decomposition based on Bayesian information criterion with estimation of confidence interval.

Authors:  Hiroshi Shinotsuka; Kenji Nagata; Hideki Yoshikawa; Yoh-Ichi Mototake; Hayaru Shouno; Masato Okada
Journal:  Sci Technol Adv Mater       Date:  2020-07-02       Impact factor: 8.090

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.