Literature DB >> 31295691

Depth with nonlinearity creates no bad local minima in ResNets.

Kenji Kawaguchi1, Yoshua Bengio2.   

Abstract

In this paper, we prove that depth with nonlinearity creates no bad local minima in a type of arbitrarily deep ResNets with arbitrary nonlinear activation functions, in the sense that the values of all local minima are no worse than the global minimum value of corresponding classical machine-learning models, and are guaranteed to further improve via residual representations. As a result, this paper provides an affirmative answer to an open question stated in a paper in the conference on Neural Information Processing Systems 2018. This paper advances the optimization theory of deep learning only for ResNets and not for other network architectures.
Copyright © 2019 The Author(s). Published by Elsevier Ltd.. All rights reserved.

Keywords:  Deep learning; Local minima; Non-convex optimization; Residual neural network

Year:  2019        PMID: 31295691     DOI: 10.1016/j.neunet.2019.06.009

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  4 in total

1.  ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees.

Authors:  Kuan-Lin Chen; Ching-Hua Lee; Harinath Garudadri; Bhaskar D Rao
Journal:  Adv Neural Inf Process Syst       Date:  2021-12

2.  Glomerular Classification Using Convolutional Neural Networks Based on Defined Annotation Criteria and Concordance Evaluation Among Clinicians.

Authors:  Ryohei Yamaguchi; Yoshimasa Kawazoe; Kiminori Shimamoto; Emiko Shinohara; Tatsuo Tsukamoto; Yukako Shintani-Domoto; Hajime Nagasu; Hiroshi Uozaki; Tetsuo Ushiku; Masaomi Nangaku; Naoki Kashihara; Akira Shimizu; Michio Nagata; Kazuhiko Ohe
Journal:  Kidney Int Rep       Date:  2020-12-13

3.  Manipulation of free-floating objects using Faraday flows and deep reinforcement learning.

Authors:  David Hardman; Thomas George Thuruthel; Fumiya Iida
Journal:  Sci Rep       Date:  2022-01-10       Impact factor: 4.379

4.  Multimode Gesture Recognition Algorithm Based on Convolutional Long Short-Term Memory Network.

Authors:  Ming-Xing Lu; Guo-Zhen Du; Zhan-Fang Li
Journal:  Comput Intell Neurosci       Date:  2022-03-02
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.