Literature DB >> 32843349

Archetypal landscapes for deep neural networks.

Philipp C Verpoort1, Alpha A Lee2, David J Wales3.   

Abstract

The predictive capabilities of deep neural networks (DNNs) continue to evolve to increasingly impressive levels. However, it is still unclear how training procedures for DNNs succeed in finding parameters that produce good results for such high-dimensional and nonconvex loss functions. In particular, we wish to understand why simple optimization schemes, such as stochastic gradient descent, do not end up trapped in local minima with high loss values that would not yield useful predictions. We explain the optimizability of DNNs by characterizing the local minima and transition states of the loss-function landscape (LFL) along with their connectivity. We show that the LFL of a DNN in the shallow network or data-abundant limit is funneled, and thus easy to optimize. Crucially, in the opposite low-data/deep limit, although the number of minima increases, the landscape is characterized by many minima with similar loss values separated by low barriers. This organization is different from the hierarchical landscapes of structural glass formers and explains why minimization procedures commonly employed by the machine-learning community can navigate the LFL successfully and reach low-lying solutions.

Keywords:  deep learning; energy landscapes; neural networks; optimization; statistical mechanics

Year:  2020        PMID: 32843349      PMCID: PMC7486703          DOI: 10.1073/pnas.1919995117

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  28 in total

1.  The protein folding network.

Authors:  Francesco Rao; Amedeo Caflisch
Journal:  J Mol Biol       Date:  2004-09-03       Impact factor: 5.469

2.  Energy landscapes of clusters bound by short-ranged potentials.

Authors:  David J Wales
Journal:  Chemphyschem       Date:  2010-08-23       Impact factor: 3.102

3.  Connectivity in the potential energy landscape for binary Lennard-Jones systems.

Authors:  Vanessa K de Souza; David J Wales
Journal:  J Chem Phys       Date:  2009-05-21       Impact factor: 3.488

Review 4.  Energy landscapes: some new horizons.

Authors:  David J Wales
Journal:  Curr Opin Struct Biol       Date:  2010-01-22       Impact factor: 6.809

5.  Packing structures and transitions in liquids and solids.

Authors:  F H Stillinger; T A Weber
Journal:  Science       Date:  1984-09-07       Impact factor: 47.728

6.  Unification of algorithms for minimum mode optimization.

Authors:  Yi Zeng; Penghao Xiao; Graeme Henkelman
Journal:  J Chem Phys       Date:  2014-01-28       Impact factor: 3.488

7.  Energy landscapes for machine learning.

Authors:  Andrew J Ballard; Ritankar Das; Stefano Martiniani; Dhagash Mehta; Levent Sagun; Jacob D Stevenson; David J Wales
Journal:  Phys Chem Chem Phys       Date:  2017-05-24       Impact factor: 3.676

8.  Monte Carlo-minimization approach to the multiple-minima problem in protein folding.

Authors:  Z Li; H A Scheraga
Journal:  Proc Natl Acad Sci U S A       Date:  1987-10       Impact factor: 11.205

9.  Pathways for diffusion in the potential energy landscape of the network glass former SiO2.

Authors:  S P Niblett; M Biedermann; D J Wales; V K de Souza
Journal:  J Chem Phys       Date:  2017-10-21       Impact factor: 3.488

10.  Exploring the free energy landscape: from dynamics to networks and back.

Authors:  Diego Prada-Gracia; Jesús Gómez-Gardeñes; Pablo Echenique; Fernando Falo
Journal:  PLoS Comput Biol       Date:  2009-06-26       Impact factor: 4.475

View more
  1 in total

1.  Integration of Machine Learning and Coarse-Grained Molecular Simulations for Polymer Materials: Physical Understandings and Molecular Design.

Authors:  Danh Nguyen; Lei Tao; Ying Li
Journal:  Front Chem       Date:  2022-01-24       Impact factor: 5.221

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.