Archetypal landscapes for deep neural networks.

Philipp C Verpoort, Alpha A Lee, David J Wales.

Abstract

The predictive capabilities of deep neural networks (DNNs) continue to evolve to increasingly impressive levels. However, it is still unclear how training procedures for DNNs succeed in finding parameters that produce good results for such high-dimensional and nonconvex loss functions. In particular, we wish to understand why simple optimization schemes, such as stochastic gradient descent, do not end up trapped in local minima with high loss values that would not yield useful predictions. We explain the optimizability of DNNs by characterizing the local minima and transition states of the loss-function landscape (LFL) along with their connectivity. We show that the LFL of a DNN in the shallow network or data-abundant limit is funneled, and thus easy to optimize. Crucially, in the opposite low-data/deep limit, although the number of minima increases, the landscape is characterized by many minima with similar loss values separated by low barriers. This organization is different from the hierarchical landscapes of structural glass formers and explains why minimization procedures commonly employed by the machine-learning community can navigate the LFL successfully and reach low-lying solutions.
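As a rough illustration of what "characterizing the local minima and transition states" of a landscape involves, the sketch below locates the minima of a toy one-dimensional double-well function by gradient descent from several starting points, then measures the barrier between them. The function, the step size, and the names `loss`, `grad`, `descend`, `find_minima`, and `barrier` are all assumptions made for this example. This is not the authors' method and the toy function is not a neural-network loss. In one dimension the highest point on the straight line between two minima is the transition state; in higher dimensions a dedicated transition-state search would be needed instead.

```python
# Toy illustration only: a 1D double-well "loss", not a neural-network loss.

def loss(x):
    """Hypothetical nonconvex loss with two local minima."""
    return x**4 - 2.0 * x**2 + 0.3 * x


def grad(x):
    """Analytic derivative of the toy loss."""
    return 4.0 * x**3 - 4.0 * x + 0.3


def descend(x0, lr=0.01, steps=2000):
    """Plain gradient descent from x0; returns the point it settles at."""
    x = x0
    for _ in range(steps):
        x -= lr * grad(x)
    return x


def find_minima(starts, tol=1e-6):
    """Run descent from each start and keep the distinct end points."""
    minima = []
    for x0 in starts:
        x = descend(x0)
        if all(abs(x - m) > tol for m in minima):
            minima.append(x)
    return sorted(minima)


def barrier(a, b, samples=1001):
    """Height of the highest point between a and b above the higher minimum."""
    highest = max(loss(a + (b - a) * i / (samples - 1)) for i in range(samples))
    return highest - max(loss(a), loss(b))


if __name__ == "__main__":
    minima = find_minima([-2.0, -1.0, -0.5, 0.5, 1.0, 2.0])
    for m in minima:
        print(f"minimum at x={m:.4f}, loss={loss(m):.4f}")
    print(f"barrier above the higher minimum: {barrier(minima[0], minima[-1]):.4f}")
```

A funneled landscape in the abstract's sense would show descent from most starting points ending in the same low-lying region, whereas a landscape with many similar minima separated by low barriers would show many distinct end points with comparable loss values and small `barrier` results.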

Keywords: deep learning; energy landscapes; neural networks; optimization; statistical mechanics

Year: 2020; PMID: 32843349; PMCID: PMC7486703; DOI: 10.1073/pnas.1919995117

Source DB: PubMed; Journal: Proc Natl Acad Sci U S A; ISSN: 0027-8424; Impact factor: 11.205
