Locally adaptive activation functions with slope recovery for deep and physics-informed neural networks.

Ameya D Jagtap, Kenji Kawaguchi, George Em Karniadakis.

Abstract

We propose two types of locally adaptive activation functions, layer-wise and neuron-wise, which improve the performance of deep and physics-informed neural networks. The local adaptation is achieved by introducing a scalable parameter in each layer (layer-wise) or for every neuron (neuron-wise) and then optimizing it using a variant of the stochastic gradient descent algorithm. To further increase the training speed, a slope recovery term based on the activation slope is added to the loss function, which accelerates convergence and thereby reduces the training cost. On the theoretical side, we prove that, under practical conditions on the initialization and learning rate, the gradient descent algorithms in the proposed method are not attracted to sub-optimal critical points or local minima, and that the gradient dynamics of the proposed method are not achievable by base methods with any (adaptive) learning rates. We further show that the adaptive activation methods accelerate convergence by implicitly multiplying the gradient of the base method by conditioning matrices, without any explicit computation of the conditioning matrix or the matrix-vector product. The different adaptive activation functions are shown to induce different implicit conditioning matrices. Furthermore, the proposed methods with slope recovery are shown to accelerate the training process.
© 2020 The Author(s).
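
The two adaptive variants and the slope recovery term described in the abstract can be illustrated with a minimal sketch. The abstract does not give the functional forms, so everything concrete here is an assumption made for illustration: tanh as the base activation, a fixed scale factor `n` multiplying the trainable slope, and a slope recovery term equal to the reciprocal of the mean over layers of the exponential of each layer's mean slope. All function names are hypothetical.

```python
import math

def layerwise_activation(pre_acts, a, n=1.0):
    """Layer-wise variant: a single trainable slope `a` is shared by the layer.

    Assumed form: tanh(n * a * z), with `n` a fixed scale factor.
    """
    return [math.tanh(n * a * z) for z in pre_acts]

def neuronwise_activation(pre_acts, slopes, n=1.0):
    """Neuron-wise variant: each neuron i has its own trainable slope a_i."""
    if len(pre_acts) != len(slopes):
        raise ValueError("need exactly one slope per neuron")
    return [math.tanh(n * a_i * z) for a_i, z in zip(slopes, pre_acts)]

def slope_recovery(layer_slopes):
    """Assumed slope recovery term.

    `layer_slopes` holds one list of slopes per hidden layer (a one-element
    list for the layer-wise variant). The term shrinks as the slopes grow.
    """
    layer_means = [sum(s) / len(s) for s in layer_slopes]
    mean_exp = sum(math.exp(m) for m in layer_means) / len(layer_means)
    return 1.0 / mean_exp

def total_loss(base_loss, layer_slopes, weight=1.0):
    """Base loss plus the (assumed) slope recovery term; `weight` is hypothetical."""
    return base_loss + weight * slope_recovery(layer_slopes)

# Two hidden layers: neuron-wise slopes in the first, one shared slope in the second.
hidden = neuronwise_activation([0.3, -1.2], slopes=[1.0, 1.5])
output = layerwise_activation(hidden, a=1.0)
loss = total_loss(base_loss=0.25, layer_slopes=[[1.0, 1.5], [1.0]])
```

Under these assumptions the added term decreases as the slopes increase, so minimizing the total loss pushes the slopes upward. In an actual training setup the slopes would be trainable parameters updated by the optimizer together with the weights and biases.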

Keywords:  accelerated training; bad minima; deep learning benchmarks; machine learning; physics-informed neural networks; stochastic gradients

Year:  2020        PMID: 32831616      PMCID: PMC7426042          DOI: 10.1098/rspa.2020.0334

Source DB:  PubMed          Journal:  Proc Math Phys Eng Sci        ISSN: 1364-5021            Impact factor:   2.704


  12 in total

1.  Thermal fluid fields reconstruction for nanofluids convection based on physics-informed deep learning.

Authors:  Yunzhu Li; Tianyuan Liu; Yonghui Xie
Journal:  Sci Rep       Date:  2022-07-22       Impact factor: 4.996

2.  Physics-informed attention-based neural network for hyperbolic partial differential equations: application to the Buckley-Leverett problem.

Authors:  Ruben Rodriguez-Torrado; Pablo Ruiz; Luis Cueto-Felgueroso; Michael Cerny Green; Tyler Friesen; Sebastien Matringe; Julian Togelius
Journal:  Sci Rep       Date:  2022-05-09       Impact factor: 4.996

3.  Physics-Informed Neural Networks for Brain Hemodynamic Predictions Using Medical Imaging.

Authors:  Mohammad Sarabian; Hessam Babaee; Kaveh Laksari
Journal:  IEEE Trans Med Imaging       Date:  2022-08-31       Impact factor: 11.037

4.  On transformative adaptive activation functions in neural networks for gene expression inference.

Authors:  Vladimír Kunc; Jiří Kléma
Journal:  PLoS One       Date:  2021-01-14       Impact factor: 3.240

5.  Breast Cancer Mammograms Classification Using Deep Neural Network and Entropy-Controlled Whale Optimization Algorithm.

Authors:  Saliha Zahoor; Umar Shoaib; Ikram Ullah Lali
Journal:  Diagnostics (Basel)       Date:  2022-02-21

6.  Data-driven discovery of Green's functions with human-understandable deep learning.

Authors:  Nicolas Boullé; Christopher J Earls; Alex Townsend
Journal:  Sci Rep       Date:  2022-03-22       Impact factor: 4.379

7.  Analyses of internal structures and defects in materials using physics-informed neural networks.

Authors:  Enrui Zhang; Ming Dao; George Em Karniadakis; Subra Suresh
Journal:  Sci Adv       Date:  2022-02-16       Impact factor: 14.136

8.  Diagnostic Strategies for Breast Cancer Detection: From Image Generation to Classification Strategies Using Artificial Intelligence Algorithms. (Review)

Authors:  Jesus A Basurto-Hurtado; Irving A Cruz-Albarran; Manuel Toledano-Ayala; Mario Alberto Ibarra-Manzano; Luis A Morales-Hernandez; Carlos A Perez-Ramirez
Journal:  Cancers (Basel)       Date:  2022-07-15       Impact factor: 6.575

9.  Learning hidden elasticity with deep neural networks.

Authors:  Chun-Teh Chen; Grace X Gu
Journal:  Proc Natl Acad Sci U S A       Date:  2021-08-03       Impact factor: 11.205

10.  Expert-enhanced machine learning for cardiac arrhythmia classification.

Authors:  Sebastian Sager; Felix Bernhardt; Florian Kehrle; Maximilian Merkert; Andreas Potschka; Benjamin Meder; Hugo Katus; Eberhard Scholz
Journal:  PLoS One       Date:  2021-12-23       Impact factor: 3.240

