
diffGrad: An Optimization Method for Convolutional Neural Networks.

Shiv Ram Dubey, Soumendu Chakraborty, Swalpa Kumar Roy, Snehasis Mukherjee, Satish Kumar Singh, Bidyut Baran Chaudhuri.   

Abstract

Stochastic gradient descent (SGD) is one of the core techniques behind the success of deep neural networks. The gradient provides information on the direction in which a function has the steepest rate of change. The main problem with basic SGD is that it takes equal-sized steps for all parameters, irrespective of the gradient behavior. Hence, an efficient way to optimize deep networks is to use an adaptive step size for each parameter. Recently, several attempts have been made to improve gradient descent, such as AdaGrad, AdaDelta, RMSProp, and adaptive moment estimation (Adam). These methods rely on the square roots of exponential moving averages of squared past gradients and thus do not take advantage of local changes in the gradients. In this article, a novel optimizer, diffGrad, is proposed based on the difference between the present and the immediate past gradient. In diffGrad, the step size is adjusted for each parameter so that parameters whose gradients change quickly take larger steps and parameters whose gradients change slowly take smaller steps. The convergence analysis is carried out using the regret bound approach of the online learning framework. A thorough analysis is performed on three synthetic complex nonconvex functions. Image categorization experiments are also conducted on the CIFAR10 and CIFAR100 data sets to compare diffGrad with state-of-the-art optimizers such as SGDM, AdaGrad, AdaDelta, RMSProp, AMSGrad, and Adam. A residual unit (ResNet)-based convolutional neural network (CNN) architecture is used in the experiments. The experiments show that diffGrad outperforms the other optimizers. Also, we show that diffGrad performs uniformly well for training CNNs with different activation functions. The source code is publicly available at https://github.com/shivram1987/diffGrad.
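The abstract describes the update rule only in words, so the following is a minimal sketch of one diffGrad-style step rather than the authors' implementation. It assumes Adam-style bias-corrected moment estimates and a "friction" coefficient computed as the sigmoid of the absolute difference between the previous and current gradient. The function names (`init_state`, `diffgrad_step`), the state layout, and the default hyperparameters are illustrative assumptions; the repository linked above is the authoritative reference.

```python
import math


def init_state(n):
    """Create optimizer state for n scalar parameters (hypothetical layout)."""
    return {"t": 0, "m": [0.0] * n, "v": [0.0] * n, "prev_grad": [0.0] * n}


def diffgrad_step(theta, grad, state, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """Apply one diffGrad-style update to a list of scalar parameters.

    Assumption: the step is an Adam step scaled per parameter by
    xi = sigmoid(|g_prev - g|), which lies between 0.5 and 1.
    """
    state["t"] += 1
    t = state["t"]
    new_theta = []
    for i, (p, g) in enumerate(zip(theta, grad)):
        # Exponential moving averages of the gradient and the squared gradient.
        state["m"][i] = beta1 * state["m"][i] + (1 - beta1) * g
        state["v"][i] = beta2 * state["v"][i] + (1 - beta2) * g * g
        # Bias correction, as in Adam.
        m_hat = state["m"][i] / (1 - beta1 ** t)
        v_hat = state["v"][i] / (1 - beta2 ** t)
        # Friction coefficient: larger when the gradient changed a lot
        # since the previous step, 0.5 when it did not change at all.
        xi = 1.0 / (1.0 + math.exp(-abs(state["prev_grad"][i] - g)))
        new_theta.append(p - lr * xi * m_hat / (math.sqrt(v_hat) + eps))
        state["prev_grad"][i] = g
    return new_theta


# Usage: minimize f(x) = x^2 (gradient 2x) from x = 1.0.
x = [1.0]
opt_state = init_state(1)
for _ in range(200):
    x = diffgrad_step(x, [2.0 * x[0]], opt_state, lr=0.1)
```

With this choice of coefficient, a parameter whose gradient is unchanged between steps moves at half the Adam step, while a parameter whose gradient changes sharply moves at close to the full Adam step, which matches the behavior the abstract describes.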

Year:  2020        PMID: 31880565     DOI: 10.1109/TNNLS.2019.2955777

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


