Literature DB >> 28668660

Accelerating deep neural network training with inconsistent stochastic gradient descent.

Linnan Wang1, Yi Yang2, Renqiang Min2, Srimat Chakradhar2.   

Abstract

Stochastic Gradient Descent (SGD) updates Convolutional Neural Network (CNN) with a noisy gradient computed from a random batch, and each batch evenly updates the network once in an epoch. This model applies the same training effort to each batch, but it overlooks the fact that the gradient variance, induced by Sampling Bias and Intrinsic Image Difference, renders different training dynamics on batches. In this paper, we develop a new training strategy for SGD, referred to as Inconsistent Stochastic Gradient Descent (ISGD) to address this problem. The core concept of ISGD is the inconsistent training, which dynamically adjusts the training effort w.r.t the loss. ISGD models the training as a stochastic process that gradually reduces down the mean of batch's loss, and it utilizes a dynamic upper control limit to identify a large loss batch on the fly. ISGD stays on the identified batch to accelerate the training with additional gradient updates, and it also has a constraint to penalize drastic parameter changes. ISGD is straightforward, computationally efficient and without requiring auxiliary memories. A series of empirical evaluations on real world datasets and networks demonstrate the promising performance of inconsistent training.
Copyright © 2017 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Neural networks; Statistical process control; Stochastic gradient descent

Mesh:

Year:  2017        PMID: 28668660     DOI: 10.1016/j.neunet.2017.06.003

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  2 in total

1.  Residual convolutional neural network for predicting response of transarterial chemoembolization in hepatocellular carcinoma from CT imaging.

Authors:  Jie Peng; Shuai Kang; Zhengyuan Ning; Hangxia Deng; Jingxian Shen; Yikai Xu; Jing Zhang; Wei Zhao; Xinling Li; Wuxing Gong; Jinhua Huang; Li Liu
Journal:  Eur Radiol       Date:  2019-07-22       Impact factor: 5.315

2.  Novel and versatile artificial intelligence algorithms for investigating possible GHSR1α and DRD1 agonists for Alzheimer's disease.

Authors:  Zi-Qiang Tang; Lu Zhao; Guan-Xing Chen; Calvin Yu-Chian Chen
Journal:  RSC Adv       Date:  2021-02-04       Impact factor: 3.361

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.