Literature DB >> 24771879

The Dropout Learning Algorithm.

Pierre Baldi1, Peter Sadowski1.   

Abstract

Dropout is a recently introduced algorithm for training neural network by randomly dropping units during training to prevent their co-adaptation. A mathematical analysis of some of the static and dynamic properties of dropout is provided using Bernoulli gating variables, general enough to accommodate dropout on units or connections, and with variable rates. The framework allows a complete analysis of the ensemble averaging properties of dropout in linear networks, which is useful to understand the non-linear case. The ensemble averaging properties of dropout in non-linear logistic networks result from three fundamental equations: (1) the approximation of the expectations of logistic functions by normalized geometric means, for which bounds and estimates are derived; (2) the algebraic equality between normalized geometric means of logistic functions with the logistic of the means, which mathematically characterizes logistic functions; and (3) the linearity of the means with respect to sums, as well as products of independent variables. The results are also extended to other classes of transfer functions, including rectified linear functions. Approximation errors tend to cancel each other and do not accumulate. Dropout can also be connected to stochastic neurons and used to predict firing rates, and to backpropagation by viewing the backward propagation as ensemble averaging in a dropout linear network. Moreover, the convergence properties of dropout can be understood in terms of stochastic gradient descent. Finally, for the regularization properties of dropout, the expectation of the dropout gradient is the gradient of the corresponding approximation ensemble, regularized by an adaptive weight decay term with a propensity for self-consistent variance minimization and sparse representations.

Entities:  

Keywords:  backpropagation; ensemble; geometric mean; machine learning; neural networks; regularization; sparse representations; stochastic gradient descent; stochastic neurons; variance minimization

Year:  2014        PMID: 24771879      PMCID: PMC3996711          DOI: 10.1016/j.artint.2014.02.004

Source DB:  PubMed          Journal:  Artif Intell        ISSN: 0004-3702            Impact factor:   9.088


  5 in total

1.  Enhanced MLP performance and fault tolerance resulting from synaptic weight noise during training.

Authors:  A F Murray; P J Edwards
Journal:  IEEE Trans Neural Netw       Date:  1994

2.  Learning in linear neural networks: a survey.

Authors:  P F Baldi; K Hornik
Journal:  IEEE Trans Neural Netw       Date:  1995

3.  Interaural time and intensity coding in superior olivary complex and inferior colliculus of the echolocating bat Molossus ater.

Authors:  G Harnischfeger; G Neuweiler; P Schlegel
Journal:  J Neurophysiol       Date:  1985-01       Impact factor: 2.714

4.  A circuit for detection of interaural time differences in the brain stem of the barn owl.

Authors:  C E Carr; M Konishi
Journal:  J Neurosci       Date:  1990-10       Impact factor: 6.167

5.  Axonal delay lines for time measurement in the owl's brainstem.

Authors:  C E Carr; M Konishi
Journal:  Proc Natl Acad Sci U S A       Date:  1988-11       Impact factor: 11.205

  5 in total
  25 in total

1.  Learning in the Machine: Random Backpropagation and the Deep Learning Channel.

Authors:  Pierre Baldi; Peter Sadowski; Zhiqin Lu
Journal:  Artif Intell       Date:  2018-04-03       Impact factor: 9.088

2.  Convolutional Neural Network for Segmentation and Measurement of Intima Media Thickness.

Authors:  Sudha S; Jayanthi K B; Rajasekaran C; Nirmala Madian; Sunder T
Journal:  J Med Syst       Date:  2018-07-09       Impact factor: 4.460

3.  SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity.

Authors:  Christophe N Magnan; Pierre Baldi
Journal:  Bioinformatics       Date:  2014-05-24       Impact factor: 6.937

4.  A multi-resolution approach for spinal metastasis detection using deep Siamese neural networks.

Authors:  Juan Wang; Zhiyuan Fang; Ning Lang; Huishu Yuan; Min-Ying Su; Pierre Baldi
Journal:  Comput Biol Med       Date:  2017-03-27       Impact factor: 4.589

5.  Deep Learning Localizes and Identifies Polyps in Real Time With 96% Accuracy in Screening Colonoscopy.

Authors:  Gregor Urban; Priyam Tripathi; Talal Alkayali; Mohit Mittal; Farid Jalali; William Karnes; Pierre Baldi
Journal:  Gastroenterology       Date:  2018-06-18       Impact factor: 22.682

6.  Development and Validation of a Deep Neural Network Model for Prediction of Postoperative In-hospital Mortality.

Authors:  Christine K Lee; Ira Hofer; Eilon Gabel; Pierre Baldi; Maxime Cannesson
Journal:  Anesthesiology       Date:  2018-10       Impact factor: 7.892

7.  Differentiation of spinal metastases originated from lung and other cancers using radiomics and deep learning based on DCE-MRI.

Authors:  Ning Lang; Yang Zhang; Enlong Zhang; Jiahui Zhang; Daniel Chow; Peter Chang; Hon J Yu; Huishu Yuan; Min-Ying Su
Journal:  Magn Reson Imaging       Date:  2019-02-28       Impact factor: 2.546

8.  Detecting Cardiovascular Disease from Mammograms With Deep Learning.

Authors:  Juan Wang; Huanjun Ding; Fatemeh Azamian Bidgoli; Brian Zhou; Carlos Iribarren; Sabee Molloi; Pierre Baldi
Journal:  IEEE Trans Med Imaging       Date:  2017-01-19       Impact factor: 10.048

9.  Inter-species prediction of protein phosphorylation in the sbv IMPROVER species translation challenge.

Authors:  Michael Biehl; Peter Sadowski; Gyan Bhanot; Erhan Bilal; Adel Dayarian; Pablo Meyer; Raquel Norel; Kahn Rhrissorrakrai; Michael D Zeller; Sahand Hormoz
Journal:  Bioinformatics       Date:  2014-07-03       Impact factor: 6.937

10.  Development of Predictive Models in Patients with Epiphora Using Lacrimal Scintigraphy and Machine Learning.

Authors:  Yong-Jin Park; Ji Hoon Bae; Mu Heon Shin; Seung Hyup Hyun; Young Seok Cho; Yearn Seong Choe; Joon Young Choi; Kyung-Han Lee; Byung-Tae Kim; Seung Hwan Moon
Journal:  Nucl Med Mol Imaging       Date:  2019-02-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.