Literature DB >> 19928111

Noise injection for training artificial neural networks: a comparison with weight decay and early stopping.

Richard M Zur1, Yulei Jiang, Lorenzo L Pesce, Karen Drukker.   

Abstract

The purpose of this study was to investigate the effect of a noise injection method on the "overfitting" problem of artificial neural networks (ANNs) in two-class classification tasks. The authors compared ANNs trained with noise injection to ANNs trained with two other methods for avoiding overfitting: weight decay and early stopping. They also evaluated an automatic algorithm for selecting the magnitude of the noise injection. They performed simulation studies of an exclusive-or classification task with training datasets of 50, 100, and 200 cases (half normal and half abnormal) and an independent testing dataset of 2000 cases. They also compared the methods using a breast ultrasound dataset of 1126 cases. For simulated training datasets of 50 cases, the area under the receiver operating characteristic curve (AUC) was greater (by 0.03) when training with noise injection than when training without any regularization, and the improvement was greater than those from weight decay and early stopping (both of 0.02). For training datasets of 100 cases, noise injection and weight decay yielded similar increases in the AUC (0.02), whereas early stopping produced a smaller increase (0.01). For training datasets of 200 cases, the increases in the AUC were negligibly small for all methods (0.005). For the ultrasound dataset, noise injection had a greater average AUC than ANNs trained without regularization and a slightly greater average AUC than ANNs trained with weight decay. These results indicate that training ANNs with noise injection can reduce overfitting to a greater degree than early stopping and to a similar degree as weight decay.

Mesh:

Year:  2009        PMID: 19928111      PMCID: PMC2771718          DOI: 10.1118/1.3213517

Source DB:  PubMed          Journal:  Med Phys        ISSN: 0094-2405            Impact factor:   4.071


  14 in total

1.  Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.

Authors:  H P Chan; B Sahiner; R F Wagner; N Petrick
Journal:  Med Phys       Date:  1999-12       Impact factor: 4.071

2.  "Proper" Binormal ROC Curves: Theory and Maximum-Likelihood Estimation.

Authors: 
Journal:  J Math Psychol       Date:  1999-03       Impact factor: 2.223

3.  Ideal observer approximation using Bayesian classification neural networks.

Authors:  M A Kupinski; D C Edwards; M L Giger; C E Metz
Journal:  IEEE Trans Med Imaging       Date:  2001-09       Impact factor: 10.048

4.  Uncertainty in the output of artificial neural networks.

Authors:  Yulei Jiang
Journal:  IEEE Trans Med Imaging       Date:  2003-07       Impact factor: 10.048

5.  Using additive noise in back-propagation training.

Authors:  L Holmstrom; P Koistinen
Journal:  IEEE Trans Neural Netw       Date:  1992

6.  Classifier performance estimation under the constraint of a finite sample size: resampling schemes applied to neural network classifiers.

Authors:  Berkman Sahiner; Heang-Ping Chan; Lubomir Hadjiiski
Journal:  Neural Netw       Date:  2007-12-17

7.  Experimental design and data analysis in receiver operating characteristic studies: lessons learned from reports in radiology from 1997 to 2006.

Authors:  Junji Shiraishi; Lorenzo L Pesce; Charles E Metz; Kunio Doi
Journal:  Radiology       Date:  2009-10-28       Impact factor: 11.105

Review 8.  ROC methodology in radiologic imaging.

Authors:  C E Metz
Journal:  Invest Radiol       Date:  1986-09       Impact factor: 6.016

9.  Simulation studies of data classification by artificial neural networks: potential applications in medical imaging and decision making.

Authors:  Y Wu; K Doi; C E Metz; N Asada; M L Giger
Journal:  J Digit Imaging       Date:  1993-05       Impact factor: 4.056

10.  Breast US computer-aided diagnosis workstation: performance with a large clinical diagnostic population.

Authors:  Karen Drukker; Nicholas P Gruszauskas; Charlene A Sennett; Maryellen L Giger
Journal:  Radiology       Date:  2008-06-23       Impact factor: 11.105

View more
  10 in total

1.  QSAR modeling without descriptors using graph convolutional neural networks: the case of mutagenicity prediction.

Authors:  Chiakang Hung; Giuseppina Gini
Journal:  Mol Divers       Date:  2021-06-19       Impact factor: 2.943

2.  The Presence of Background Noise Extends the Competitor Space in Native and Non-Native Spoken-Word Recognition: Insights from Computational Modeling.

Authors:  Themis Karaminis; Florian Hintz; Odette Scharenborg
Journal:  Cogn Sci       Date:  2022-02

3.  Systems Biology Approach to Model the Life Cycle of Trypanosoma cruzi.

Authors:  Alejandra Carrea; Luis Diambra
Journal:  PLoS One       Date:  2016-01-11       Impact factor: 3.240

4.  Just-in-Time Correntropy Soft Sensor with Noisy Data for Industrial Silicon Content Prediction.

Authors:  Kun Chen; Yu Liang; Zengliang Gao; Yi Liu
Journal:  Sensors (Basel)       Date:  2017-08-08       Impact factor: 3.576

5.  Information-rich localization microscopy through machine learning.

Authors:  Taehwan Kim; Seonah Moon; Ke Xu
Journal:  Nat Commun       Date:  2019-04-30       Impact factor: 14.919

6.  Robust Understanding of Robot-Directed Speech Commands Using Sequence to Sequence With Noise Injection.

Authors:  Yuuki Tada; Yoshinobu Hagiwara; Hiroki Tanaka; Tadahiro Taniguchi
Journal:  Front Robot AI       Date:  2020-01-14

7.  An end-to-end AI-based framework for automated discovery of rapid CEST/MT MRI acquisition protocols and molecular parameter quantification (AutoCEST).

Authors:  Or Perlman; Bo Zhu; Moritz Zaiss; Matthew S Rosen; Christian T Farrar
Journal:  Magn Reson Med       Date:  2022-01-28       Impact factor: 3.737

Review 8.  Machine and deep learning methods for radiomics.

Authors:  Michele Avanzo; Lise Wei; Joseph Stancanello; Martin Vallières; Arvind Rao; Olivier Morin; Sarah A Mattonen; Issam El Naqa
Journal:  Med Phys       Date:  2020-06       Impact factor: 4.071

9.  Discerning apical and basolateral properties of HT-29/B6 and IPEC-J2 cell layers by impedance spectroscopy, mathematical modeling and machine learning.

Authors:  Thomas Schmid; Martin Bogdan; Dorothee Günzel
Journal:  PLoS One       Date:  2013-07-01       Impact factor: 3.240

10.  Deep learning with robustness to missing data: A novel approach to the detection of COVID-19.

Authors:  Erdi Çallı; Keelin Murphy; Steef Kurstjens; Tijs Samson; Robert Herpers; Henk Smits; Matthieu Rutten; Bram van Ginneken
Journal:  PLoS One       Date:  2021-07-30       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.