Literature DB >> 18234468

Classifier performance estimation under the constraint of a finite sample size: resampling schemes applied to neural network classifiers.

Berkman Sahiner1, Heang-Ping Chan, Lubomir Hadjiiski.   

Abstract

In a practical classifier design problem the sample size is limited, and the available finite sample needs to be used both to design a classifier and to predict the classifier's performance for the true population. Since a larger sample is more representative of the population, it is advantageous to design the classifier with all the available cases, and to use a resampling technique for performance prediction. We conducted a Monte Carlo simulation study to compare the ability of different resampling techniques in predicting the performance of a neural network (NN) classifier designed with the available sample. We used the area under the receiver operating characteristic curve as the performance index for the NN classifier. We investigated resampling techniques based on the cross-validation, the leave-one-out method, and three different types of bootstrapping, namely, the ordinary, .632, and .632+ bootstrap. Our results indicated that, under the study conditions, there can be a large difference in the accuracy of the prediction obtained from different resampling methods, especially when the feature space dimensionality is relatively large and the sample size is small. Although this investigation is performed under some specific conditions, it reveals important trends for the problem of classifier performance prediction under the constraint of a limited data set.

Mesh:

Year:  2007        PMID: 18234468      PMCID: PMC2729493          DOI: 10.1016/j.neunet.2007.12.012

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  3 in total

1.  Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.

Authors:  H P Chan; B Sahiner; R F Wagner; N Petrick
Journal:  Med Phys       Date:  1999-12       Impact factor: 4.071

2.  Classifier performance estimation under the constraint of a finite sample size: resampling schemes applied to neural network classifiers.

Authors:  Berkman Sahiner; Heang-Ping Chan; Lubomir Hadjiiski
Journal:  Neural Netw       Date:  2007-12-17

3.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data.

Authors:  C E Metz; B A Herman; J H Shen
Journal:  Stat Med       Date:  1998-05-15       Impact factor: 2.373

  3 in total
  9 in total

1.  Classifier performance estimation under the constraint of a finite sample size: resampling schemes applied to neural network classifiers.

Authors:  Berkman Sahiner; Heang-Ping Chan; Lubomir Hadjiiski
Journal:  Neural Netw       Date:  2007-12-17

2.  Noise injection for training artificial neural networks: a comparison with weight decay and early stopping.

Authors:  Richard M Zur; Yulei Jiang; Lorenzo L Pesce; Karen Drukker
Journal:  Med Phys       Date:  2009-10       Impact factor: 4.071

3.  Development and evaluation of a computer-aided diagnostic scheme for lung nodule detection in chest radiographs by means of two-stage nodule enhancement with support vector classification.

Authors:  Sheng Chen; Kenji Suzuki; Heber MacMahon
Journal:  Med Phys       Date:  2011-04       Impact factor: 4.071

4.  CT colonography: advanced computer-aided detection scheme utilizing MTANNs for detection of "missed" polyps in a multicenter clinical trial.

Authors:  Kenji Suzuki; Don C Rockey; Abraham H Dachman
Journal:  Med Phys       Date:  2010-01       Impact factor: 4.071

5.  Effect of finite sample size on feature selection and classification: a simulation study.

Authors:  Ted W Way; Berkman Sahiner; Lubomir M Hadjiiski; Heang-Ping Chan
Journal:  Med Phys       Date:  2010-02       Impact factor: 4.071

6.  Predicting 2-y outcome in preterm infants using early multimodal physiological monitoring.

Authors:  Rhodri O Lloyd; John M O'Toole; Vicki Livingstone; William D Hutch; Elena Pavlidis; Anne-Marie Cronin; Eugene M Dempsey; Peter M Filan; Geraldine B Boylan
Journal:  Pediatr Res       Date:  2016-04-18       Impact factor: 3.756

7.  Massive-training support vector regression and Gaussian process for false-positive reduction in computer-aided detection of polyps in CT colonography.

Authors:  Jian-Wu Xu; Kenji Suzuki
Journal:  Med Phys       Date:  2011-04       Impact factor: 4.071

8.  Comparison of noninvasive markers of fibrosis in patients with nonalcoholic fatty liver disease.

Authors:  Amy G Shah; Alison Lydecker; Karen Murray; Brent N Tetri; Melissa J Contos; Arun J Sanyal
Journal:  Clin Gastroenterol Hepatol       Date:  2009-06-10       Impact factor: 11.382

9.  Docking-generated multiple ligand poses for bootstrapping bioactivity classifying Machine Learning: Repurposing covalent inhibitors for COVID-19-related TMPRSS2 as case study.

Authors:  Ma'mon M Hatmal; Omar Abuyaman; Mutasem Taha
Journal:  Comput Struct Biotechnol J       Date:  2021-08-19       Impact factor: 7.271

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.