Literature DB >> 18491550

Classifier performance prediction for computer-aided diagnosis using a limited dataset.

Berkman Sahiner1, Heang-Ping Chan, Lubomir Hadjiiski.   

Abstract

In a practical classifier design problem, the true population is generally unknown and the available sample is finite-sized. A common approach is to use a resampling technique to estimate the performance of the classifier that will be trained with the available sample. We conducted a Monte Carlo simulation study to compare the ability of the different resampling techniques in training the classifier and predicting its performance under the constraint of a finite-sized sample. The true population for the two classes was assumed to be multivariate normal distributions with known covariance matrices. Finite sets of sample vectors were drawn from the population. The true performance of the classifier is defined as the area under the receiver operating characteristic curve (AUC) when the classifier designed with the specific sample is applied to the true population. We investigated methods based on the Fukunaga-Hayes and the leave-one-out techniques, as well as three different types of bootstrap methods, namely, the ordinary, 0.632, and 0.632+ bootstrap. The Fisher's linear discriminant analysis was used as the classifier. The dimensionality of the feature space was varied from 3 to 15. The sample size n2 from the positive class was varied between 25 and 60, while the number of cases from the negative class was either equal to n2 or 3n2. Each experiment was performed with an independent dataset randomly drawn from the true population. Using a total of 1000 experiments for each simulation condition, we compared the bias, the variance, and the root-mean-squared error (RMSE) of the AUC estimated using the different resampling techniques relative to the true AUC (obtained from training on a finite dataset and testing on the population). Our results indicated that, under the study conditions, there can be a large difference in the RMSE obtained using different resampling methods, especially when the feature space dimensionality is relatively large and the sample size is small. Under this type of conditions, the 0.632 and 0.632+ bootstrap methods have the lowest RMSE, indicating that the difference between the estimated and the true performances obtained using the 0.632 and 0.632+ bootstrap will be statistically smaller than those obtained using the other three resampling methods. Of the three bootstrap methods, the 0.632+ bootstrap provides the lowest bias. Although this investigation is performed under some specific conditions, it reveals important trends for the problem of classifier performance prediction under the constraint of a limited dataset.

Mesh:

Year:  2008        PMID: 18491550      PMCID: PMC2811557          DOI: 10.1118/1.2868757

Source DB:  PubMed          Journal:  Med Phys        ISSN: 0094-2405            Impact factor:   4.071


  7 in total

1.  Validation procedures in radiologic diagnostic models. Neural network and logistic regression.

Authors:  E Arana; P Delicado; L Martí-Bonmatí
Journal:  Invest Radiol       Date:  1999-10       Impact factor: 6.016

2.  Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.

Authors:  H P Chan; B Sahiner; R F Wagner; N Petrick
Journal:  Med Phys       Date:  1999-12       Impact factor: 4.071

3.  Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.

Authors:  E W Steyerberg; F E Harrell; G J Borsboom; M J Eijkemans; Y Vergouwe; J D Habbema
Journal:  J Clin Epidemiol       Date:  2001-08       Impact factor: 6.437

4.  Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data.

Authors:  C E Metz; B A Herman; J H Shen
Journal:  Stat Med       Date:  1998-05-15       Impact factor: 2.373

5.  A comparison of parametric and nonparametric approaches to ROC analysis of quantitative diagnostic tests.

Authors:  K O Hajian-Tilaki; J A Hanley; L Joseph; J P Collet
Journal:  Med Decis Making       Date:  1997 Jan-Mar       Impact factor: 2.583

6.  The effect of data sampling on the performance evaluation of artificial neural networks in medical diagnosis.

Authors:  G D Tourassi; C E Floyd
Journal:  Med Decis Making       Date:  1997 Apr-Jun       Impact factor: 2.583

7.  An almost unbiased method of obtaining confidence intervals for the probability of misclassification in discriminant analysis.

Authors:  P A Lachenbruch
Journal:  Biometrics       Date:  1967-12       Impact factor: 2.571

  7 in total
  37 in total

1.  Exploring nonlinear feature space dimension reduction and data representation in breast Cadx with Laplacian eigenmaps and t-SNE.

Authors:  Andrew R Jamieson; Maryellen L Giger; Karen Drukker; Hui Li; Yading Yuan; Neha Bhooshan
Journal:  Med Phys       Date:  2010-01       Impact factor: 4.071

2.  Enhancement of breast CADx with unlabeled data.

Authors:  Andrew R Jamieson; Maryellen L Giger; Karen Drukker; Lorenzo L Pesce
Journal:  Med Phys       Date:  2010-08       Impact factor: 4.071

Review 3.  Anniversary paper: History and status of CAD and quantitative image analysis: the role of Medical Physics and AAPM.

Authors:  Maryellen L Giger; Heang-Ping Chan; John Boone
Journal:  Med Phys       Date:  2008-12       Impact factor: 4.071

4.  Automated regional registration and characterization of corresponding microcalcification clusters on temporal pairs of mammograms for interval change analysis.

Authors:  Peter Filev; Lubomir Hadjiiski; Heang-Ping Chan; Berkman Sahiner; Jun Ge; Mark A Helvie; Marilyn Roubidoux; Chuan Zhou
Journal:  Med Phys       Date:  2008-12       Impact factor: 4.071

5.  Estimation of channelized hotelling observer performance with known class means or known difference of class means.

Authors:  Adam Wunderlich; Frédéric Noo
Journal:  IEEE Trans Med Imaging       Date:  2009-01-19       Impact factor: 10.048

6.  Machine Learning in Computer-aided Diagnosis of the Thorax and Colon in CT: A Survey.

Authors:  Kenji Suzuki
Journal:  IEICE Trans Inf Syst       Date:  2013-04-01

7.  Development and evaluation of a computer-aided diagnostic scheme for lung nodule detection in chest radiographs by means of two-stage nodule enhancement with support vector classification.

Authors:  Sheng Chen; Kenji Suzuki; Heber MacMahon
Journal:  Med Phys       Date:  2011-04       Impact factor: 4.071

8.  CT colonography: advanced computer-aided detection scheme utilizing MTANNs for detection of "missed" polyps in a multicenter clinical trial.

Authors:  Kenji Suzuki; Don C Rockey; Abraham H Dachman
Journal:  Med Phys       Date:  2010-01       Impact factor: 4.071

9.  Dynamic multiple thresholding breast boundary detection algorithm for mammograms.

Authors:  Yi-Ta Wu; Chuan Zhou; Heang-Ping Chan; Chintana Paramagul; Lubomir M Hadjiiski; Caroline Plowden Daly; Julie A Douglas; Yiheng Zhang; Berkman Sahiner; Jiazheng Shi; Jun Wei
Journal:  Med Phys       Date:  2010-01       Impact factor: 4.071

10.  A review of computer-aided diagnosis in thoracic and colonic imaging.

Authors:  Kenji Suzuki
Journal:  Quant Imaging Med Surg       Date:  2012-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.