Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Classifier performance prediction for computer-aided diagnosis using a limited dataset.

Literature DB >> 18491550

Classifier performance prediction for computer-aided diagnosis using a limited dataset.

Berkman Sahiner¹, Heang-Ping Chan, Lubomir Hadjiiski.

Abstract

In a practical classifier design problem, the true population is generally unknown and the available sample is finite-sized. A common approach is to use a resampling technique to estimate the performance of the classifier that will be trained with the available sample. We conducted a Monte Carlo simulation study to compare the ability of the different resampling techniques in training the classifier and predicting its performance under the constraint of a finite-sized sample. The true population for the two classes was assumed to be multivariate normal distributions with known covariance matrices. Finite sets of sample vectors were drawn from the population. The true performance of the classifier is defined as the area under the receiver operating characteristic curve (AUC) when the classifier designed with the specific sample is applied to the true population. We investigated methods based on the Fukunaga-Hayes and the leave-one-out techniques, as well as three different types of bootstrap methods, namely, the ordinary, 0.632, and 0.632+ bootstrap. The Fisher's linear discriminant analysis was used as the classifier. The dimensionality of the feature space was varied from 3 to 15. The sample size n2 from the positive class was varied between 25 and 60, while the number of cases from the negative class was either equal to n2 or 3n2. Each experiment was performed with an independent dataset randomly drawn from the true population. Using a total of 1000 experiments for each simulation condition, we compared the bias, the variance, and the root-mean-squared error (RMSE) of the AUC estimated using the different resampling techniques relative to the true AUC (obtained from training on a finite dataset and testing on the population). Our results indicated that, under the study conditions, there can be a large difference in the RMSE obtained using different resampling methods, especially when the feature space dimensionality is relatively large and the sample size is small. Under this type of conditions, the 0.632 and 0.632+ bootstrap methods have the lowest RMSE, indicating that the difference between the estimated and the true performances obtained using the 0.632 and 0.632+ bootstrap will be statistically smaller than those obtained using the other three resampling methods. Of the three bootstrap methods, the 0.632+ bootstrap provides the lowest bias. Although this investigation is performed under some specific conditions, it reveals important trends for the problem of classifier performance prediction under the constraint of a limited dataset.

Mesh：

Year: 2008 PMID： 18491550 PMCID： PMC2811557 DOI： 10.1118/1.2868757

Source DB: PubMed Journal: Med Phys ISSN： 0094-2405 Impact factor: 4.071

7 in total

1. Validation procedures in radiologic diagnostic models. Neural network and logistic regression.

Authors: E Arana; P Delicado; L Martí-Bonmatí
Journal: Invest Radiol Date: 1999-10 Impact factor: 6.016

2. Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.

Authors: H P Chan; B Sahiner; R F Wagner; N Petrick
Journal: Med Phys Date: 1999-12 Impact factor: 4.071

3. Internal validation of predictive models: efficiency of some procedures for logistic regression analysis.

Authors: E W Steyerberg; F E Harrell; G J Borsboom; M J Eijkemans; Y Vergouwe; J D Habbema
Journal: J Clin Epidemiol Date: 2001-08 Impact factor: 6.437

4. Maximum likelihood estimation of receiver operating characteristic (ROC) curves from continuously-distributed data.

Authors: C E Metz; B A Herman; J H Shen
Journal: Stat Med Date: 1998-05-15 Impact factor: 2.373

5. A comparison of parametric and nonparametric approaches to ROC analysis of quantitative diagnostic tests.

Authors: K O Hajian-Tilaki; J A Hanley; L Joseph; J P Collet
Journal: Med Decis Making Date: 1997 Jan-Mar Impact factor: 2.583

6. The effect of data sampling on the performance evaluation of artificial neural networks in medical diagnosis.

Authors: G D Tourassi; C E Floyd
Journal: Med Decis Making Date: 1997 Apr-Jun Impact factor: 2.583

7. An almost unbiased method of obtaining confidence intervals for the probability of misclassification in discriminant analysis.

Authors: P A Lachenbruch
Journal: Biometrics Date: 1967-12 Impact factor: 2.571

7 in total

37 in total

1. Exploring nonlinear feature space dimension reduction and data representation in breast Cadx with Laplacian eigenmaps and t-SNE.

Authors: Andrew R Jamieson; Maryellen L Giger; Karen Drukker; Hui Li; Yading Yuan; Neha Bhooshan
Journal: Med Phys Date: 2010-01 Impact factor: 4.071

2. Enhancement of breast CADx with unlabeled data.

Authors: Andrew R Jamieson; Maryellen L Giger; Karen Drukker; Lorenzo L Pesce
Journal: Med Phys Date: 2010-08 Impact factor: 4.071

Review 3. Anniversary paper: History and status of CAD and quantitative image analysis: the role of Medical Physics and AAPM.

Authors: Maryellen L Giger; Heang-Ping Chan; John Boone
Journal: Med Phys Date: 2008-12 Impact factor: 4.071

4. Automated regional registration and characterization of corresponding microcalcification clusters on temporal pairs of mammograms for interval change analysis.

Authors: Peter Filev; Lubomir Hadjiiski; Heang-Ping Chan; Berkman Sahiner; Jun Ge; Mark A Helvie; Marilyn Roubidoux; Chuan Zhou
Journal: Med Phys Date: 2008-12 Impact factor: 4.071

5. Estimation of channelized hotelling observer performance with known class means or known difference of class means.

Authors: Adam Wunderlich; Frédéric Noo
Journal: IEEE Trans Med Imaging Date: 2009-01-19 Impact factor: 10.048

6. Machine Learning in Computer-aided Diagnosis of the Thorax and Colon in CT: A Survey.

Authors: Kenji Suzuki
Journal: IEICE Trans Inf Syst Date: 2013-04-01

7. Development and evaluation of a computer-aided diagnostic scheme for lung nodule detection in chest radiographs by means of two-stage nodule enhancement with support vector classification.

Authors: Sheng Chen; Kenji Suzuki; Heber MacMahon
Journal: Med Phys Date: 2011-04 Impact factor: 4.071

8. CT colonography: advanced computer-aided detection scheme utilizing MTANNs for detection of "missed" polyps in a multicenter clinical trial.

Authors: Kenji Suzuki; Don C Rockey; Abraham H Dachman
Journal: Med Phys Date: 2010-01 Impact factor: 4.071

9. Dynamic multiple thresholding breast boundary detection algorithm for mammograms.

Authors: Yi-Ta Wu; Chuan Zhou; Heang-Ping Chan; Chintana Paramagul; Lubomir M Hadjiiski; Caroline Plowden Daly; Julie A Douglas; Yiheng Zhang; Berkman Sahiner; Jiazheng Shi; Jun Wei
Journal: Med Phys Date: 2010-01 Impact factor: 4.071

10. A review of computer-aided diagnosis in thoracic and colonic imaging.

Authors: Kenji Suzuki
Journal: Quant Imaging Med Surg Date: 2012-09