Literature DB >> 10619251

Classifier design for computer-aided diagnosis: effects of finite sample size on the mean performance of classical and neural network classifiers.

H P Chan1, B Sahiner, R F Wagner, N Petrick.   

Abstract

Classifier design is one of the key steps in the development of computer-aided diagnosis (CAD) algorithms. A classifier is designed with case samples drawn from the patient population. Generally, the sample size available for classifier design is limited, which introduces variance and bias into the performance of the trained classifier, relative to that obtained with an infinite sample size. For CAD applications, a commonly used performance index for a classifier is the area, Az, under the receiver operating characteristic (ROC) curve. We have conducted a computer simulation study to investigate the dependence of the mean performance, in terms of Az, on design sample size for a linear discriminant and two nonlinear classifiers, the quadratic discriminant and the backpropagation neural network (ANN). The performances of the classifiers were compared for four types of class distributions that have specific properties: multivariate normal distributions with equal covariance matrices and unequal means, unequal covariance matrices and unequal means, and unequal covariance matrices and equal means, and a feature space where the two classes were uniformly distributed in disjoint checkerboard regions. We evaluated the performances of the classifiers in feature spaces of dimensionality ranging from 3 to 15, and design sample sizes from 20 to 800 per class. The dependence of the resubstitution and hold-out performance on design (training) sample size (Nt) was investigated. For multivariate normal class distributions with equal covariance matrices, the linear discriminant is the optimal classifier. It was found that its Az-versus-1/Nt curves can be closely approximated by linear dependences over the range of sample sizes studied. In the feature spaces with unequal covariance matrices where the quadratic discriminant is optimal, the linear discriminant is inferior to the quadratic discriminant or the ANN when the design sample size is large. However, when the design sample is small, a relatively simple classifier, such as the linear discriminant or an ANN with very few hidden nodes, may be preferred because performance bias increases with the complexity of the classifier. In the regime where the classifier performance is dominated by the 1/Nt term, the performance in the limit of infinite sample size can be estimated as the intercept (1/Nt= 0) of a linear regression of Az versus 1/Nt. The understanding of the performance of the classifiers under the constraint of a finite design sample size is expected to facilitate the selection of a proper classifier for a given classification task and the design of an efficient resampling scheme.

Entities:  

Mesh:

Year:  1999        PMID: 10619251     DOI: 10.1118/1.598805

Source DB:  PubMed          Journal:  Med Phys        ISSN: 0094-2405            Impact factor:   4.071


  48 in total

1.  Feature selection and classifier performance in computer-aided diagnosis: the effect of finite sample size.

Authors:  B Sahiner; H P Chan; N Petrick; R F Wagner; L Hadjiiski
Journal:  Med Phys       Date:  2000-07       Impact factor: 4.071

2.  Characterization of masses in digital breast tomosynthesis: comparison of machine learning in projection views and reconstructed slices.

Authors:  Heang-Ping Chan; Yi-Ta Wu; Berkman Sahiner; Jun Wei; Mark A Helvie; Yiheng Zhang; Richard H Moore; Daniel B Kopans; Lubomir Hadjiiski; Ted Way
Journal:  Med Phys       Date:  2010-07       Impact factor: 4.071

3.  Evaluating imaging and computer-aided detection and diagnosis devices at the FDA.

Authors:  Brandon D Gallas; Heang-Ping Chan; Carl J D'Orsi; Lori E Dodd; Maryellen L Giger; David Gur; Elizabeth A Krupinski; Charles E Metz; Kyle J Myers; Nancy A Obuchowski; Berkman Sahiner; Alicia Y Toledano; Margarita L Zuley
Journal:  Acad Radiol       Date:  2012-02-03       Impact factor: 3.173

4.  Pioneers in Medical Imaging: Honoring the Memory of Robert F. Wagner.

Authors:  Kyle J Myers; Weijie Chen
Journal:  J Med Imaging (Bellingham)       Date:  2014-10

5.  Evaluation of data augmentation via synthetic images for improved breast mass detection on mammograms using deep learning.

Authors:  Kenny H Cha; Nicholas Petrick; Aria Pezeshk; Christian G Graff; Diksha Sharma; Andreu Badal; Berkman Sahiner
Journal:  J Med Imaging (Bellingham)       Date:  2019-11-22

6.  Potential of computer-aided diagnosis of high spectral and spatial resolution (HiSS) MRI in the classification of breast lesions.

Authors:  Neha Bhooshan; Maryellen Giger; Milica Medved; Hui Li; Abbie Wood; Yading Yuan; Li Lan; Angelica Marquez; Greg Karczmar; Gillian Newstead
Journal:  J Magn Reson Imaging       Date:  2013-09-10       Impact factor: 4.813

7.  Computer-aided diagnosis for improved detection of lung nodules by use of posterior-anterior and lateral chest radiographs.

Authors:  Junji Shiraishi; Feng Li; Kunio Doi
Journal:  Acad Radiol       Date:  2007-01       Impact factor: 3.173

8.  Reliable evaluation of performance level for computer-aided diagnostic scheme.

Authors:  Qiang Li
Journal:  Acad Radiol       Date:  2007-08       Impact factor: 3.173

9.  Computer-aided detection systems for breast masses: comparison of performances on full-field digital mammograms and digitized screen-film mammograms.

Authors:  Jun Wei; Lubomir M Hadjiiski; Berkman Sahiner; Heang-Ping Chan; Jun Ge; Marilyn A Roubidoux; Mark A Helvie; Chuan Zhou; Yi-Ta Wu; Chintana Paramagul; Yiheng Zhang
Journal:  Acad Radiol       Date:  2007-06       Impact factor: 3.173

10.  A review of computer-aided diagnosis in thoracic and colonic imaging.

Authors:  Kenji Suzuki
Journal:  Quant Imaging Med Surg       Date:  2012-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.