Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Estimating dataset size requirements for classifying DNA microarray data.

Literature DB >> 12804087

Estimating dataset size requirements for classifying DNA microarray data.

Sayan Mukherjee¹, Pablo Tamayo, Simon Rogers, Ryan Rifkin, Anna Engle, Colin Campbell, Todd R Golub, Jill P Mesirov.

Abstract

A statistical methodology for estimating dataset size requirements for classifying microarray data using learning curves is introduced. The goal is to use existing classification results to estimate dataset size requirements for future classification experiments and to evaluate the gain in accuracy and significance of classifiers built with additional data. The method is based on fitting inverse power-law models to construct empirical learning curves. It also includes a permutation test procedure to assess the statistical significance of classification performance for a given dataset size. This procedure is applied to several molecular classification problems representing a broad spectrum of levels of complexity.

Entities: Disease

Mesh：

Year: 2003 PMID： 12804087 DOI： 10.1089/106652703321825928

Source DB: PubMed Journal: J Comput Biol ISSN： 1066-5277 Impact factor: 1.479

Keyword Cloud
Cited

58 in total

1. An empirical Bayes' approach to joint analysis of multiple microarray gene expression studies.

Authors: Lingyan Ruan; Ming Yuan
Journal: Biometrics Date: 2011-04-22 Impact factor: 2.571

2. Learning curves in classification with microarray data.

Authors: Kenneth R Hess; Caimiao Wei
Journal: Semin Oncol Date: 2010-02 Impact factor: 4.929

Review 3. Gene expression profiling of primary breast cancer.

Authors: Roman Rouzier; Peter Wagner; Paolo Morandi; Lajos Pusztai
Journal: Curr Oncol Rep Date: 2005-01 Impact factor: 5.075

4. Utilization of lymphoblastoid cell lines as a system for the molecular modeling of autism.

Authors: Colin A Baron; Stephenie Y Liu; Chindo Hicks; Jeffrey P Gregg
Journal: J Autism Dev Disord Date: 2006-11

5. A method for constructing a confidence bound for the actual error rate of a prediction rule in high dimensions.

Authors: Kevin K Dobbin
Journal: Biostatistics Date: 2008-11-27 Impact factor: 5.899

6. Development and Validation of Biomarker Classifiers for Treatment Selection.

Authors: Richard Simon
Journal: J Stat Plan Inference Date: 2008-02-01 Impact factor: 1.111

7. Sample size requirements for training high-dimensional risk predictors.

Authors: Kevin K Dobbin; Xiao Song
Journal: Biostatistics Date: 2013-07-19 Impact factor: 5.899

Review 8. Clinical uses of microarrays in cancer research.

Authors: Carl Virtanen; James Woodgett
Journal: Methods Mol Med Date: 2008

9. Machine learning-based receiver operating characteristic (ROC) curves for crisp and fuzzy classification of DNA microarrays in cancer research.

Authors: Leif E Peterson; Matthew A Coleman
Journal: Int J Approx Reason Date: 2008-01 Impact factor: 3.816

10. Modeling cancer progression via pathway dependencies.

Authors: Elena J Edelman; Justin Guinney; Jen-Tsan Chi; Phillip G Febbo; Sayan Mukherjee
Journal: PLoS Comput Biol Date: 2008-02 Impact factor: 4.475