Literature DB >> 16613833

Sample size planning for developing classifiers using high-dimensional DNA microarray data.

Kevin K Dobbin1, Richard M Simon.   

Abstract

Many gene expression studies attempt to develop a predictor of pre-defined diagnostic or prognostic classes. If the classes are similar biologically, then the number of genes that are differentially expressed between the classes is likely to be small compared to the total number of genes measured. This motivates a two-step process for predictor development, a subset of differentially expressed genes is selected for use in the predictor and then the predictor constructed from these. Both these steps will introduce variability into the resulting classifier, so both must be incorporated in sample size estimation. We introduce a methodology for sample size determination for prediction in the context of high-dimensional data that captures variability in both steps of predictor development. The methodology is based on a parametric probability model, but permits sample size computations to be carried out in a practical manner without extensive requirements for preliminary data. We find that many prediction problems do not require a large training set of arrays for classifier development.

Mesh:

Year:  2006        PMID: 16613833     DOI: 10.1093/biostatistics/kxj036

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  41 in total

1.  Improving the quality of biomarker discovery research: the right samples and enough of them.

Authors:  Margaret S Pepe; Christopher I Li; Ziding Feng
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2015-04-02       Impact factor: 4.254

Review 2.  Microarray-based expression profiling and informatics.

Authors:  Richard Simon
Journal:  Curr Opin Biotechnol       Date:  2007-11-28       Impact factor: 9.740

3.  A method for constructing a confidence bound for the actual error rate of a prediction rule in high dimensions.

Authors:  Kevin K Dobbin
Journal:  Biostatistics       Date:  2008-11-27       Impact factor: 5.899

4.  Using cross-validation to evaluate predictive accuracy of survival risk classifiers based on high-dimensional data.

Authors:  Richard M Simon; Jyothi Subramanian; Ming-Chung Li; Supriya Menezes
Journal:  Brief Bioinform       Date:  2011-02-15       Impact factor: 11.622

Review 5.  Using microarrays to study the microenvironment in tumor biology: the crucial role of statistics.

Authors:  Stuart G Baker; Barnett S Kramer
Journal:  Semin Cancer Biol       Date:  2008-03-26       Impact factor: 15.707

6.  HOX expression patterns identify a common signature for favorable AML.

Authors:  M Andreeff; V Ruvolo; S Gadgil; C Zeng; K Coombes; W Chen; S Kornblau; A E Barón; H A Drabkin
Journal:  Leukemia       Date:  2008-07-31       Impact factor: 11.528

7.  Preferred analysis methods for Affymetrix GeneChips. II. An expanded, balanced, wholly-defined spike-in dataset.

Authors:  Qianqian Zhu; Jeffrey C Miecznikowski; Marc S Halfon
Journal:  BMC Bioinformatics       Date:  2010-05-27       Impact factor: 3.169

8.  Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms.

Authors:  Yu Guo; Armin Graber; Robert N McBurney; Raji Balasubramanian
Journal:  BMC Bioinformatics       Date:  2010-09-03       Impact factor: 3.169

9.  Factors influencing the statistical power of complex data analysis protocols for molecular signature development from microarray data.

Authors:  Constantin F Aliferis; Alexander Statnikov; Ioannis Tsamardinos; Jonathan S Schildcrout; Bryan E Shepherd; Frank E Harrell
Journal:  PLoS One       Date:  2009-03-17       Impact factor: 3.240

10.  Emerging concepts in biomarker discovery; the US-Japan Workshop on Immunological Molecular Markers in Oncology.

Authors:  Hideaki Tahara; Marimo Sato; Magdalena Thurin; Ena Wang; Lisa H Butterfield; Mary L Disis; Bernard A Fox; Peter P Lee; Samir N Khleif; Jon M Wigginton; Stefan Ambs; Yasunori Akutsu; Damien Chaussabel; Yuichiro Doki; Oleg Eremin; Wolf Hervé Fridman; Yoshihiko Hirohashi; Kohzoh Imai; James Jacobson; Masahisa Jinushi; Akira Kanamoto; Mohammed Kashani-Sabet; Kazunori Kato; Yutaka Kawakami; John M Kirkwood; Thomas O Kleen; Paul V Lehmann; Lance Liotta; Michael T Lotze; Michele Maio; Anatoli Malyguine; Giuseppe Masucci; Hisahiro Matsubara; Shawmarie Mayrand-Chung; Kiminori Nakamura; Hiroyoshi Nishikawa; A Karolina Palucka; Emanuel F Petricoin; Zoltan Pos; Antoni Ribas; Licia Rivoltini; Noriyuki Sato; Hiroshi Shiku; Craig L Slingluff; Howard Streicher; David F Stroncek; Hiroya Takeuchi; Minoru Toyota; Hisashi Wada; Xifeng Wu; Julia Wulfkuhle; Tomonori Yaguchi; Benjamin Zeskind; Yingdong Zhao; Mai-Britt Zocca; Francesco M Marincola
Journal:  J Transl Med       Date:  2009-06-17       Impact factor: 5.531

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.