Literature DB >> 15579240

Sample size for identifying differentially expressed genes in microarray experiments.

Sue-Jane Wang1, James J Chen.   

Abstract

Microarray technology allows simultaneous comparison of expression levels of thousands of genes under each condition. This paper concerns sample size calculation in the identification of differentially expressed genes between a control and a treated sample. In a typical experiment, only a fraction of genes (altered genes) is expected to be differentially expressed between two samples. Sample size determination depends on a number of factors including the specified significance level (alpha), the desired statistical power (1-beta), the fraction (eta) of truly altered genes out of the total g genes studied, and the effect sizes (Delta) for the altered genes. This paper proposes a method to calculate the number of arrays required to detect at least 100lambda % (where 0 < lambda < or = 1) of the truly altered genes under the model of an equal effect size for all altered genes. The required numbers of arrays are tabulated for various values of alpha, beta, Delta, eta, and lambda for the one-sample and two-sample t-tests for g = 10,000. Based on the proposed approach, to identify up to 90% of truly altered genes among the unknown number of truly altered genes, the estimated numbers of arrays needed appear to be manageable. For instance, when the standardized effect size is at least 2.0, the number of arrays needed is less than or equal to 14 for the two-sample t-test and is less than or equal to 10 for the one-sample t-test. As the cost per array declines, such array numbers become practical. The proposed method offers a simple, intuitive, and practical way to determine the number of arrays needed in microarray experiments in which the true correlation structure among the genes under investigation cannot be reasonably assumed. An example dataset is used to illustrate the use of the proposed approach to plan microarray experiments.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15579240     DOI: 10.1089/cmb.2004.11.714

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  12 in total

1.  A rapid genome-scale response of the transcriptional oscillator to perturbation reveals a period-doubling path to phenotypic change.

Authors:  Caroline M Li; Robert R Klevecz
Journal:  Proc Natl Acad Sci U S A       Date:  2006-10-16       Impact factor: 11.205

2.  Gene expression profiling in the rhesus macaque: methodology, annotation and data interpretation.

Authors:  Nigel C Noriega; Steven G Kohama; Henryk F Urbanski
Journal:  Methods       Date:  2009-05-23       Impact factor: 3.608

3.  A censored beta mixture model for the estimation of the proportion of non-differentially expressed genes.

Authors:  Anastasios Markitsis; Yinglei Lai
Journal:  Bioinformatics       Date:  2010-01-15       Impact factor: 6.937

4.  Power and sample size calculation for microarray studies.

Authors:  Sin-Ho Jung; S Stanley Young
Journal:  J Biopharm Stat       Date:  2012       Impact factor: 1.051

5.  Machine learning-based receiver operating characteristic (ROC) curves for crisp and fuzzy classification of DNA microarrays in cancer research.

Authors:  Leif E Peterson; Matthew A Coleman
Journal:  Int J Approx Reason       Date:  2008-01       Impact factor: 3.816

6.  A multicentre phase II gene expression profiling study of putative relationships between tumour biomarkers and clinical response with erlotinib in non-small-cell lung cancer.

Authors:  E-H Tan; R Ramlau; A Pluzanska; H-P Kuo; M Reck; J Milanowski; J S-K Au; E Felip; P-C Yang; D Damyanov; S Orlov; M Akimov; P Delmar; L Essioux; C Hillenbach; B Klughammer; P McLoughlin; J Baselga
Journal:  Ann Oncol       Date:  2010-02       Impact factor: 32.976

7.  Sample size and statistical power considerations in high-dimensionality data settings: a comparative study of classification algorithms.

Authors:  Yu Guo; Armin Graber; Robert N McBurney; Raji Balasubramanian
Journal:  BMC Bioinformatics       Date:  2010-09-03       Impact factor: 3.169

8.  Parallel multiplicity and error discovery rate (EDR) in microarray experiments.

Authors:  Wayne Wenzhong Xu; Clay J Carter
Journal:  BMC Bioinformatics       Date:  2010-09-16       Impact factor: 3.169

9.  Genome-wide DNA methylation indicates silencing of tumor suppressor genes in uterine leiomyoma.

Authors:  Antonia Navarro; Ping Yin; Diana Monsivais; Simon M Lin; Pan Du; Jian-Jun Wei; Serdar E Bulun
Journal:  PLoS One       Date:  2012-03-13       Impact factor: 3.240

10.  Power and sample size estimation in microarray studies.

Authors:  Wei-Jiun Lin; Huey-Miin Hsueh; James J Chen
Journal:  BMC Bioinformatics       Date:  2010-01-25       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.