Literature DB >> 16646853

Correlation between gene expression levels and limitations of the empirical bayes methodology for finding differentially expressed genes.

Xing Qiu1, Lev Klebanov, Andrei Yakovlev.   

Abstract

Stochastic dependence between gene expression levels in microarray data is of critical importance for the methods of statistical inference that resort to pooling test statistics across genes. The empirical Bayes methodology in the nonparametric and parametric formulations, as well as closely related methods employing a two-component mixture model, represent typical examples. It is frequently assumed that dependence between gene expressions (or associated test statistics) is sufficiently weak to justify the application of such methods for selecting differentially expressed genes. By applying resampling techniques to simulated and real biological data sets, we have studied a potential impact of the correlation between gene expression levels on the statistical inference based on the empirical Bayes methodology. We report evidence from these analyses that this impact may be quite strong, leading to a high variance of the number of differentially expressed genes. This study also pinpoints specific components of the empirical Bayes method where the reported effect manifests itself.

Year:  2005        PMID: 16646853     DOI: 10.2202/1544-6115.1157

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  37 in total

Review 1.  Utility of correlation measures in analysis of gene expression.

Authors:  Anthony Almudevar; Lev B Klebanov; Xing Qiu; Peter Salzman; Andrei Y Yakovlev
Journal:  NeuroRx       Date:  2006-07

2.  A general framework for multiple testing dependence.

Authors:  Jeffrey T Leek; John D Storey
Journal:  Proc Natl Acad Sci U S A       Date:  2008-11-24       Impact factor: 11.205

3.  Comments on the analysis of unbalanced microarray data.

Authors:  Kathleen F Kerr
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

4.  A new gene selection procedure based on the covariance distance.

Authors:  Rui Hu; Xing Qiu; Galina Glazko
Journal:  Bioinformatics       Date:  2009-12-08       Impact factor: 6.937

5.  Correlated z-values and the accuracy of large-scale statistical estimates.

Authors:  Bradley Efron
Journal:  J Am Stat Assoc       Date:  2010-09-01       Impact factor: 5.033

6.  Illustrations on Using the Distribution of a P-value in High Dimensional Data Analyses.

Authors:  Xiaojun Hu; Gary L Gadbury; Qinfang Xiang; David B Allison
Journal:  Adv Appl Stat Sci       Date:  2010-02

7.  Region-based Statistical Analysis of 2D PAGE Images.

Authors:  Feng Li; Françoise Seillier-Moiseiwitsch; Valeriy R Korostyshevskiy
Journal:  Comput Stat Data Anal       Date:  2011-11-01       Impact factor: 1.681

8.  Identifying common prognostic factors in genomic cancer studies: a novel index for censored outcomes.

Authors:  Sigrid Rouam; Thierry Moreau; Philippe Broët
Journal:  BMC Bioinformatics       Date:  2010-03-24       Impact factor: 3.169

9.  On the choice and number of microarrays for transcriptional regulatory network inference.

Authors:  Elissa J Cosgrove; Timothy S Gardner; Eric D Kolaczyk
Journal:  BMC Bioinformatics       Date:  2010-09-09       Impact factor: 3.169

10.  The limitations of simple gene set enrichment analysis assuming gene independence.

Authors:  Pablo Tamayo; George Steinhardt; Arthur Liberzon; Jill P Mesirov
Journal:  Stat Methods Med Res       Date:  2012-10-14       Impact factor: 3.021

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.