Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Correlation between gene expression levels and limitations of the empirical bayes methodology for finding differentially expressed genes.

Literature DB >> 16646853

Correlation between gene expression levels and limitations of the empirical bayes methodology for finding differentially expressed genes.

Xing Qiu¹, Lev Klebanov, Andrei Yakovlev.

Abstract

Stochastic dependence between gene expression levels in microarray data is of critical importance for the methods of statistical inference that resort to pooling test statistics across genes. The empirical Bayes methodology in the nonparametric and parametric formulations, as well as closely related methods employing a two-component mixture model, represent typical examples. It is frequently assumed that dependence between gene expressions (or associated test statistics) is sufficiently weak to justify the application of such methods for selecting differentially expressed genes. By applying resampling techniques to simulated and real biological data sets, we have studied a potential impact of the correlation between gene expression levels on the statistical inference based on the empirical Bayes methodology. We report evidence from these analyses that this impact may be quite strong, leading to a high variance of the number of differentially expressed genes. This study also pinpoints specific components of the empirical Bayes method where the reported effect manifests itself.

Year: 2005 PMID： 16646853 DOI： 10.2202/1544-6115.1157

Source DB: PubMed Journal: Stat Appl Genet Mol Biol ISSN： 1544-6115

Keyword Cloud
Cited

37 in total

Correlation between gene expression levels and limitations of the empirical bayes methodology for finding differentially expressed genes.

Review 1. Utility of correlation measures in analysis of gene expression.

2. A general framework for multiple testing dependence.

3. Comments on the analysis of unbalanced microarray data.

4. A new gene selection procedure based on the covariance distance.

5. Correlated z-values and the accuracy of large-scale statistical estimates.

6. Illustrations on Using the Distribution of a P-value in High Dimensional Data Analyses.

7. Region-based Statistical Analysis of 2D PAGE Images.

8. Identifying common prognostic factors in genomic cancer studies: a novel index for censored outcomes.

9. On the choice and number of microarrays for transcriptional regulatory network inference.

10. The limitations of simple gene set enrichment analysis assuming gene independence.