Megan Orr1, Peng Liu1, Dan Nettleton1. 1. Department of Statistics, North Dakota State University, Fargo, ND 58102, USA and Department of Statistics, Iowa State University, Ames, IA 50011, USA.
Abstract
MOTIVATION: Asymmetry is frequently observed in the empirical distribution of test statistics that results from the analysis of gene expression experiments. This asymmetry indicates an asymmetry in the distribution of effect sizes. A common method for identifying differentially expressed (DE) genes in a gene expression experiment while controlling false discovery rate (FDR) is Storey's q-value method. This method ranks genes based solely on the P-values from each gene in the experiment. RESULTS: We propose a method that alters and improves upon the q-value method by taking the sign of the test statistics, in addition to the P-values, into account. Through two simulation studies (one involving independent normal data and one involving microarray data), we show that the proposed method, when compared with the traditional q-value method, generally provides a better ranking for genes as well as a higher number of truly DE genes declared to be DE, while still adequately controlling FDR. We illustrate the proposed method by analyzing two microarray datasets, one from an experiment of thale cress seedlings and the other from an experiment of maize leaves. AVAILABILITY AND IMPLEMENTATION: The R code and data files for the proposed method and examples are available at Bioinformatics online.
MOTIVATION: Asymmetry is frequently observed in the empirical distribution of test statistics that results from the analysis of gene expression experiments. This asymmetry indicates an asymmetry in the distribution of effect sizes. A common method for identifying differentially expressed (DE) genes in a gene expression experiment while controlling false discovery rate (FDR) is Storey's q-value method. This method ranks genes based solely on the P-values from each gene in the experiment. RESULTS: We propose a method that alters and improves upon the q-value method by taking the sign of the test statistics, in addition to the P-values, into account. Through two simulation studies (one involving independent normal data and one involving microarray data), we show that the proposed method, when compared with the traditional q-value method, generally provides a better ranking for genes as well as a higher number of truly DE genes declared to be DE, while still adequately controlling FDR. We illustrate the proposed method by analyzing two microarray datasets, one from an experiment of thale cress seedlings and the other from an experiment of maize leaves. AVAILABILITY AND IMPLEMENTATION: The R code and data files for the proposed method and examples are available at Bioinformatics online.
Authors: Sridhar Hannenhalli; Mary E Putt; Joan M Gilmore; Junwen Wang; Michael S Parmacek; Jonathan A Epstein; Edward E Morrisey; Kenneth B Margulies; Thomas P Cappola Journal: Circulation Date: 2006-09-04 Impact factor: 29.690
Authors: Sarah Covshoff; Wojciech Majeran; Peng Liu; Judith M Kolkman; Klaas J van Wijk; Thomas P Brutnell Journal: Plant Physiol Date: 2008-02-07 Impact factor: 8.340