Literature DB >> 17046978

Estimation of false discovery proportion under general dependence.

Yudi Pawitan1, Stefano Calza, Alexander Ploner.   

Abstract

MOTIVATION: Wide-scale correlations between genes are commonly observed in gene expression data, due to both biological and technical reasons. These correlations increase the variability of the standard estimate of the false discovery rate (FDR). We highlight the false discovery proportion (FDP, instead of the FDR) as the suitable quantity for assessing differential expression in microarray data, demonstrate the deleterious effects of correlation on FDP estimation and propose an improved estimation method that accounts for the correlations.
METHODS: We analyse the variation pattern of the distribution of test statistics under permutation using the singular value decomposition. The results suggest a latent FDR model that accounts for the effects of correlation, and is statistically closer to the FDP. We develop a procedure for estimating the latent FDR (ELF) based on a Poisson regression model.
RESULTS: For simulated data based on the correlation structure of real datasets, we find that ELF performs substantially better than the standard FDR approach in estimating the FDP. We illustrate the use of ELF in the analysis of breast cancer and lymphoma data. AVAILABILITY: R code to perform ELF is available in http://www.meb.ki.se/~yudpaw.

Entities:  

Mesh:

Year:  2006        PMID: 17046978     DOI: 10.1093/bioinformatics/btl527

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  10 in total

Review 1.  Genome-wide association studies and the genetic dissection of complex traits.

Authors:  Paola Sebastiani; Nadia Timofeev; Daniel A Dworkis; Thomas T Perls; Martin H Steinberg
Journal:  Am J Hematol       Date:  2009-08       Impact factor: 10.047

2.  Comments on the analysis of unbalanced microarray data.

Authors:  Kathleen F Kerr
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

3.  An efficient method to identify differentially expressed genes in microarray experiments.

Authors:  Huaizhen Qin; Tao Feng; Scott A Harding; Chung-Jui Tsai; Shuanglin Zhang
Journal:  Bioinformatics       Date:  2008-05-03       Impact factor: 6.937

4.  Empirical null distribution based modeling of multi-class differential gene expression detection.

Authors:  Xiting Cao; Baolin Wu; Marshall I Hertz
Journal:  J Appl Stat       Date:  2012-11-21       Impact factor: 1.404

5.  Identification of significant features in DNA microarray data.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013-07

6.  fdrci: FDR confidence interval selection and adjustment for large-scale hypothesis testing.

Authors:  Joshua Millstein; Francesca Battaglin; Hiroyuki Arai; Wu Zhang; Priya Jayachandran; Shivani Soni; Aparna R Parikh; Christoph Mancao; Heinz-Josef Lenz
Journal:  Bioinform Adv       Date:  2022-06-13

7.  Sources of variation in false discovery rate estimation include sample size, correlation, and inherent differences between groups.

Authors:  Jiexin Zhang; Kevin R Coombes
Journal:  BMC Bioinformatics       Date:  2012-08-24       Impact factor: 3.169

8.  Capturing heterogeneity in gene expression studies by surrogate variable analysis.

Authors:  Jeffrey T Leek; John D Storey
Journal:  PLoS Genet       Date:  2007-08-01       Impact factor: 5.917

9.  Weighted analysis of general microarray experiments.

Authors:  Anders Sjögren; Erik Kristiansson; Mats Rudemo; Olle Nerman
Journal:  BMC Bioinformatics       Date:  2007-10-15       Impact factor: 3.169

10.  Identifying and Assessing Interesting Subgroups in a Heterogeneous Population.

Authors:  Woojoo Lee; Andrey Alexeyenko; Maria Pernemalm; Justine Guegan; Philippe Dessen; Vladimir Lazar; Janne Lehtiö; Yudi Pawitan
Journal:  Biomed Res Int       Date:  2015-08-03       Impact factor: 3.411

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.