Literature DB >> 21186247

A computationally efficient modular optimal discovery procedure.

Sangsoon Woo1, Jeffrey T Leek, John D Storey.   

Abstract

MOTIVATION: It is well known that patterns of differential gene expression across biological conditions are often shared by many genes, particularly those within functional groups. Taking advantage of these patterns can lead to increased statistical power and biological clarity when testing for differential expression in a microarray experiment. The optimal discovery procedure (ODP), which maximizes the expected number of true positives for each fixed number of expected false positives, is a framework aimed at this goal. Storey et al. introduced an estimator of the ODP for identifying differentially expressed genes. However, their ODP estimator grows quadratically in computational time with respect to the number of genes. Reducing this computational burden is a key step in making the ODP practical for usage in a variety of high-throughput problems.
RESULTS: Here, we propose a new estimate of the ODP called the modular ODP (mODP). The existing 'full ODP' requires that the likelihood function for each gene be evaluated according to the parameter estimates for all genes. The mODP assigns genes to modules according to a Kullback-Leibler distance, and then evaluates the statistic only at the module-averaged parameter estimates. We show that the mODP is relatively insensitive to the choice of the number of modules, but dramatically reduces the computational complexity from quadratic to linear in the number of genes. We compare the full ODP algorithm and mODP on simulated data and gene expression data from a recent study of Morrocan Amazighs. The mODP and full ODP algorithm perform very similarly across a range of comparisons. AVAILABILITY: The mODP methodology has been implemented into EDGE, a comprehensive gene expression analysis software package in R, available at http://genomine.org/edge/.

Mesh:

Year:  2010        PMID: 21186247      PMCID: PMC3105483          DOI: 10.1093/bioinformatics/btq701

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  11 in total

1.  Statistical significance for genomewide studies.

Authors:  John D Storey; Robert Tibshirani
Journal:  Proc Natl Acad Sci U S A       Date:  2003-07-25       Impact factor: 11.205

2.  Detecting differential gene expression with a semiparametric hierarchical mixture method.

Authors:  Michael A Newton; Amine Noueiry; Deepayan Sarkar; Paul Ahlquist
Journal:  Biostatistics       Date:  2004-04       Impact factor: 5.899

3.  Improved statistical tests for differential gene expression by shrinking variance components estimates.

Authors:  Xiangqin Cui; J T Gene Hwang; Jing Qiu; Natalie J Blades; Gary A Churchill
Journal:  Biostatistics       Date:  2005-01       Impact factor: 5.899

4.  Significance analysis of time course microarray experiments.

Authors:  John D Storey; Wenzhong Xiao; Jeffrey T Leek; Ronald G Tompkins; Ronald W Davis
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-02       Impact factor: 11.205

5.  EDGE: extraction and analysis of differential gene expression.

Authors:  Jeffrey T Leek; Eva Monsen; Alan R Dabney; John D Storey
Journal:  Bioinformatics       Date:  2005-12-15       Impact factor: 6.937

6.  A general framework for weighted gene co-expression network analysis.

Authors:  Bin Zhang; Steve Horvath
Journal:  Stat Appl Genet Mol Biol       Date:  2005-08-12

7.  Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors:  Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2004-02-12

8.  The optimal discovery procedure for large-scale significance testing, with applications to comparative microarray experiments.

Authors:  John D Storey; James Y Dai; Jeffrey T Leek
Journal:  Biostatistics       Date:  2006-08-23       Impact factor: 5.899

9.  A Bayesian Discovery Procedure.

Authors:  Michele Guindani; Peter Müller; Song Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2009-11-01       Impact factor: 4.488

10.  A genome-wide gene expression signature of environmental geography in leukocytes of Moroccan Amazighs.

Authors:  Youssef Idaghdour; John D Storey; Sami J Jadallah; Greg Gibson
Journal:  PLoS Genet       Date:  2008-04-11       Impact factor: 5.917

View more
  8 in total

Review 1.  Studying and modelling dynamic biological processes using time-series gene expression data.

Authors:  Ziv Bar-Joseph; Anthony Gitter; Itamar Simon
Journal:  Nat Rev Genet       Date:  2012-07-18       Impact factor: 53.242

2.  Mistimed food intake and sleep alters 24-hour time-of-day patterns of the human plasma proteome.

Authors:  Christopher M Depner; Edward L Melanson; Andrew W McHill; Kenneth P Wright
Journal:  Proc Natl Acad Sci U S A       Date:  2018-05-21       Impact factor: 11.205

3.  Human immunophenotyping via low-variance, low-bias, interpretive regression modeling of small, wide data sets: Application to aging and immune response to influenza vaccination.

Authors:  Tyson H Holmes; Xiao-Song He
Journal:  J Immunol Methods       Date:  2016-05-16       Impact factor: 2.303

4.  The transcriptome of a complete episode of acute otitis media.

Authors:  Michelle Hernandez; Anke Leichtle; Kwang Pak; Nicholas J Webster; Stephen I Wasserman; Allen F Ryan
Journal:  BMC Genomics       Date:  2015-04-03       Impact factor: 3.969

5.  An empirical Bayes optimal discovery procedure based on semiparametric hierarchical mixture models.

Authors:  Hisashi Noma; Shigeyuki Matsui
Journal:  Comput Math Methods Med       Date:  2013-04-10       Impact factor: 2.238

6.  Toward an understanding of changes in diversity associated with fecal microbiome transplantation based on 16S rRNA gene deep sequencing.

Authors:  Dea Shahinas; Michael Silverman; Taylor Sittler; Charles Chiu; Peter Kim; Emma Allen-Vercoe; Scott Weese; Andrew Wong; Donald E Low; Dylan R Pillai
Journal:  MBio       Date:  2012-10-23       Impact factor: 7.867

7.  Dynamic changes in the brain protein interaction network correlates with progression of Aβ42 pathology in Drosophila.

Authors:  Harry M Scholes; Adam Cryar; Fiona Kerr; David Sutherland; Lee A Gethings; Johannes P C Vissers; Jonathan G Lees; Christine A Orengo; Linda Partridge; Konstantinos Thalassinos
Journal:  Sci Rep       Date:  2020-10-28       Impact factor: 4.379

8.  The optimal discovery procedure for significance analysis of general gene expression studies.

Authors:  Andrew J Bass; John D Storey
Journal:  Bioinformatics       Date:  2021-04-20       Impact factor: 6.931

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.