Literature DB >> 20460310

Independent filtering increases detection power for high-throughput experiments.

Richard Bourgon1, Robert Gentleman, Wolfgang Huber.   

Abstract

With high-dimensional data, variable-by-variable statistical testing is often used to select variables whose behavior differs across conditions. Such an approach requires adjustment for multiple testing, which can result in low statistical power. A two-stage approach that first filters variables by a criterion independent of the test statistic, and then only tests variables which pass the filter, can provide higher power. We show that use of some filter/test statistics pairs presented in the literature may, however, lead to loss of type I error control. We describe other pairs which avoid this problem. In an application to microarray data, we found that gene-by-gene filtering by overall variance followed by a t-test increased the number of discoveries by 50%. We also show that this particular statistic pair induces a lower bound on fold-change among the set of discoveries. Independent filtering-using filter/test pairs that are independent under the null hypothesis but correlated under the alternative-is a general approach that can substantially increase the efficiency of experiments.

Entities:  

Mesh:

Year:  2010        PMID: 20460310      PMCID: PMC2906865          DOI: 10.1073/pnas.0914005107

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  17 in total

1.  Analysis of variance for gene expression microarray data.

Authors:  M K Kerr; M Martin; G A Churchill
Journal:  J Comput Biol       Date:  2000       Impact factor: 1.479

2.  Linear models and empirical bayes methods for assessing differential expression in microarray experiments.

Authors:  Gordon K Smyth
Journal:  Stat Appl Genet Mol Biol       Date:  2004-02-12

3.  A method to increase the power of multiple testing procedures through sample splitting.

Authors:  Daniel Rubin; Sandrine Dudoit; Mark van der Laan
Journal:  Stat Appl Genet Mol Biol       Date:  2006-08-01

4.  A class comparison method with filtering-enhanced variable selection for high-dimensional data sets.

Authors:  Lara Lusa; Edward L Korn; Lisa M McShane
Journal:  Stat Med       Date:  2008-12-10       Impact factor: 2.373

5.  Discussion of "Sure Independence Screening for Ultra-High Dimensional Feature Space.

Authors:  Hao Helen Zhang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-11       Impact factor: 4.488

6.  I/NI-calls for the exclusion of non-informative genes: a highly effective filtering tool for microarray data.

Authors:  Willem Talloen; Djork-Arné Clevert; Sepp Hochreiter; Dhammika Amaratunga; Luc Bijnens; Stefan Kass; Hinrich W H Göhlmann
Journal:  Bioinformatics       Date:  2007-10-05       Impact factor: 6.937

7.  Gene expression profile of adult T-cell acute lymphocytic leukemia identifies distinct subsets of patients with different response to therapy and survival.

Authors:  Sabina Chiaretti; Xiaochun Li; Robert Gentleman; Antonella Vitale; Marco Vignetti; Franco Mandelli; Jerome Ritz; Robin Foa
Journal:  Blood       Date:  2003-12-18       Impact factor: 22.113

8.  HIGH DIMENSIONAL VARIABLE SELECTION.

Authors:  Larry Wasserman; Kathryn Roeder
Journal:  Ann Stat       Date:  2009-01-01       Impact factor: 4.028

9.  Bioconductor: open software development for computational biology and bioinformatics.

Authors:  Robert C Gentleman; Vincent J Carey; Douglas M Bates; Ben Bolstad; Marcel Dettling; Sandrine Dudoit; Byron Ellis; Laurent Gautier; Yongchao Ge; Jeff Gentry; Kurt Hornik; Torsten Hothorn; Wolfgang Huber; Stefano Iacus; Rafael Irizarry; Friedrich Leisch; Cheng Li; Martin Maechler; Anthony J Rossini; Gunther Sawitzki; Colin Smith; Gordon Smyth; Luke Tierney; Jean Y H Yang; Jianhua Zhang
Journal:  Genome Biol       Date:  2004-09-15       Impact factor: 13.583

10.  Filtering genes for cluster and network analysis.

Authors:  David Tritchler; Elena Parkhomenko; Joseph Beyene
Journal:  BMC Bioinformatics       Date:  2009-06-23       Impact factor: 3.169

View more
  312 in total

1.  Regulation of neuronal gene expression and survival by basal NMDA receptor activity: a role for histone deacetylase 4.

Authors:  Yelin Chen; Yuanyuan Wang; Zora Modrusan; Morgan Sheng; Joshua S Kaminker
Journal:  J Neurosci       Date:  2014-11-12       Impact factor: 6.167

2.  Filtering data from high-throughput experiments based on measurement reliability.

Authors:  Willem Talloen; Sepp Hochreiter; Luc Bijnens; Adetayo Kasim; Ziv Shkedy; Dhammika Amaratunga; Hinrich Göhlmann
Journal:  Proc Natl Acad Sci U S A       Date:  2010-11-08       Impact factor: 11.205

3.  Evolution of gene expression and expression plasticity in long-term experimental populations of Drosophila melanogaster maintained under constant and variable ethanol stress.

Authors:  Lev Y Yampolsky; Galina V Glazko; James D Fry
Journal:  Mol Ecol       Date:  2012-07-09       Impact factor: 6.185

4.  Two-stage testing procedures with independent filtering for genome-wide gene-environment interaction.

Authors:  James Y Dai; Charles Kooperberg; Michael Leblanc; Ross L Prentice
Journal:  Biometrika       Date:  2012-09-25       Impact factor: 2.445

5.  Count-based differential expression analysis of RNA sequencing data using R and Bioconductor.

Authors:  Simon Anders; Davis J McCarthy; Yunshun Chen; Michal Okoniewski; Gordon K Smyth; Wolfgang Huber; Mark D Robinson
Journal:  Nat Protoc       Date:  2013-08-22       Impact factor: 13.491

6.  De novo detection of differentially bound regions for ChIP-seq data using peaks and windows: controlling error rates correctly.

Authors:  Aaron T L Lun; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2014-05-22       Impact factor: 16.971

7.  Transcriptome-wide RNA sequencing analysis of rat skeletal muscle feed arteries. I. Impact of obesity.

Authors:  Nathan T Jenkins; Jaume Padilla; Pamela K Thorne; Jeffrey S Martin; R Scott Rector; J Wade Davis; M Harold Laughlin
Journal:  J Appl Physiol (1985)       Date:  2014-01-16

8.  Broad defects in the energy metabolism of leukocytes underlie immunoparalysis in sepsis.

Authors:  Shih-Chin Cheng; Brendon P Scicluna; Rob J W Arts; Mark S Gresnigt; Ekta Lachmandas; Evangelos J Giamarellos-Bourboulis; Matthijs Kox; Ganesh R Manjeri; Jori A L Wagenaars; Olaf L Cremer; Jenneke Leentjens; Anne J van der Meer; Frank L van de Veerdonk; Marc J Bonten; Marcus J Schultz; Peter H G M Willems; Peter Pickkers; Leo A B Joosten; Tom van der Poll; Mihai G Netea
Journal:  Nat Immunol       Date:  2016-03-07       Impact factor: 25.606

9.  Differential processing and localization of human Nocturnin controls metabolism of mRNA and nicotinamide adenine dinucleotide cofactors.

Authors:  Elizabeth T Abshire; Kelsey L Hughes; Rucheng Diao; Sarah Pearce; Shreekara Gopalakrishna; Raymond C Trievel; Joanna Rorbach; Peter L Freddolino; Aaron C Goldstrohm
Journal:  J Biol Chem       Date:  2020-08-23       Impact factor: 5.157

10.  Mapping Proteome-Wide Targets of Environmental Chemicals Using Reactivity-Based Chemoproteomic Platforms.

Authors:  Daniel Medina-Cleghorn; Leslie A Bateman; Breanna Ford; Ann Heslin; Karl J Fisher; Esha D Dalvie; Daniel K Nomura
Journal:  Chem Biol       Date:  2015-10-22
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.