Literature DB >> 15882143

Statistical development and evaluation of microarray gene expression data filters.

Stan Pounds1, Cheng Cheng.   

Abstract

Filtering is a common practice used to simplify the analysis of microarray data by removing from subsequent consideration probe sets believed to be unexpressed. The m/n filter, which is widely used in the analysis of Affymetrix data, removes all probe sets having fewer than m present calls among a set of n chips. The m/n filter has been widely used without considering its statistical properties. The level and power of the m/n filter are derived. Two alternative filters, the pooled p-value filter and the error-minimizing pooled p-value filter are proposed. The pooled p-value filter combines information from the present-absent p-values into a single summary p-value which is subsequently compared to a selected significance threshold. We show that pooled p-value filter is the uniformly most powerful statistical test under a reasonable beta model and that it exhibits greater power than the m/n filter in all scenarios considered in a simulation study. The error-minimizing pooled p-value filter compares the summary p-value with a threshold determined to minimize a total-error criterion based on a partition of the distribution of all probes' summary p-values. The pooled p-value and error-minimizing pooled p-value filters clearly perform better than the m/n filter in a case-study analysis. The case-study analysis also demonstrates a proposed method for estimating the number of differentially expressed probe sets excluded by filtering and subsequent impact on the final analysis. The filter impact analysis shows that the use of even the best filter may hinder, rather than enhance, the ability to discover interesting probe sets or genes. S-plus and R routines to implement the pooled p-value and error-minimizing pooled p-value filters have been developed and are available from www.stjuderesearch.org/depts/biostats/index.html.

Mesh:

Year:  2005        PMID: 15882143     DOI: 10.1089/cmb.2005.12.482

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  11 in total

1.  Differential expression analysis of Digital Gene Expression data: RNA-tag filtering, comparison of t-type tests and their genome-wide co-expression based adjustments.

Authors:  Yinglei Lai
Journal:  Int J Bioinform Res Appl       Date:  2010

2.  PROMISE: a tool to identify genomic features with a specific biologically interesting pattern of associations with multiple endpoint variables.

Authors:  Stan Pounds; Cheng Cheng; Xueyuan Cao; Kristine R Crews; William Plunkett; Varsha Gandhi; Jeffrey Rubnitz; Raul C Ribeiro; James R Downing; Jatinder Lamba
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

3.  A procedure to statistically evaluate agreement of differential expression for cross-species genomics.

Authors:  Stan Pounds; Cuilan Lani Gao; Robert A Johnson; Karen D Wright; Helen Poppleton; David Finkelstein; Sarah E S Leary; Richard J Gilbertson
Journal:  Bioinformatics       Date:  2011-06-22       Impact factor: 6.937

4.  Reference alignment of SNP microarray signals for copy number analysis of tumors.

Authors:  Stan Pounds; Cheng Cheng; Charles Mullighan; Susana C Raimondi; Sheila Shurtleff; James R Downing
Journal:  Bioinformatics       Date:  2008-12-03       Impact factor: 6.937

Review 5.  Review of the literature examining the correlation among DNA microarray technologies.

Authors:  Carole L Yauk; M Lynn Berndt
Journal:  Environ Mol Mutagen       Date:  2007-06       Impact factor: 3.216

6.  Filtering, FDR and power.

Authors:  Maarten van Iterson; Judith M Boer; Renée X Menezes
Journal:  BMC Bioinformatics       Date:  2010-09-07       Impact factor: 3.169

7.  The effect of insulin on expression of genes and biochemical pathways in human skeletal muscle.

Authors:  Xuxia Wu; Jelai Wang; Xiangqin Cui; Lidia Maianu; Brian Rhees; James Rosinski; W Venus So; Steven M Willi; Michael V Osier; Helliner S Hill; Grier P Page; David B Allison; Mitchell Martin; W Timothy Garvey
Journal:  Endocrine       Date:  2007-02       Impact factor: 3.633

8.  Sources of variation in Affymetrix microarray experiments.

Authors:  Stanislav O Zakharkin; Kyoungmi Kim; Tapan Mehta; Lang Chen; Stephen Barnes; Katherine E Scheirer; Rudolph S Parrish; David B Allison; Grier P Page
Journal:  BMC Bioinformatics       Date:  2005-08-29       Impact factor: 3.169

9.  Tests for differential gene expression using weights in oligonucleotide microarray experiments.

Authors:  Pingzhao Hu; Joseph Beyene; Celia M T Greenwood
Journal:  BMC Genomics       Date:  2006-02-22       Impact factor: 3.969

10.  False discovery rate paradigms for statistical analyses of microarray gene expression data.

Authors:  Cheng Cheng; Stan Pounds
Journal:  Bioinformation       Date:  2007-04-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.