Literature DB >> 19758448

A first principles approach to differential expression in microarray data analysis.

Robert A Rubin1.   

Abstract

BACKGROUND: The disparate results from the methods commonly used to determine differential expression in Affymetrix microarray experiments may well result from the wide variety of probe set and probe level models employed. Here we take the approach of making the fewest assumptions about the structure of the microarray data. Specifically, we only require that, under the null hypothesis that a gene is not differentially expressed for specified conditions, for any probe position in the gene's probe set: a) the probe amplitudes are independent and identically distributed over the conditions, and b) the distributions of the replicated probe amplitudes are amenable to classical analysis of variance (ANOVA). Log-amplitudes that have been standardized within-chip meet these conditions well enough for our approach, which is to perform ANOVA across conditions for each probe position, and then take the median of the resulting (1 - p) values as a gene-level measure of differential expression.
RESULTS: We applied the technique to the HGU-133A, HG-U95A, and "Golden Spike" spike-in data sets. The resulting receiver operating characteristic (ROC) curves compared favorably with other published results. This procedure is quite sensitive, so much so that it has revealed the presence of probe sets that might properly be called "unanticipated positives" rather than "false positives", because plots of these probe sets strongly suggest that they are differentially expressed.
CONCLUSION: The median ANOVA (1-p) approach presented here is a very simple methodology that does not depend on any specific probe level or probe models, and does not require any pre-processing other than within-chip standardization of probe level log amplitudes. Its performance is comparable to other published methods on the standard spike-in data sets, and has revealed the presence of new categories of probe sets that might properly be referred to as "unanticipated positives" and "unanticipated negatives" that need to be taken into account when using spiked-in data sets at "truthed" test beds.

Entities:  

Mesh:

Year:  2009        PMID: 19758448      PMCID: PMC2749840          DOI: 10.1186/1471-2105-10-292

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  4 in total

1.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data.

Authors:  Manhong Dai; Pinglang Wang; Andrew D Boyd; Georgi Kostov; Brian Athey; Edward G Jones; William E Bunney; Richard M Myers; Terry P Speed; Huda Akil; Stanley J Watson; Fan Meng
Journal:  Nucleic Acids Res       Date:  2005-11-10       Impact factor: 16.971

2.  Feature-level exploration of a published Affymetrix GeneChip control dataset.

Authors:  Rafael A Irizarry; Leslie M Cope; Zhijin Wu
Journal:  Genome Biol       Date:  2006       Impact factor: 13.583

3.  Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset.

Authors:  Sung E Choe; Michael Boutros; Alan M Michelson; George M Church; Marc S Halfon
Journal:  Genome Biol       Date:  2005-01-28       Impact factor: 13.583

4.  A comprehensive re-analysis of the Golden Spike data: towards a benchmark for differential expression methods.

Authors:  Richard D Pearson
Journal:  BMC Bioinformatics       Date:  2008-03-26       Impact factor: 3.169

  4 in total
  2 in total

1.  A comparison of probe-level and probeset models for small-sample gene expression data.

Authors:  John R Stevens; Jason L Bell; Kenneth I Aston; Kenneth L White
Journal:  BMC Bioinformatics       Date:  2010-05-26       Impact factor: 3.169

2.  Vibrio campbellii hmgA-mediated pyomelanization impairs quorum sensing, virulence, and cellular fitness.

Authors:  Zheng Wang; Baochuan Lin; Anahita Mostaghim; Robert A Rubin; Evan R Glaser; Pimonsri Mittraparp-Arthorn; Janelle R Thompson; Varaporn Vuddhakul; Gary J Vora
Journal:  Front Microbiol       Date:  2013-12-11       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.