Literature DB >> 15702196

Comparison of preprocessing procedures for oligo-nucleotide micro-arrays by parametric bootstrap simulation of spike-in experiments.

J Freudenberg1, H Boriss, D Hasenclever.   

Abstract

OBJECTIVE: Due to scarcity of calibration data for microarray experiments, simulation methods are employed to assess preprocessing procedures. Here we analyze several procedures' robustness against increasing numbers of differentially expressed genes and varying proportions of up-regulation.
METHODS: Raw probe data from oligo-nucleotide microarrays are assumed to be approximately multivariate normally distributed on the log scale. Chips can be simulated from a multivariate normal distribution with mean and variance-covariance matrix estimated from a real raw data set. A chip effect induces strong positive correlations. In reverse, sampling from a normal distribution with strong correlation variance-covariance matrix generates data exhibiting a chip effect. No explicit model of chip-effect is needed. Differences can be artificially spiked-in according to a given distribution of effect sizes. Thirty preprocessing procedures combining background correction, normalization, perfect match correction and summarization methods available from the BioConductor project were compared.
RESULTS: In the symmetrical setting "50% differentially expressed genes, 50% of which up-regulated" background correction reduces bias, but inflates low intensity probe variance as well as the mean squared error of the estimates. Any normalization reduces variance and increases sensitivity with no clear winner. Asymmetry between up and down regulation causes bias in the effect-size estimate of non-differentially expressed genes. This markedly inflates the false positive discovery rates. Variance stabilizing normalization (VSN) behaved best.
CONCLUSION: A simple parametric bootstrap was used to simulate oligo-nucleotide micro-array raw data. Current normalization methods inflate the false positive rate when many genes show an effect in the same direction.

Entities:  

Mesh:

Year:  2004        PMID: 15702196

Source DB:  PubMed          Journal:  Methods Inf Med        ISSN: 0026-1270            Impact factor:   2.176


  6 in total

1.  Transcriptome dynamics of the microRNA inhibition response.

Authors:  Jiayu Wen; Eleonora Leucci; Roberto Vendramin; Sakari Kauppinen; Anders H Lund; Anders Krogh; Brian J Parker
Journal:  Nucleic Acids Res       Date:  2015-06-18       Impact factor: 16.971

2.  Probe set algorithms: is there a rational best bet?

Authors:  Jinwook Seo; Eric P Hoffman
Journal:  BMC Bioinformatics       Date:  2006-08-30       Impact factor: 3.169

3.  Global rank-invariant set normalization (GRSN) to reduce systematic distortions in microarray data.

Authors:  Carl R Pelz; Molly Kulesz-Martin; Grover Bagby; Rosalie C Sears
Journal:  BMC Bioinformatics       Date:  2008-12-04       Impact factor: 3.169

4.  The effect of oligonucleotide microarray data pre-processing on the analysis of patient-cohort studies.

Authors:  Roel G W Verhaak; Frank J T Staal; Peter J M Valk; Bob Lowenberg; Marcel J T Reinders; Dick de Ridder
Journal:  BMC Bioinformatics       Date:  2006-03-02       Impact factor: 3.169

5.  Pre-processing Agilent microarray data.

Authors:  Marianna Zahurak; Giovanni Parmigiani; Wayne Yu; Robert B Scharpf; David Berman; Edward Schaeffer; Shabana Shabbeer; Leslie Cope
Journal:  BMC Bioinformatics       Date:  2007-05-01       Impact factor: 3.169

6.  A Bootstrap Based Measure Robust to the Choice of Normalization Methods for Detecting Rhythmic Features in High Dimensional Data.

Authors:  Yolanda Larriba; Cristina Rueda; Miguel A Fernández; Shyamal D Peddada
Journal:  Front Genet       Date:  2018-02-02       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.