MOTIVATION: Microarray technology has become a powerful research tool in many fields of study; however, the cost of microarrays often limits experiments to a low number of replicates (k). When k is low, it becomes difficult to perform standard statistical tests to extract the most biologically significant experimental results. More advanced statistical tests have been developed; however, they often remain difficult to apply and interpret in routine biological research. The present work outlines a method that achieves sufficient statistical power for selecting differentially expressed genes under conditions of low k, while remaining intuitive and computationally efficient.
RESULTS: The present study describes a Global Error Assessment (GEA) methodology for selecting differentially expressed genes in microarray datasets. The method was developed using an in vitro experiment that compared control and interferon-gamma-treated skin cells. In this experiment, up to nine replicates were used to estimate error with confidence, thereby enabling methods of different statistical power to be compared. Genes with similar absolute expression levels are binned, so as to enable a highly accurate local estimate of the mean squared error within conditions. The model then relates the variability of gene expression in each bin to the absolute expression level and uses this relationship in a test derived from the classical ANOVA. The GEA selection method is compared with both the classical and permutational ANOVA tests, and demonstrates increased stability, robustness and confidence in gene selection. A subset of the selected genes was validated by real-time reverse transcription-polymerase chain reaction (RT-PCR). Together, these results suggest that the GEA methodology is (i) suitable for selection of differentially expressed genes in microarray data, (ii) intuitive and computationally efficient and (iii) especially advantageous under conditions of low k.
AVAILABILITY: The GEA code for R is freely available from the authors upon request.
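ILLUSTRATION: The binning idea described under RESULTS can be sketched in a few lines. The following is a minimal sketch under stated assumptions, not the authors' R implementation: the function names, the fixed bin size, the use of a plain average of per-gene errors within each bin, and the synthetic data are all hypothetical choices made for illustration. It computes, for each gene, the between-condition mean square from the classical one-way ANOVA and divides it by a mean squared error pooled over genes of similar absolute expression, rather than by the gene's own error estimate.

```python
import random
from statistics import mean


def within_mse(groups):
    """Pooled within-condition mean squared error for one gene."""
    ss, df = 0.0, 0
    for g in groups:
        m = mean(g)
        ss += sum((x - m) ** 2 for x in g)
        df += len(g) - 1
    return ss / df


def between_ms(groups):
    """Between-condition mean square (numerator of the one-way ANOVA F)."""
    grand = mean(x for g in groups for x in g)
    ss = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
    return ss / (len(groups) - 1)


def binned_error_statistics(data, bin_size=50):
    """Return one F-like statistic per gene, in input order.

    data: list of genes; each gene is a list of conditions;
          each condition is a list of replicate measurements.
    bin_size: hypothetical number of genes per expression-level bin.
    """
    n = len(data)
    levels = [mean(x for g in gene for x in g) for gene in data]
    mses = [within_mse(gene) for gene in data]
    # Sort genes by average expression so each bin holds similar levels.
    order = sorted(range(n), key=lambda i: levels[i])
    stats = [0.0] * n
    for start in range(0, n, bin_size):
        members = order[start:start + bin_size]
        # Local error estimate shared by every gene in the bin.
        local_mse = mean(mses[i] for i in members)
        for i in members:
            stats[i] = between_ms(data[i]) / local_mse
    return stats


# Synthetic example (assumed data): 200 genes, 2 conditions, k = 3 replicates.
random.seed(0)
data = []
for _ in range(200):
    base = random.uniform(4.0, 12.0)
    data.append([[random.gauss(base, 0.2) for _ in range(3)] for _ in range(2)])
# Gene 0 is made differentially expressed by shifting its second condition.
data[0][1] = [x + 5.0 for x in data[0][1]]

stats = binned_error_statistics(data)
top_gene = max(range(len(stats)), key=lambda i: stats[i])
print("highest-ranked gene index:", top_gene)
```

Because the denominator is pooled over many genes, it rests on far more degrees of freedom than a per-gene error estimate obtained from a few replicates, which is the intuition behind the advantage at low k. Converting the statistic into a p-value and choosing the bin width are left out of this sketch.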