MOTIVATION: Maintaining and controlling data quality is a key problem in large scale microarray studies. In particular systematic changes in experimental conditions across multiple chips can seriously affect quality and even lead to false biological conclusions. Traditionally the influence of these effects can be minimized only by expensive repeated measurements, because a detailed understanding of all process relevant parameters seems impossible. RESULTS: We introduce a novel method for microarray process control that estimates quality based solely on the distribution of the actual measurements without requiring repeated experiments. A robust version of principle component analysis detects single outlier microarrays and thereby enables the use of techniques from multivariate statistical process control. In particular, the T(2) control chart reliably tracks undesired changes in process relevant parameters. This can be used to improve the microarray process itself, limits necessary repetitions to only affected samples and therefore maintains quality in a cost effective way. We prove the power of the approach on 3 large sets of DNA methylation microarray data.
MOTIVATION: Maintaining and controlling data quality is a key problem in large scale microarray studies. In particular systematic changes in experimental conditions across multiple chips can seriously affect quality and even lead to false biological conclusions. Traditionally the influence of these effects can be minimized only by expensive repeated measurements, because a detailed understanding of all process relevant parameters seems impossible. RESULTS: We introduce a novel method for microarray process control that estimates quality based solely on the distribution of the actual measurements without requiring repeated experiments. A robust version of principle component analysis detects single outlier microarrays and thereby enables the use of techniques from multivariate statistical process control. In particular, the T(2) control chart reliably tracks undesired changes in process relevant parameters. This can be used to improve the microarray process itself, limits necessary repetitions to only affected samples and therefore maintains quality in a cost effective way. We prove the power of the approach on 3 large sets of DNA methylation microarray data.
Authors: L D Burgoon; J E Eckel-Passow; C Gennings; D R Boverhof; J W Burt; C J Fong; T R Zacharewski Journal: Nucleic Acids Res Date: 2005-11-04 Impact factor: 16.971
Authors: William B Mattes; Syril D Pettit; Susanna-Assunta Sansone; Pierre R Bushel; Michael D Waters Journal: Environ Health Perspect Date: 2004-03 Impact factor: 9.031
Authors: Matthew E Ritchie; Dileepa Diyagama; Jody Neilson; Ryan van Laar; Alexander Dobrovic; Andrew Holloway; Gordon K Smyth Journal: BMC Bioinformatics Date: 2006-05-19 Impact factor: 3.169
Authors: Song Yang; Xiang Guo; Yaw-Ching Yang; Denise Papcunik; Caroline Heckman; Jeffrey Hooke; Craig D Shriver; Michael N Liebman; Hai Hu Journal: Cancer Inform Date: 2007-02-24