MOTIVATION: Oligonucleotide expression arrays exhibit systematic and reproducible variation produced by the multiple distinct probes used to represent a gene. Recently, a gene expression index has been proposed that explicitly models probe effects, and provides improved fits of hybridization intensity for arrays containing perfect match (PM) and mismatch (MM) probe pairs. RESULTS: Here we use a combination of analytical arguments and empirical data to show directly that the estimates provided by model-based expression indexes are superior to those provided by commercial software. The improvement is greatest for genes in which probe effects vary substantially, and modeling the PM and MM intensities separately is superior to using the PM-MM differences. To empirically compare expression indexes, we designed a mixing experiment involving three groups of human fibroblast cells (serum starved, serum stimulated, and a 50:50 mixture of starved/stimulated), with six replicate HuGeneFL arrays in each group. Careful spiking of control genes provides evidence that 88-98% of the genes on the array are detectably transcribed, and that the model-based estimates can accurately detect the presence versus absence of a gene. The use of extensive replication from single RNA sources enables exploration of the technical variability of the array.
MOTIVATION:Oligonucleotide expression arrays exhibit systematic and reproducible variation produced by the multiple distinct probes used to represent a gene. Recently, a gene expression index has been proposed that explicitly models probe effects, and provides improved fits of hybridization intensity for arrays containing perfect match (PM) and mismatch (MM) probe pairs. RESULTS: Here we use a combination of analytical arguments and empirical data to show directly that the estimates provided by model-based expression indexes are superior to those provided by commercial software. The improvement is greatest for genes in which probe effects vary substantially, and modeling the PM and MM intensities separately is superior to using the PM-MM differences. To empirically compare expression indexes, we designed a mixing experiment involving three groups of human fibroblast cells (serum starved, serum stimulated, and a 50:50 mixture of starved/stimulated), with six replicate HuGeneFL arrays in each group. Careful spiking of control genes provides evidence that 88-98% of the genes on the array are detectably transcribed, and that the model-based estimates can accurately detect the presence versus absence of a gene. The use of extensive replication from single RNA sources enables exploration of the technical variability of the array.
Authors: Rafael A Irizarry; Benjamin M Bolstad; Francois Collin; Leslie M Cope; Bridget Hobbs; Terence P Speed Journal: Nucleic Acids Res Date: 2003-02-15 Impact factor: 16.971
Authors: Aliaksei Z Holik; Charity W Law; Ruijie Liu; Zeya Wang; Wenyi Wang; Jaeil Ahn; Marie-Liesse Asselin-Labat; Gordon K Smyth; Matthew E Ritchie Journal: Nucleic Acids Res Date: 2017-03-17 Impact factor: 16.971
Authors: Heejei Yoon; Sandya Liyanarachchi; Fred A Wright; Ramana Davuluri; Janet C Lockman; Albert de la Chapelle; Natalia S Pellegata Journal: Proc Natl Acad Sci U S A Date: 2002-11-15 Impact factor: 11.205
Authors: Stefano Colella; Kristy L Richards; Linda L Bachinski; Keith A Baggerly; Spiridon Tsavachidis; James C Lang; David E Schuller; Ralf Krahe Journal: Head Neck Date: 2008-10 Impact factor: 3.147
Authors: Ramil N Nurtdinov; Mikhail O Vasiliev; Anna S Ershova; Ilia S Lossev; Anna S Karyagina Journal: Nucleic Acids Res Date: 2009-11-11 Impact factor: 16.971