MOTIVATION: Microarrays can simultaneously measure the expression levels of many genes and are widely applied to study complex biological problems at the genetic level. To contain costs, instead of obtaining a microarray on each individual, mRNA from several subjects can be first pooled and then measured with a single array. mRNA pooling is also necessary when there is not enough mRNA from each subject. Several studies have investigated the impact of pooling mRNA on inferences about gene expression, but have typically modeled the process of pooling as if it occurred in some transformed scale. This assumption is unrealistic. RESULTS: We propose modeling the gene expression levels in a pool as a weighted average of mRNA expression of all individuals in the pool on the original measurement scale, where the weights correspond to individual sample contributions to the pool. Based on these improved statistical models, we develop the appropriate F statistics to test for differentially expressed genes. We present formulae to calculate the power of various statistical tests under different strategies for pooling mRNA and compare resulting power estimates to those that would be obtained by following the approach proposed by Kendziorski et al. (2003). We find that the Kendziorski estimate tends to exceed true power and that the estimate we propose, while somewhat conservative, is less biased. We argue that it is possible to design a study that includes mRNA pooling at a significantly reduced cost but with little loss of information.
MOTIVATION: Microarrays can simultaneously measure the expression levels of many genes and are widely applied to study complex biological problems at the genetic level. To contain costs, instead of obtaining a microarray on each individual, mRNA from several subjects can be first pooled and then measured with a single array. mRNA pooling is also necessary when there is not enough mRNA from each subject. Several studies have investigated the impact of pooling mRNA on inferences about gene expression, but have typically modeled the process of pooling as if it occurred in some transformed scale. This assumption is unrealistic. RESULTS: We propose modeling the gene expression levels in a pool as a weighted average of mRNA expression of all individuals in the pool on the original measurement scale, where the weights correspond to individual sample contributions to the pool. Based on these improved statistical models, we develop the appropriate F statistics to test for differentially expressed genes. We present formulae to calculate the power of various statistical tests under different strategies for pooling mRNA and compare resulting power estimates to those that would be obtained by following the approach proposed by Kendziorski et al. (2003). We find that the Kendziorski estimate tends to exceed true power and that the estimate we propose, while somewhat conservative, is less biased. We argue that it is possible to design a study that includes mRNA pooling at a significantly reduced cost but with little loss of information.
Authors: Zhiping Liu; Amanda E Ramer-Tait; Abigail L Henderson; Cumhur Yusuf Demirkale; Dan Nettleton; Chong Wang; Jesse M Hostetter; Albert E Jergens; Michael J Wannemuehler Journal: Dig Dis Sci Date: 2011-04-19 Impact factor: 3.199
Authors: Bjorn Kloosterman; Marian Oortwijn; Jan uitdeWilligen; Twan America; Ric de Vos; Richard G F Visser; Christian W B Bachem Journal: BMC Genomics Date: 2010-03-08 Impact factor: 3.969
Authors: Adrián Millán; Antonio Gómez-Tato; Carlos Fernández; Belén G Pardo; José A Alvarez-Dios; Manuel Calaza; Carmen Bouza; María Vázquez; Santiago Cabaleiro; Paulino Martínez Journal: Mar Biotechnol (NY) Date: 2009-10-21 Impact factor: 3.619
Authors: Tisha C King Heiden; Craig A Struble; Matthew L Rise; Martin J Hessner; Reinhold J Hutz; Michael J Carvan Journal: Reprod Toxicol Date: 2007-08-11 Impact factor: 3.143
Authors: S C Williamson; R Mitter; A C Hepburn; L Wilson; A Mantilla; H Y Leung; C N Robson; R Heer Journal: Br J Cancer Date: 2013-07-23 Impact factor: 7.640
Authors: Raghunandan M Kainkaryam; Angela Bruex; Anna C Gilbert; John Schiefelbein; Peter J Woolf Journal: BMC Bioinformatics Date: 2010-06-02 Impact factor: 3.169