Debashis Ghosh1. 1. Department of Biostatistics, School of Public Health, University of Michigan, 1420 Washington Heights, Ann Arbor, MI 48109-2029, USA. ghoshd@umich.edu
Abstract
MOTIVATION: The use of DNA microarrays has become quite popular in many scientific and medical disciplines, such as in cancer research. One common goal of these studies is to determine which genes are differentially expressed between cancer and healthy tissue, or more generally, between two experimental conditions. A major complication in the molecular profiling of tumors using gene expression data is that the data represent a combination of tumor and normal cells. Much of the methodology developed for assessing differential expression with microarray data has assumed that tissue samples are homogeneous. RESULTS: In this paper, we outline a general framework for determining differential expression in the presence of mixed cell populations. We consider study designs in which paired tissues and unpaired tissues are available. A hierarchical mixture model is used for modeling the data; a combination of methods of moments procedures and the expectation-maximization algorithm are used to estimate the model parameters. The finite-sample properties of the methods are assessed in simulation studies; they are applied to two microarray datasets from cancer studies. Commands in the R language can be downloaded from the URL http://www.sph.umich.edu/~ghoshd/COMPBIO/COMPMIX/.
MOTIVATION: The use of DNA microarrays has become quite popular in many scientific and medical disciplines, such as in cancer research. One common goal of these studies is to determine which genes are differentially expressed between cancer and healthy tissue, or more generally, between two experimental conditions. A major complication in the molecular profiling of tumors using gene expression data is that the data represent a combination of tumor and normal cells. Much of the methodology developed for assessing differential expression with microarray data has assumed that tissue samples are homogeneous. RESULTS: In this paper, we outline a general framework for determining differential expression in the presence of mixed cell populations. We consider study designs in which paired tissues and unpaired tissues are available. A hierarchical mixture model is used for modeling the data; a combination of methods of moments procedures and the expectation-maximization algorithm are used to estimate the model parameters. The finite-sample properties of the methods are assessed in simulation studies; they are applied to two microarray datasets from cancer studies. Commands in the R language can be downloaded from the URL http://www.sph.umich.edu/~ghoshd/COMPBIO/COMPMIX/.
Authors: W M Muir; G J M Rosa; B R Pittendrigh; S Xu; S D Rider; M Fountain; J Ogas Journal: Comput Stat Data Anal Date: 2009-03-15 Impact factor: 1.681
Authors: Alexandre Kuhn; Azad Kumar; Alexandra Beilina; Allissa Dillman; Mark R Cookson; Andrew B Singleton Journal: BMC Genomics Date: 2012-11-12 Impact factor: 3.969
Authors: James R Bradford; Matthew Farren; Steve J Powell; Sarah Runswick; Susie L Weston; Helen Brown; Oona Delpuech; Mark Wappett; Neil R Smith; T Hedley Carr; Jonathan R Dry; Neil J Gibson; Simon T Barry Journal: PLoS One Date: 2013-06-19 Impact factor: 3.240
Authors: Dirk Repsilber; Sabine Kern; Anna Telaar; Gerhard Walzl; Gillian F Black; Joachim Selbig; Shreemanta K Parida; Stefan H E Kaufmann; Marc Jacobsen Journal: BMC Bioinformatics Date: 2010-01-14 Impact factor: 3.169