Literature DB >> 25346887

A recursively partitioned mixture model for clustering time-course gene expression data.

Devin C Koestler1, Carmen J Marsit2, Brock C Christensen2, Karl T Kelsey3, E Andres Houseman4.   

Abstract

BACKGROUND: Longitudinally collected gene expression data provides an opportunity to investigate the dynamic behavior of gene expression and is crucial for establishing causal links between changes on a molecular level and disease development and progression. In terms of the analysis of such data, clustering of subjects based on time-course expression data may improve our understanding of temporal expression patterns that result in disease phenotypes. Although there are numerous existing methods for clustering subjects using gene expression data, most are not suitable when expression measurements are repeatedly collected over a time-course.
METHODS: We present a modified version of the recursively partitioned mixture model (RPMM) for clustering subjects based on longitudinally collected gene expression data. In the proposed time-course RPMM (TC-RPMM), subjects are clustered on the basis of their temporal profiles of gene expression using a mixture of mixed effects models framework. This framework captures changes in gene expression over time and models the autocorrelation between repeated gene expression measurements for the same subject. We assessed the performance of TC-RPMM using extensive simulation studies and a dataset from a multi-center research study of inflammation and response to injury (www.gluegrant.org), which consisted of time-course gene expression data for 140 subjects.
RESULTS: Our simulation studies encompassed several different scenarios and were aimed at assessing the ability of TC-RPMM to correctly recover true class memberships when the expression trajectories that characterized those classes differed. Overall, our simulation studies revealed favorable performance of TC-RPMM compared to competing approaches, however clustering performance was observed to be highly dependent on the proportion of class discriminating genes used in clustering analysis. When applied to real epidemiologic data with repeated-measures, longitudinal gene expression measurements, TC-RPMM identified clusters that had strong biological and clinical significance.
CONCLUSIONS: Methods for clustering subjects based on temporal gene expression profiles is a high priority for molecular biology and bioinformatics research. Along these lines, the proposed TC-RPMM represents a promising new approach for analyzing time-course gene expression data.

Entities:  

Keywords:  Longitudinal gene expression data; clustering; mixture models; repeated-measures microarrays; time-course microarrays

Year:  2014        PMID: 25346887      PMCID: PMC4208690          DOI: 10.3978/j.issn.2218-676X.2014.06.04

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


  25 in total

1.  Systematic determination of genetic network architecture.

Authors:  S Tavazoie; J D Hughes; M J Campbell; R J Cho; G M Church
Journal:  Nat Genet       Date:  1999-07       Impact factor: 38.330

2.  Continuous representations of time-series gene expression data.

Authors:  Ziv Bar-Joseph; Georg K Gerber; David K Gifford; Tommi S Jaakkola; Itamar Simon
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

3.  Mixtures of regression models for time course gene expression data: evaluation of initialization and random effects.

Authors:  Theresa Scharl; Bettinan Grü; Friedrich Leisch
Journal:  Bioinformatics       Date:  2009-12-29       Impact factor: 6.937

4.  DNA methylation array analysis identifies profiles of blood-derived DNA methylation associated with bladder cancer.

Authors:  Carmen J Marsit; Devin C Koestler; Brock C Christensen; Margaret R Karagas; E Andres Houseman; Karl T Kelsey
Journal:  J Clin Oncol       Date:  2011-02-22       Impact factor: 44.544

5.  Random-effects models for longitudinal data.

Authors:  N M Laird; J H Ware
Journal:  Biometrics       Date:  1982-12       Impact factor: 2.571

6.  Modulation of inflammation by reactive oxygen species: implications for aging and tissue repair.

Authors:  B Khodr; Z Khalil
Journal:  Free Radic Biol Med       Date:  2001-01-01       Impact factor: 7.376

7.  Semi-supervised recursively partitioned mixture models for identifying cancer subtypes.

Authors:  Devin C Koestler; Carmen J Marsit; Brock C Christensen; Margaret R Karagas; Raphael Bueno; David J Sugarbaker; Karl T Kelsey; E Andres Houseman
Journal:  Bioinformatics       Date:  2010-08-16       Impact factor: 6.937

8.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  Increased inflammation delays wound healing in mice deficient in collagenase-2 (MMP-8).

Authors:  Ana Gutiérrez-Fernández; Masaki Inada; Milagros Balbín; Antonio Fueyo; Ana S Pitiot; Aurora Astudillo; Kenji Hirose; Michiko Hirata; Steven D Shapiro; Agnès Noël; Zena Werb; Stephen M Krane; Carlos López-Otín; Xose S Puente
Journal:  FASEB J       Date:  2007-03-28       Impact factor: 5.191

10.  Clustering cancer gene expression data: a comparative study.

Authors:  Marcilio C P de Souto; Ivan G Costa; Daniel S A de Araujo; Teresa B Ludermir; Alexander Schliep
Journal:  BMC Bioinformatics       Date:  2008-11-27       Impact factor: 3.169

View more
  3 in total

1.  A Linear Mixed Model Spline Framework for Analysing Time Course 'Omics' Data.

Authors:  Jasmin Straube; Alain-Dominique Gorse; Bevan Emma Huang; Kim-Anh Lê Cao
Journal:  PLoS One       Date:  2015-08-27       Impact factor: 3.240

2.  Promoter methylation of DNA damage repair (DDR) genes in human tumor entities: RBBP8/CtIP is almost exclusively methylated in bladder cancer.

Authors:  Jolein Mijnes; Jürgen Veeck; Nadine T Gaisa; Eduard Burghardt; Tim C de Ruijter; Sonja Gostek; Edgar Dahl; David Pfister; Sebastian C Schmid; Ruth Knüchel; Michael Rose
Journal:  Clin Epigenetics       Date:  2018-02-06       Impact factor: 6.551

3.  Exploring the longitudinal dynamics of herd BVD antibody test results using model-based clustering.

Authors:  J I Eze; G T Innocent; K Adam; S Huntley; G J Gunn
Journal:  Sci Rep       Date:  2019-08-06       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.