Literature DB >> 28620908

A Dirichlet process mixture model for clustering longitudinal gene expression data.

Jiehuan Sun1, Jose D Herazo-Maya2, Naftali Kaminski2, Hongyu Zhao1, Joshua L Warren1.   

Abstract

Subgroup identification (clustering) is an important problem in biomedical research. Gene expression profiles are commonly utilized to define subgroups. Longitudinal gene expression profiles might provide additional information on disease progression than what is captured by baseline profiles alone. Therefore, subgroup identification could be more accurate and effective with the aid of longitudinal gene expression data. However, existing statistical methods are unable to fully utilize these data for patient clustering. In this article, we introduce a novel clustering method in the Bayesian setting based on longitudinal gene expression profiles. This method, called BClustLonG, adopts a linear mixed-effects framework to model the trajectory of genes over time, while clustering is jointly conducted based on the regression coefficients obtained from all genes. In order to account for the correlations among genes and alleviate the high dimensionality challenges, we adopt a factor analysis model for the regression coefficients. The Dirichlet process prior distribution is utilized for the means of the regression coefficients to induce clustering. Through extensive simulation studies, we show that BClustLonG has improved performance over other clustering methods. When applied to a dataset of severely injured (burn or trauma) patients, our model is able to identify interesting subgroups.
Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

Entities:  

Keywords:  Bayesian factor analysis; Bayesian nonparametrics; clustering; longitudinal gene expression study

Mesh:

Year:  2017        PMID: 28620908      PMCID: PMC5583037          DOI: 10.1002/sim.7374

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  19 in total

1.  Principal component analysis for clustering gene expression data.

Authors:  K Y Yeung; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

2.  Generalized common spatial factor model.

Authors:  Fujun Wang; Melanie M Wall
Journal:  Biostatistics       Date:  2003-10       Impact factor: 5.899

3.  Bayesian mixture model based clustering of replicated microarray data.

Authors:  M Medvedovic; K Y Yeung; R E Bumgarner
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

4.  Variable selection for clustering with Gaussian mixture models.

Authors:  Cathy Maugis; Gilles Celeux; Marie-Laure Martin-Magniette
Journal:  Biometrics       Date:  2009-02-04       Impact factor: 2.571

5.  Cluster analysis using multivariate mixed effects models.

Authors:  Luis Villarroel; Guillermo Marshall; Anna E Barón
Journal:  Stat Med       Date:  2009-09-10       Impact factor: 2.373

6.  Sparse Bayesian infinite factor models.

Authors:  A Bhattacharya; D B Dunson
Journal:  Biometrika       Date:  2011-06       Impact factor: 2.445

Review 7.  Disentangling the heterogeneity of autism spectrum disorder through genetic findings.

Authors:  Shafali S Jeste; Daniel H Geschwind
Journal:  Nat Rev Neurol       Date:  2014-01-28       Impact factor: 42.937

8.  Molecular profiling of non-small cell lung cancer and correlation with disease-free survival.

Authors:  Dennis A Wigle; Igor Jurisica; Niki Radulovich; Melania Pintilie; Janet Rossant; Ni Liu; Chao Lu; James Woodgett; Isolde Seiden; Michael Johnston; Shaf Keshavjee; Gail Darling; Timothy Winton; Bobby-Joe Breitkreutz; Paul Jorgenson; Mike Tyers; Frances A Shepherd; Ming Sound Tsao
Journal:  Cancer Res       Date:  2002-06-01       Impact factor: 12.701

9.  Personal omics profiling reveals dynamic molecular and medical phenotypes.

Authors:  Rui Chen; George I Mias; Jennifer Li-Pook-Than; Lihua Jiang; Hugo Y K Lam; Rong Chen; Elana Miriami; Konrad J Karczewski; Manoj Hariharan; Frederick E Dewey; Yong Cheng; Michael J Clark; Hogune Im; Lukas Habegger; Suganthi Balasubramanian; Maeve O'Huallachain; Joel T Dudley; Sara Hillenmeyer; Rajini Haraksingh; Donald Sharon; Ghia Euskirchen; Phil Lacroute; Keith Bettinger; Alan P Boyle; Maya Kasowski; Fabian Grubert; Scott Seki; Marco Garcia; Michelle Whirl-Carrillo; Mercedes Gallardo; Maria A Blasco; Peter L Greenberg; Phyllis Snyder; Teri E Klein; Russ B Altman; Atul J Butte; Euan A Ashley; Mark Gerstein; Kari C Nadeau; Hua Tang; Michael Snyder
Journal:  Cell       Date:  2012-03-16       Impact factor: 41.582

Review 10.  Tumour heterogeneity and cancer cell plasticity.

Authors:  Corbin E Meacham; Sean J Morrison
Journal:  Nature       Date:  2013-09-19       Impact factor: 49.962

View more
  4 in total

1.  A Bayesian multiple imputation approach to bivariate functional data with missing components.

Authors:  Jeong Hoon Jang; Amita K Manatunga; Changgee Chang; Qi Long
Journal:  Stat Med       Date:  2021-06-08       Impact factor: 2.497

2.  Host transcriptional response to TB preventive therapy differentiates two sub-groups of IGRA-positive individuals.

Authors:  Claire Broderick; Jacqueline M Cliff; Ji-Sook Lee; Myrsini Kaforou; David Aj Moore
Journal:  Tuberculosis (Edinb)       Date:  2020-11-28       Impact factor: 3.131

3.  A novel computational strategy for DNA methylation imputation using mixture regression model (MRM).

Authors:  Fangtang Yu; Chao Xu; Hong-Wen Deng; Hui Shen
Journal:  BMC Bioinformatics       Date:  2020-12-01       Impact factor: 3.169

4.  Factors Predicting Detrimental Change in Declarative Memory Among Women With HIV: A Study of Heterogeneity in Cognition.

Authors:  Kathryn C Fitzgerald; Pauline M Maki; Yanxun Xu; Wei Jin; Raha Dastgheyb; Dionna W Williams; Gayle Springer; Kathryn Anastos; Deborah Gustafson; Amanda B Spence; Adaora A Adimora; Drenna Waldrop; David E Vance; Hector Bolivar; Victor G Valcour; Leah H Rubin
Journal:  Front Psychol       Date:  2020-10-15
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.