A Bayesian semiparametric factor analysis model for subtype identification.

Jiehuan Sun, Joshua L Warren, Hongyu Zhao.

Abstract

Disease subtype identification (clustering) is an important problem in biomedical research. Gene expression profiles are commonly used to infer disease subtypes, which often leads to biologically meaningful insights into disease. Despite many successes, existing clustering methods may perform poorly when genes are highly correlated and when, because of the high dimensionality, many uninformative genes are included in the clustering. In this article, we introduce a novel Bayesian subtype identification method based on gene expression profiles. The method, called BCSub, adopts a semiparametric Bayesian factor analysis model to reduce the data to a few factor scores for clustering. Specifically, the factor scores are assumed to follow a Dirichlet process mixture model, which induces clustering. Through extensive simulation studies, we show that BCSub outperforms commonly used clustering methods. When applied to two gene expression datasets, our model identifies subtypes that are more clinically relevant than those identified by existing methods.
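
The generative idea summarized in the abstract (a few latent factor scores per sample, with a Dirichlet process prior that groups those scores into clusters) can be illustrated with a small simulation. The sketch below is not BCSub and performs no inference; it only draws synthetic data from a simplified model of this kind. The function names (`crp_assignments`, `simulate_expression`), the Gaussian cluster and noise distributions, and all numeric settings are assumptions made for illustration, and the Dirichlet process is represented through its Chinese restaurant process form.

```python
import random


def crp_assignments(n, alpha, rng):
    """Draw cluster labels for n samples from a Chinese restaurant process.

    This is the partition induced by a Dirichlet process with
    concentration parameter alpha (illustrative helper, not from the paper).
    """
    labels, counts = [], []
    for _ in range(n):
        # An existing cluster k is chosen with weight counts[k];
        # a new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)
        else:
            counts[k] += 1
        labels.append(k)
    return labels


def simulate_expression(n_samples, n_genes, n_factors, alpha=1.0, seed=0):
    """Simulate expression data from a toy factor model with clustered scores.

    Assumed model (hypothetical settings): y_i = Lambda * eta_i + noise,
    where eta_i is Gaussian around the mean of the cluster of sample i.
    """
    rng = random.Random(seed)
    labels = crp_assignments(n_samples, alpha, rng)
    n_clusters = max(labels) + 1

    # Cluster-specific means for the factor scores.
    centers = [[rng.gauss(0.0, 3.0) for _ in range(n_factors)]
               for _ in range(n_clusters)]
    # Loading matrix Lambda: one row of n_factors loadings per gene.
    loadings = [[rng.gauss(0.0, 1.0) for _ in range(n_factors)]
                for _ in range(n_genes)]

    scores, data = [], []
    for c in labels:
        # Factor scores for this sample, centred on its cluster mean.
        eta = [rng.gauss(m, 1.0) for m in centers[c]]
        # Each gene is a linear combination of the scores plus noise.
        y = [sum(l * e for l, e in zip(row, eta)) + rng.gauss(0.0, 0.5)
             for row in loadings]
        scores.append(eta)
        data.append(y)
    return labels, scores, data


if __name__ == "__main__":
    labels, scores, data = simulate_expression(30, 50, 3, alpha=1.0, seed=1)
    print("number of simulated subtypes:", max(labels) + 1)
    print("data shape:", len(data), "x", len(data[0]))
```

In this toy setting the number of subtypes is not fixed in advance: it emerges from the Chinese restaurant process, with larger values of `alpha` tending to produce more clusters. All subtype information reaches the genes through the few factor scores, which is the motivation for clustering in the low-dimensional factor space rather than on the genes directly.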

Keywords:  Bayesian factor analysis; Bayesian nonparametrics; Dirichlet process; clustering; gene expression study

Year: 2017; PMID: 28343169; PMCID: PMC5545128; DOI: 10.1515/sagmb-2016-0051

Source DB: PubMed; Journal: Stat Appl Genet Mol Biol; ISSN: 1544-6115

