Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Bayesian infinite mixture model based clustering of gene expression profiles.

Literature DB >> 12217911

Bayesian infinite mixture model based clustering of gene expression profiles.

Abstract

MOTIVATION: The biologic significance of results obtained through cluster analyses of gene expression data generated in microarray experiments have been demonstrated in many studies. In this article we focus on the development of a clustering procedure based on the concept of Bayesian model-averaging and a precise statistical model of expression data.
RESULTS: We developed a clustering procedure based on the Bayesian infinite mixture model and applied it to clustering gene expression profiles. Clusters of genes with similar expression patterns are identified from the posterior distribution of clusterings defined implicitly by the stochastic data-generation model. The posterior distribution of clusterings is estimated by a Gibbs sampler. We summarized the posterior distribution of clusterings by calculating posterior pairwise probabilities of co-expression and used the complete linkage principle to create clusters. This approach has several advantages over usual clustering procedures. The analysis allows for incorporation of a reasonable probabilistic model for generating data. The method does not require specifying the number of clusters and resulting optimal clustering is obtained by averaging over models with all possible numbers of clusters. Expression profiles that are not similar to any other profile are automatically detected, the method incorporates experimental replicates, and it can be extended to accommodate missing data. This approach represents a qualitative shift in the model-based cluster analysis of expression data because it allows for incorporation of uncertainties involved in the model selection in the final assessment of confidence in similarities of expression profiles. We also demonstrated the importance of incorporating the information on experimental variability into the clustering model. AVAILABILITY: The MS Windows(TM) based program implementing the Gibbs sampler and supplemental material is available at http://homepages.uc.edu/~medvedm/BioinformaticsSupplement.htm CONTACT: medvedm@email.uc.edu

Mesh：

Year: 2002 PMID： 12217911 DOI： 10.1093/bioinformatics/18.9.1194

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

64 in total

1. Context-specific infinite mixtures for clustering gene expression profiles across diverse microarray dataset.

Authors: X Liu; S Sivaganesan; K Y Yeung; J Guo; R E Bumgarner; Mario Medvedovic
Journal: Bioinformatics Date: 2006-05-18 Impact factor: 6.937

2. ConceptGen: a gene set enrichment and gene set relation mapping tool.

Authors: Maureen A Sartor; Vasudeva Mahavisno; Venkateshwar G Keshamouni; James Cavalcoli; Zachary Wright; Alla Karnovsky; Rork Kuick; H V Jagadish; Barbara Mirel; Terry Weymouth; Brian Athey; Gilbert S Omenn
Journal: Bioinformatics Date: 2009-12-09 Impact factor: 6.937

3. Microarray analysis of cytoplasmic versus whole cell RNA reveals a considerable number of missed and false positive mRNAs.

Authors: Heidi W Trask; Richard Cowper-Sal-lari; Maureen A Sartor; Jiang Gui; Catherine V Heath; Janhavi Renuka; Azara-Jane Higgins; Peter Andrews; Murray Korc; Jason H Moore; Craig R Tomlinson
Journal: RNA Date: 2009-08-24 Impact factor: 4.942

4. Model-Based Clustering With Data Correction For Removing Artifacts In Gene Expression Data.

Authors: William Chad Young; Adrian E Raftery; Ka Yee Yeung
Journal: Ann Appl Stat Date: 2017-12-28 Impact factor: 2.083

5. Nonparametric spatial models for clustered ordered periodontal data.

Authors: Dipankar Bandyopadhyay; Antonio Canale
Journal: J R Stat Soc Ser C Appl Stat Date: 2016-04-14 Impact factor: 1.864

6. Robust Clustering with Subpopulation-specific Deviations.

Authors: Briana J K Stephenson; Amy H Herring; Andrew Olshan
Journal: J Am Stat Assoc Date: 2019-06-19 Impact factor: 5.033

7. Query large scale microarray compendium datasets using a model-based bayesian approach with variable selection.

Authors: Ming Hu; Zhaohui S Qin
Journal: PLoS One Date: 2009-02-13 Impact factor: 3.240

8. Expression profiles of switch-like genes accurately classify tissue and infectious disease phenotypes in model-based classification.

Authors: Michael Gormley; Aydin Tozeren
Journal: BMC Bioinformatics Date: 2008-11-17 Impact factor: 3.169

9. MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering.

Authors: Eun-Youn Kim; Seon-Young Kim; Daniel Ashlock; Dougu Nam
Journal: BMC Bioinformatics Date: 2009-08-22 Impact factor: 3.169

10. AutoClass@IJM: a powerful tool for Bayesian classification of heterogeneous data in biology.

Authors: Fiona Achcar; Jean-Michel Camadro; Denis Mestivier
Journal: Nucleic Acids Res Date: 2009-05-27 Impact factor: 16.971