A Nonparametric Bayesian Model for Local Clustering with Application to Proteomics.

Juhee Lee, Peter Müller, Yitan Zhu, Yuan Ji.

Abstract

We propose a nonparametric Bayesian local clustering (NoB-LoC) approach for heterogeneous data. NoB-LoC implements inference for nested clusters as posterior inference under a Bayesian model. Using protein expression data as an example, the NoB-LoC model defines a protein (column) cluster as a set of proteins that give rise to the same partition of the samples (rows). In other words, the sample partitions are nested within protein clusters. The common clustering of the samples gives meaning to the protein clusters. Any pair of samples might belong to the same cluster for one protein set but to different clusters for another protein set. These local features differ from features obtained by global clustering approaches such as hierarchical clustering, which create only one partition of samples that applies to all the proteins in the data set. In addition, the NoB-LoC model differs from most other local or nested clustering methods, which define clusters based on common parameters in the sampling model. As an added and important feature, the NoB-LoC method probabilistically excludes sets of irrelevant proteins and samples that do not meaningfully co-cluster with other proteins and samples, thus improving the inference on the clustering of the remaining proteins and samples. Inference is guided by a joint probability model for all the random elements. We provide a simulation study and a motivating example to demonstrate the unique features of the NoB-LoC model.
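To make the nested structure described in the abstract concrete, the following is a minimal sketch that simulates one draw of a nested random partition: proteins are first partitioned by a Pólya urn, and each protein cluster then receives its own Pólya urn partition of the samples. It is only an illustration of the idea, not the authors' model or their posterior inference. The function names, the parameters `alpha_protein`, `alpha_sample`, and `p_inactive`, and the use of a single fixed exclusion probability (label `-1`) to mimic the exclusion of irrelevant proteins and samples are all assumptions made for this example.

```python
import random


def polya_urn_partition(n, alpha, rng):
    """Draw cluster labels for n items from a Polya urn with total mass alpha > 0."""
    labels = []
    sizes = []  # sizes[k] = number of items currently in cluster k
    for _ in range(n):
        # Join existing cluster k with weight sizes[k],
        # or open a new cluster with weight alpha.
        weights = sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(sizes):
            sizes.append(1)
        else:
            sizes[k] += 1
        labels.append(k)
    return labels


def nested_partition(n_proteins, n_samples, alpha_protein, alpha_sample,
                     p_inactive, seed=0):
    """Simulate protein clusters, each with its own partition of the samples.

    Label -1 marks a protein (or a sample within a protein cluster) that is
    set aside as inactive. This is a hypothetical simplification.
    """
    rng = random.Random(seed)

    # Step 1: set some proteins aside, then partition the remaining ones.
    protein_active = [rng.random() >= p_inactive for _ in range(n_proteins)]
    drawn = iter(polya_urn_partition(sum(protein_active), alpha_protein, rng))
    protein_labels = [next(drawn) if a else -1 for a in protein_active]

    # Step 2: each protein cluster gets its own, independent sample partition.
    sample_labels = {}
    for cluster in sorted(set(protein_labels) - {-1}):
        sample_active = [rng.random() >= p_inactive for _ in range(n_samples)]
        drawn = iter(polya_urn_partition(sum(sample_active), alpha_sample, rng))
        sample_labels[cluster] = [next(drawn) if a else -1 for a in sample_active]

    return protein_labels, sample_labels


if __name__ == "__main__":
    proteins, samples = nested_partition(12, 8, 1.0, 1.0, 0.2, seed=1)
    print("protein cluster labels:", proteins)
    for cluster, labels in samples.items():
        print(f"sample partition within protein cluster {cluster}:", labels)
```

Because each protein cluster carries a separate sample partition, two samples can share a label under one protein cluster and differ under another, which is the local feature the abstract contrasts with global clustering.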

Keywords:  Dirichlet Process; Protein Expression; Pólya Urn; RPPA; Random Partitions

Year:  2013        PMID: 24222928      PMCID: PMC3821783          DOI: 10.1080/01621459.2013.784705

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   5.033

