Literature DB >> 16401276

Subset clustering of binary sequences, with an application to genomic abnormality data.

Peter D Hoff1.   

Abstract

This article develops a model-based approach to clustering multivariate binary data, in which the attributes that distinguish a cluster from the rest of the population may depend on the cluster being considered. The clustering approach is based on a multivariate Dirichlet process mixture model, which allows for the estimation of the number of clusters, the cluster memberships, and the cluster-specific parameters in a unified way. Such a clustering approach has applications in the analysis of genomic abnormality data, in which the development of different types of tumors may depend on the presence of certain abnormalities at subsets of locations along the genome. Additionally, such a mixture model provides a nonparametric estimation scheme for dependent sequences of binary data.

Entities:  

Mesh:

Year:  2005        PMID: 16401276     DOI: 10.1111/j.1541-0420.2005.00381.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  2 in total

1.  Nonparametric Bayes Modeling of Multivariate Categorical Data.

Authors:  David B Dunson; Chuanhua Xing
Journal:  J Am Stat Assoc       Date:  2012-01-01       Impact factor: 5.033

2.  Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features.

Authors:  J R Semeiks; A Rizki; M J Bissell; I S Mian
Journal:  BMC Bioinformatics       Date:  2006-03-16       Impact factor: 3.169

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.