Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Pairwise variable selection for high-dimensional model-based clustering.

Literature DB >> 19912170

Pairwise variable selection for high-dimensional model-based clustering.

Jian Guo¹, Elizaveta Levina, George Michailidis, Ji Zhu.

Abstract

Variable selection for clustering is an important and challenging problem in high-dimensional data analysis. Existing variable selection methods for model-based clustering select informative variables in a "one-in-all-out" manner; that is, a variable is selected if at least one pair of clusters is separable by this variable and removed if it cannot separate any of the clusters. In many applications, however, it is of interest to further establish exactly which clusters are separable by each informative variable. To address this question, we propose a pairwise variable selection method for high-dimensional model-based clustering. The method is based on a new pairwise penalty. Results on simulated and real data show that the new method performs better than alternative approaches that use ℓ(1) and ℓ(∞) penalties and offers better interpretation.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2010 PMID： 19912170 PMCID： PMC2888949 DOI： 10.1111/j.1541-0420.2009.01341.x

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

7 in total

1. Variable selection for model-based high-dimensional clustering and its application to microarray data.

Authors: Sijian Wang; Ji Zhu
Journal: Biometrics Date: 2007-10-26 Impact factor: 2.571

2. Mixture models with multiple levels, with application to the analysis of multifactor gene expression data.

Authors: Rebecka Jörnsten; Sündüz Keleş
Journal: Biostatistics Date: 2008-02-05 Impact factor: 5.899

3. Variable Selection using MM Algorithms.

Authors: David R Hunter; Runze Li
Journal: Ann Stat Date: 2005 Impact factor: 4.028

4. Simultaneous factor selection and collapsing levels in ANOVA.

Authors: Howard D Bondell; Brian J Reich
Journal: Biometrics Date: 2008-05-28 Impact factor: 2.571

5. Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors: Benhuai Xie; Wei Pan; Xiaotong Shen
Journal: Electron J Stat Date: 2008 Impact factor: 1.125

6. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors: J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal: Nat Med Date: 2001-06 Impact factor: 53.440

7. Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.

Authors: Eng-Juh Yeoh; Mary E Ross; Sheila A Shurtleff; W Kent Williams; Divyen Patel; Rami Mahfouz; Fred G Behm; Susana C Raimondi; Mary V Relling; Anami Patel; Cheng Cheng; Dario Campana; Dawn Wilkins; Xiaodong Zhou; Jinyan Li; Huiqing Liu; Ching-Hon Pui; William E Evans; Clayton Naeve; Limsoon Wong; James R Downing
Journal: Cancer Cell Date: 2002-03 Impact factor: 31.743

7 in total

Pairwise variable selection for high-dimensional model-based clustering.

1. Variable selection for model-based high-dimensional clustering and its application to microarray data.

2. Mixture models with multiple levels, with application to the analysis of multifactor gene expression data.

3. Variable Selection using MM Algorithms.

4. Simultaneous factor selection and collapsing levels in ANOVA.

5. Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

6. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

7. Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.

1. Comparing Model Selection and Regularization Approaches to Variable Selection in Model-Based Clustering.

2. Penalized model-based clustering with unconstrained covariance matrices.

3. Supervised Bayesian latent class models for high-dimensional data.

4. Integrative clustering methods for multi-omics data.

5. Identifying Heterogeneous Effect using Latent Supervised Clustering with Adaptive Fusion.

6. Covariance-enhanced discriminant analysis.

7. Clustering High-Dimensional Landmark-based Two-dimensional Shape Data^‡.