Literature DB >> 19912170

Pairwise variable selection for high-dimensional model-based clustering.

Jian Guo1, Elizaveta Levina, George Michailidis, Ji Zhu.   

Abstract

Variable selection for clustering is an important and challenging problem in high-dimensional data analysis. Existing variable selection methods for model-based clustering select informative variables in a "one-in-all-out" manner; that is, a variable is selected if at least one pair of clusters is separable by this variable and removed if it cannot separate any of the clusters. In many applications, however, it is of interest to further establish exactly which clusters are separable by each informative variable. To address this question, we propose a pairwise variable selection method for high-dimensional model-based clustering. The method is based on a new pairwise penalty. Results on simulated and real data show that the new method performs better than alternative approaches that use ℓ(1) and ℓ(∞) penalties and offers better interpretation.
© 2009, The International Biometric Society.

Entities:  

Mesh:

Year:  2010        PMID: 19912170      PMCID: PMC2888949          DOI: 10.1111/j.1541-0420.2009.01341.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  7 in total

1.  Variable selection for model-based high-dimensional clustering and its application to microarray data.

Authors:  Sijian Wang; Ji Zhu
Journal:  Biometrics       Date:  2007-10-26       Impact factor: 2.571

2.  Mixture models with multiple levels, with application to the analysis of multifactor gene expression data.

Authors:  Rebecka Jörnsten; Sündüz Keleş
Journal:  Biostatistics       Date:  2008-02-05       Impact factor: 5.899

3.  Variable Selection using MM Algorithms.

Authors:  David R Hunter; Runze Li
Journal:  Ann Stat       Date:  2005       Impact factor: 4.028

4.  Simultaneous factor selection and collapsing levels in ANOVA.

Authors:  Howard D Bondell; Brian J Reich
Journal:  Biometrics       Date:  2008-05-28       Impact factor: 2.571

5.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

6.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks.

Authors:  J Khan; J S Wei; M Ringnér; L H Saal; M Ladanyi; F Westermann; F Berthold; M Schwab; C R Antonescu; C Peterson; P S Meltzer
Journal:  Nat Med       Date:  2001-06       Impact factor: 53.440

7.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.

Authors:  Eng-Juh Yeoh; Mary E Ross; Sheila A Shurtleff; W Kent Williams; Divyen Patel; Rami Mahfouz; Fred G Behm; Susana C Raimondi; Mary V Relling; Anami Patel; Cheng Cheng; Dario Campana; Dawn Wilkins; Xiaodong Zhou; Jinyan Li; Huiqing Liu; Ching-Hon Pui; William E Evans; Clayton Naeve; Limsoon Wong; James R Downing
Journal:  Cancer Cell       Date:  2002-03       Impact factor: 31.743

  7 in total
  7 in total

1.  Comparing Model Selection and Regularization Approaches to Variable Selection in Model-Based Clustering.

Authors:  Gilles Celeux; Marie-Laure Martin-Magniette; Cathy Maugis-Rabusseau; Adrian E Raftery
Journal:  J Soc Fr Statistique (2009)       Date:  2014

2.  Penalized model-based clustering with unconstrained covariance matrices.

Authors:  Hui Zhou; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2009-01-01       Impact factor: 1.125

3.  Supervised Bayesian latent class models for high-dimensional data.

Authors:  Stacia M Desantis; E Andrés Houseman; Brent A Coull; Catherine L Nutt; Rebecca A Betensky
Journal:  Stat Med       Date:  2012-04-11       Impact factor: 2.373

4.  Integrative clustering methods for multi-omics data.

Authors:  Xiaoyu Zhang; Zhenwei Zhou; Hanfei Xu; Ching-Ti Liu
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2021-02-07

5.  Identifying Heterogeneous Effect using Latent Supervised Clustering with Adaptive Fusion.

Authors:  Jingxiang Chen; Quoc Tran-Dinh; Michael R Kosorok; Yufeng Liu
Journal:  J Comput Graph Stat       Date:  2020-06-30       Impact factor: 1.884

6.  Covariance-enhanced discriminant analysis.

Authors:  Peirong Xu; J I Zhu; Lixing Zhu; Y I Li
Journal:  Biometrika       Date:  2014-12-03       Impact factor: 2.445

7.  Clustering High-Dimensional Landmark-based Two-dimensional Shape Data.

Authors:  Chao Huang; Martin Styner; Hongtu Zhu
Journal:  J Am Stat Assoc       Date:  2015-04-16       Impact factor: 5.033

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.