Variable selection for model-based high-dimensional clustering and its application to microarray data.

Sijian Wang, Ji Zhu.

Abstract

Variable selection in high-dimensional clustering analysis is an important yet challenging problem. In this article, we propose two methods that simultaneously separate data points into similar clusters and select informative variables that contribute to the clustering. Our methods are in the framework of penalized model-based clustering. Unlike the classical L1-norm penalization, the penalty terms that we propose make use of the fact that parameters belonging to one variable should be treated as a natural "group." Numerical results indicate that the two new methods tend to remove noninformative variables more effectively and provide better clustering results than the L1-norm approach.
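
The contrast between an L1-norm penalty and a penalty that treats each variable's parameters as a group can be illustrated with a small sketch. The abstract does not give the form of the proposed penalties, so everything below is an assumption made for illustration: the cluster means are stored as a K x p table (clusters by variables) on centered data, the group penalty is taken to be the largest absolute cluster mean per variable, and the function names are hypothetical.

```python
def l1_penalty(means):
    """L1-norm penalty: sum of |mean| over every cluster and every variable."""
    return sum(abs(m) for row in means for m in row)


def group_max_penalty(means):
    """Illustrative group penalty (an assumption, not taken from the abstract):
    for each variable, take the largest |mean| across clusters, then sum."""
    n_vars = len(means[0])
    return sum(max(abs(row[j]) for row in means) for j in range(n_vars))


def informative_variables(means, tol=1e-8):
    """A variable contributes to the clustering only if at least one
    cluster mean for it is nonzero (data assumed centered)."""
    n_vars = len(means[0])
    return [j for j in range(n_vars) if any(abs(row[j]) > tol for row in means)]


# Hypothetical fitted means: 3 clusters (rows) by 3 variables (columns).
means = [
    [1.5, 0.0, 0.2],
    [-1.0, 0.0, 0.0],
    [0.5, 0.0, 0.0],
]

print(l1_penalty(means))
print(group_max_penalty(means))
print(informative_variables(means))
```

In this toy table the second variable has all cluster means equal to zero, so it is the only one treated as noninformative. The L1-norm penalty charges each entry separately, while the group-style penalty charges each variable once, which is the sense in which a variable's parameters act as a "group."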

Year: 2007   PMID: 17970821   DOI: 10.1111/j.1541-0420.2007.00922.x

Source DB: PubMed   Journal: Biometrics   ISSN: 0006-341X   Impact factor: 2.571


  25 in total

1.  Sparse Biclustering of Transposable Data.

Authors:  Kean Ming Tan; Daniela M Witten
Journal:  J Comput Graph Stat       Date:  2014       Impact factor: 2.302

2.  A statistical framework for Illumina DNA methylation arrays.

Authors:  Pei Fen Kuan; Sijian Wang; Xin Zhou; Haitao Chu
Journal:  Bioinformatics       Date:  2010-09-29       Impact factor: 6.937

3.  Adaptive regularization using the entire solution surface.

Authors:  S Wu; X Shen; C J Geyer
Journal:  Biometrika       Date:  2009-09       Impact factor: 2.445

4.  Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Bioinformatics       Date:  2009-12-23       Impact factor: 6.937

5.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

6.  Statistical Significance of Clustering using Soft Thresholding.

Authors:  Hanwen Huang; Yufeng Liu; Ming Yuan; J S Marron
Journal:  J Comput Graph Stat       Date:  2015-12-10       Impact factor: 2.302

7.  A framework for feature selection in clustering.

Authors:  Daniela M Witten; Robert Tibshirani
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

8.  SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS.

Authors:  Ronglai Shen; Sijian Wang; Qianxing Mo
Journal:  Ann Appl Stat       Date:  2013-04-09       Impact factor: 2.083

9.  Sparse cluster analysis of large-scale discrete variables with application to single nucleotide polymorphism data.

Authors:  Baolin Wu
Journal:  J Appl Stat       Date:  2012-11-21       Impact factor: 1.404

10.  Filtering genes for cluster and network analysis.

Authors:  David Tritchler; Elena Parkhomenko; Joseph Beyene
Journal:  BMC Bioinformatics       Date:  2009-06-23       Impact factor: 3.169
