Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Variable selection for model-based high-dimensional clustering and its application to microarray data.

Literature DB >> 17970821

Variable selection for model-based high-dimensional clustering and its application to microarray data.

Abstract

Variable selection in high-dimensional clustering analysis is an important yet challenging problem. In this article, we propose two methods that simultaneously separate data points into similar clusters and select informative variables that contribute to the clustering. Our methods are in the framework of penalized model-based clustering. Unlike the classical L(1)-norm penalization, the penalty terms that we propose make use of the fact that parameters belonging to one variable should be treated as a natural "group." Numerical results indicate that the two new methods tend to remove noninformative variables more effectively and provide better clustering results than the L(1)-norm approach.

Mesh：

Year: 2007 PMID： 17970821 DOI： 10.1111/j.1541-0420.2007.00922.x

Source DB: PubMed Journal: Biometrics ISSN： 0006-341X Impact factor: 2.571

Keyword Cloud
Cited

25 in total

Variable selection for model-based high-dimensional clustering and its application to microarray data.

1. Sparse Biclustering of Transposable Data.

2. A statistical framework for Illumina DNA methylation arrays.

3. Adaptive regularization using the entire solution surface.

4. Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.

5. Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

6. Statistical Significance of Clustering using Soft Thresholding.

7. A framework for feature selection in clustering.

8. SPARSE INTEGRATIVE CLUSTERING OF MULTIPLE OMICS DATA SETS.

9. Sparse cluster analysis of large-scale discrete variables with application to single nucleotide polymorphism data.

10. Filtering genes for cluster and network analysis.