Literature DB >> 18162109

Variable selection in penalized model-based clustering via regularization on grouped parameters.

Benhuai Xie1, Wei Pan1, Xiaotong Shen2.   

Abstract

Penalized model-based clustering has been proposed for high-dimensional but small sample-sized data, such as arising from genomic studies; in particular, it can be used for variable selection. A new regularization scheme is proposed to group together multiple parameters of the same variable across clusters, which is shown both analytically and numerically to be more effective than the conventional L(1) penalty for variable selection. In addition, we develop a strategy to combine this grouping scheme with grouping structured variables. Simulation studies and applications to microarray gene expression data for cancer subtype discovery demonstrate the advantage of the new proposal over several existing approaches.

Entities:  

Mesh:

Year:  2007        PMID: 18162109     DOI: 10.1111/j.1541-0420.2007.00955.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  8 in total

1.  Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Bioinformatics       Date:  2009-12-23       Impact factor: 6.937

2.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

3.  Clustering of High Throughput Gene Expression Data.

Authors:  Harun Pirim; Burak Ekşioğlu; Andy Perkins; Cetin Yüceer
Journal:  Comput Oper Res       Date:  2012-12       Impact factor: 4.008

4.  Penalized model-based clustering with unconstrained covariance matrices.

Authors:  Hui Zhou; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2009-01-01       Impact factor: 1.125

5.  Sparse cluster analysis of large-scale discrete variables with application to single nucleotide polymorphism data.

Authors:  Baolin Wu
Journal:  J Appl Stat       Date:  2012-11-21       Impact factor: 1.404

6.  Clustering and variable selection in the presence of mixed variable types and missing data.

Authors:  C B Storlie; S M Myers; S K Katusic; A L Weaver; R G Voigt; P E Croarkin; R E Stoeckel; J D Port
Journal:  Stat Med       Date:  2018-05-17       Impact factor: 2.373

7.  Filtering genes for cluster and network analysis.

Authors:  David Tritchler; Elena Parkhomenko; Joseph Beyene
Journal:  BMC Bioinformatics       Date:  2009-06-23       Impact factor: 3.169

8.  Model-based clustering based on sparse finite Gaussian mixtures.

Authors:  Gertraud Malsiner-Walli; Sylvia Frühwirth-Schnatter; Bettina Grün
Journal:  Stat Comput       Date:  2014-08-26       Impact factor: 2.559

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.