Literature DB >> 24358018

Cluster Analysis: Unsupervised Learning via Supervised Learning with a Non-convex Penalty.

Wei Pan1, Xiaotong Shen2, Binghui Liu3.   

Abstract

Clustering analysis is widely used in many fields. Traditionally clustering is regarded as unsupervised learning for its lack of a class label or a quantitative response variable, which in contrast is present in supervised learning such as classification and regression. Here we formulate clustering as penalized regression with grouping pursuit. In addition to the novel use of a non-convex group penalty and its associated unique operating characteristics in the proposed clustering method, a main advantage of this formulation is its allowing borrowing some well established results in classification and regression, such as model selection criteria to select the number of clusters, a difficult problem in clustering analysis. In particular, we propose using the generalized cross-validation (GCV) based on generalized degrees of freedom (GDF) to select the number of clusters. We use a few simple numerical examples to compare our proposed method with some existing approaches, demonstrating our method's promising performance.

Entities:  

Keywords:  Generalized degrees of freedom; Grouping; K-means clustering; Lasso; Penalized regression; Truncated lasso penalty (TLP)

Year:  2013        PMID: 24358018      PMCID: PMC3866036     

Source DB:  PubMed          Journal:  J Mach Learn Res        ISSN: 1532-4435            Impact factor:   3.654


  6 in total

1.  Likelihood-based selection and sharp parameter estimation.

Authors:  Xiaotong Shen; Wei Pan; Yunzhang Zhu
Journal:  J Am Stat Assoc       Date:  2012-06-11       Impact factor: 5.033

Review 2.  Survey of clustering algorithms.

Authors:  Rui Xu; Donald Wunsch
Journal:  IEEE Trans Neural Netw       Date:  2005-05

3.  Evaluation and comparison of gene clustering methods in microarray analysis.

Authors:  Anbupalam Thalamuthu; Indranil Mukhopadhyay; Xiaojing Zheng; George C Tseng
Journal:  Bioinformatics       Date:  2006-07-31       Impact factor: 6.937

4.  K-means clustering: a half-century synthesis.

Authors:  Douglas Steinley
Journal:  Br J Math Stat Psychol       Date:  2006-05       Impact factor: 3.380

5.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

6.  Grouping pursuit through a regularization solution surface.

Authors:  Xiaotong Shen; Hsin-Cheng Huang
Journal:  J Am Stat Assoc       Date:  2010-06-01       Impact factor: 5.033

  6 in total
  10 in total

1.  Statistical modelling of citation exchange between statistics journals.

Authors:  Cristiano Varin; Manuela Cattelan; David Firth
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2015-11-03       Impact factor: 2.483

2.  A New Algorithm and Theory for Penalized Regression-based Clustering.

Authors:  Chong Wu; Sunghoon Kwon; Xiaotong Shen; Wei Pan
Journal:  J Mach Learn Res       Date:  2016       Impact factor: 3.654

3.  Clustering of Data with Missing Entries using Non-convex Fusion Penalties.

Authors:  Sunrita Poddar; Mathews Jacob
Journal:  IEEE Trans Signal Process       Date:  2019-09-30       Impact factor: 4.931

4.  MODEL-BASED FEATURE SELECTION AND CLUSTERING OF RNA-SEQ DATA FOR UNSUPERVISED SUBTYPE DISCOVERY.

Authors:  David K Lim; Naim U Rashid; Joseph G Ibrahim
Journal:  Ann Appl Stat       Date:  2021-03-18       Impact factor: 2.083

5.  Small area mean estimation after effect clustering.

Authors:  Zhihuang Yang; Jiahua Chen
Journal:  J Appl Stat       Date:  2019-07-30       Impact factor: 1.416

6.  Fused Lasso Approach in Regression Coefficients Clustering - Learning Parameter Heterogeneity in Data Integration.

Authors:  Lu Tang; Peter X K Song
Journal:  J Mach Learn Res       Date:  2016       Impact factor: 3.654

7.  Characterizing classes of fibromyalgia within the continuum of central sensitization syndrome.

Authors:  Fred Davis; Mark Gostine; Bradley Roberts; Rebecca Risko; Joseph C Cappelleri; Alesia Sadosky
Journal:  J Pain Res       Date:  2018-10-23       Impact factor: 3.133

8.  The evolution of online ideological communities.

Authors:  Brittany I Davidson; Simon L Jones; Adam N Joinson; Joanne Hinds
Journal:  PLoS One       Date:  2019-05-22       Impact factor: 3.240

9.  Semi-Supervised Topological Analysis for Elucidating Hidden Structures in High-Dimensional Transcriptome Datasets.

Authors:  Tianshu Feng; Jaime I Davila; Yuanhang Liu; Sangdi Lin; Shuai Huang; Chen Wang
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2021-08-06       Impact factor: 3.710

10.  Provable Convex Co-clustering of Tensors.

Authors:  Eric C Chi; Brian R Gaines; Will Wei Sun; Hua Zhou; Jian Yang
Journal:  J Mach Learn Res       Date:  2020       Impact factor: 5.177

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.