Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A New Algorithm and Theory for Penalized Regression-based Clustering.

Literature DB >> 31662706

A New Algorithm and Theory for Penalized Regression-based Clustering.

Chong Wu¹, Sunghoon Kwon², Xiaotong Shen³, Wei Pan¹.

Abstract

Clustering is unsupervised and exploratory in nature. Yet, it can be performed through penalized regression with grouping pursuit, as demonstrated in Pan et al. (2013). In this paper, we develop a more efficient algorithm for scalable computation and a new theory of clustering consistency for the method. This algorithm, called DC-ADMM, combines difference of convex (DC) programming with the alternating direction method of multipliers (ADMM). This algorithm is shown to be more computationally efficient than the quadratic penalty based algorithm of Pan et al. (2013) because of the former's closed-form updating formulas. Numerically, we compare the DC-ADMM algorithm with the quadratic penalty algorithm to demonstrate its utility and scalability. Theoretically, we establish a finite-sample mis-clustering error bound for penalized regression based clustering with the L 0 constrained regularization in a general setting. On this ground, we provide conditions for clustering consistency of the penalized clustering method. As an end product, we put R package prclust implementing PRclust with various loss and grouping penalty functions available on GitHub and CRAN.

Entities: Chemical Disease Gene

Keywords: Alternating direction method of multipliers (ADMM); Clustering consistency; Difference of convex (DC) programming; Truncated L1-penalty (TLP)

Year: 2016 PMID： 31662706 PMCID： PMC6818515

Source DB: PubMed Journal: J Mach Learn Res ISSN： 1532-4435 Impact factor: 3.654

6 in total

1. Likelihood-based selection and sharp parameter estimation.

Authors: Xiaotong Shen; Wei Pan; Yunzhang Zhu
Journal: J Am Stat Assoc Date: 2012-06-11 Impact factor: 5.033

Review 2. Survey of clustering algorithms.

Authors: Rui Xu; Donald Wunsch
Journal: IEEE Trans Neural Netw Date: 2005-05

2 in total

1. Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data.

Authors: Minjie Wang; Genevera I Allen
Journal: J Mach Learn Res Date: 2021-01 Impact factor: 5.177

2. Provable Convex Co-clustering of Tensors.

Authors: Eric C Chi; Brian R Gaines; Will Wei Sun; Hua Zhou; Jian Yang
Journal: J Mach Learn Res Date: 2020 Impact factor: 5.177

2 in total

A New Algorithm and Theory for Penalized Regression-based Clustering.

1. Likelihood-based selection and sharp parameter estimation.

Review 2. Survey of clustering algorithms.

3. K-means clustering: a half-century synthesis.

4. Integrative and regularized principal component analysis of multiple sources of data.

5. Cluster Analysis: Unsupervised Learning via Supervised Learning with a Non-convex Penalty.

6. Splitting Methods for Convex Clustering.

1. Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data.

2. Provable Convex Co-clustering of Tensors.