Literature DB >> 32982130

Dynamic Visualization and Fast Computation for Convex Clustering via Algorithmic Regularization.

Michael Weylandt1, John Nagorski1, Genevera I Allen1,2,3,4.   

Abstract

Convex clustering is a promising new approach to the classical problem of clustering, combining strong performance in empirical studies with rigorous theoretical foundations. Despite these advantages, convex clustering has not been widely adopted, due to its computationally intensive nature and its lack of compelling visualizations. To address these impediments, we introduce Algorithmic Regularization, an innovative technique for obtaining high-quality estimates of regularization paths using an iterative one-step approximation scheme. We justify our approach with a novel theoretical result, guaranteeing global convergence of the approximate path to the exact solution under easily-checked non-data-dependent assumptions. The application of algorithmic regularization to convex clustering yields the Convex Clustering via Algorithmic Regularization Paths (CARP) algorithm for computing the clustering solution path. On example data sets from genomics and text analysis, CARP delivers over a 100-fold speed-up over existing methods, while attaining a finer approximation grid than standard methods. Furthermore, CARP enables improved visualization of clustering solutions: the fine solution grid returned by CARP can be used to construct a convex clustering-based dendrogram, as well as forming the basis of a dynamic path-wise visualization based on modern web technologies. Our methods are implemented in the open-source R package clustRviz, available at https://github.com/DataSlingers/clustRviz.

Entities:  

Keywords:  Algorithmic Regularization; Clustering; Convex Clustering; Dendrograms; Optimization; Visualization

Year:  2019        PMID: 32982130      PMCID: PMC7518335          DOI: 10.1080/10618600.2019.1629943

Source DB:  PubMed          Journal:  J Comput Graph Stat        ISSN: 1061-8600            Impact factor:   2.302


  7 in total

1.  Convex biclustering.

Authors:  Eric C Chi; Genevera I Allen; Richard G Baraniuk
Journal:  Biometrics       Date:  2016-05-10       Impact factor: 2.571

2.  Splitting Methods for Convex Clustering.

Authors:  Eric C Chi; Kenneth Lange
Journal:  J Comput Graph Stat       Date:  2015-12-10       Impact factor: 2.302

3.  An ordinary differential equation based solution path algorithm.

Authors:  Yichao Wu
Journal:  J Nonparametr Stat       Date:  2011       Impact factor: 1.231

4.  Statistical properties of convex clustering.

Authors:  Kean Ming Tan; Daniela Witten
Journal:  Electron J Stat       Date:  2015-10-14       Impact factor: 1.125

5.  A Generic Path Algorithm for Regularized Statistical Estimation.

Authors:  Hua Zhou; Yichao Wu
Journal:  J Am Stat Assoc       Date:  2014       Impact factor: 5.033

6.  ConvexLAR: An Extension of Least Angle Regression.

Authors:  Wei Xiao; Yichao Wu; Hua Zhou
Journal:  J Comput Graph Stat       Date:  2015-09-16       Impact factor: 2.302

7.  Comprehensive molecular portraits of human breast tumours.

Authors: 
Journal:  Nature       Date:  2012-09-23       Impact factor: 49.962

  7 in total
  3 in total

1.  Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data.

Authors:  Minjie Wang; Genevera I Allen
Journal:  J Mach Learn Res       Date:  2021-01       Impact factor: 5.177

2.  COBRAC: a fast implementation of convex biclustering with compression.

Authors:  Haidong Yi; Le Huang; Gal Mishne; Eric C Chi
Journal:  Bioinformatics       Date:  2021-04-27       Impact factor: 6.937

3.  Efficient Proximal Gradient Algorithms for Joint Graphical Lasso.

Authors:  Jie Chen; Ryosuke Shimmura; Joe Suzuki
Journal:  Entropy (Basel)       Date:  2021-12-02       Impact factor: 2.524

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.