Literature DB >> 27163413

Convex biclustering.

Eric C Chi1, Genevera I Allen2,3, Richard G Baraniuk3.   

Abstract

In the biclustering problem, we seek to simultaneously group observations and features. While biclustering has applications in a wide array of domains, ranging from text mining to collaborative filtering, the problem of identifying structure in high-dimensional genomic data motivates this work. In this context, biclustering enables us to identify subsets of genes that are co-expressed only within a subset of experimental conditions. We present a convex formulation of the biclustering problem that possesses a unique global minimizer and an iterative algorithm, COBRA, that is guaranteed to identify it. Our approach generates an entire solution path of possible biclusters as a single tuning parameter is varied. We also show how to reduce the problem of selecting this tuning parameter to solving a trivial modification of the convex biclustering problem. The key contributions of our work are its simplicity, interpretability, and algorithmic guarantees-features that arguably are lacking in the current alternative algorithms. We demonstrate the advantages of our approach, which includes stably and reproducibly identifying biclusterings, on simulated and real microarray data.
© 2016, The International Biometric Society.

Keywords:  Clustering; Convex optimization; Fused lasso; Gene expression; Reproducible research; Structured sparsity

Mesh:

Year:  2016        PMID: 27163413     DOI: 10.1111/biom.12540

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  14 in total

1.  Biclustering via sparse clustering.

Authors:  Erika S Helgeson; Qian Liu; Guanhua Chen; Michael R Kosorok; Eric Bair
Journal:  Biometrics       Date:  2019-10-14       Impact factor: 2.571

2.  Multiway Graph Signal Processing on Tensors: Integrative analysis of irregular geometries.

Authors:  Jay S Stanley; Eric C Chi; Gal Mishne
Journal:  IEEE Signal Process Mag       Date:  2020-10-29       Impact factor: 12.551

3.  Data-Driven Tree Transforms and Metrics.

Authors:  Gal Mishne; Ronen Talmon; Israel Cohen; Ronald R Coifman; Yuval Kluger
Journal:  IEEE Trans Signal Inf Process Netw       Date:  2017-08-23

4.  Reconstruction of normal forms by learning informed observation geometries from data.

Authors:  Or Yair; Ronen Talmon; Ronald R Coifman; Ioannis G Kevrekidis
Journal:  Proc Natl Acad Sci U S A       Date:  2017-08-22       Impact factor: 11.205

5.  Multi-scale affinities with missing data: Estimation and applications.

Authors:  Min Zhang; Gal Mishne; Eric C Chi
Journal:  Stat Anal Data Min       Date:  2021-11-05       Impact factor: 1.247

6.  Discovering Geometry in Data Arrays.

Authors:  Eric C Chi
Journal:  Comput Sci Eng       Date:  2021-10-14       Impact factor: 2.152

7.  Dynamic Visualization and Fast Computation for Convex Clustering via Algorithmic Regularization.

Authors:  Michael Weylandt; John Nagorski; Genevera I Allen
Journal:  J Comput Graph Stat       Date:  2019-07-19       Impact factor: 2.302

8.  Generalized Co-Clustering Analysis via Regularized Alternating Least Squares.

Authors:  Gen Li
Journal:  Comput Stat Data Anal       Date:  2020-05-04       Impact factor: 1.681

9.  COBRAC: a fast implementation of convex biclustering with compression.

Authors:  Haidong Yi; Le Huang; Gal Mishne; Eric C Chi
Journal:  Bioinformatics       Date:  2021-04-27       Impact factor: 6.937

10.  Regularized matrix data clustering and its application to image analysis.

Authors:  Xu Gao; Weining Shen; Liwen Zhang; Jianhua Hu; Norbert J Fortin; Ron D Frostig; Hernando Ombao
Journal:  Biometrics       Date:  2020-08-24       Impact factor: 1.701

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.