Literature DB >> 32546879

Generalized Co-Clustering Analysis via Regularized Alternating Least Squares.

Gen Li1.   

Abstract

Biclustering is an important exploratory analysis tool that simultaneously clusters rows (e.g., samples) and columns (e.g., variables) of a data matrix. Checkerboard-like biclusters reveal intrinsic associations between rows and columns. However, most existing methods rely on Gaussian assumptions and only apply to matrix data. In practice, non-Gaussian and/or multi-way tensor data are frequently encountered. A new CO-clustering method via Regularized Alternating Least Squares (CORALS) is proposed, which generalizes biclustering to non-Gaussian data and multi-way tensor arrays. Non-Gaussian data are modeled with single-parameter exponential family distributions and co-clusters are identified in the natural parameter space via sparse CANDECOMP/PARAFAC tensor decomposition. A regularized alternating (iteratively reweighted) least squares algorithm is devised for model fitting and a deflation procedure is exploited to automatically determine the number of co-clusters. Comprehensive simulation studies and three real data examples demonstrate the efficacy of the proposed method. The data and code are publicly available.

Entities:  

Keywords:  Biclustering; Exponential Family; Generalized Linear Model; Parafac/Candecomp; Tensor

Year:  2020        PMID: 32546879      PMCID: PMC7297185          DOI: 10.1016/j.csda.2020.106989

Source DB:  PubMed          Journal:  Comput Stat Data Anal        ISSN: 0167-9473            Impact factor:   1.681


  13 in total

1.  Decomposing gene expression into cellular processes.

Authors:  E Segal; A Battle; D Koller
Journal:  Pac Symp Biocomput       Date:  2003

2.  Sparse Biclustering of Transposable Data.

Authors:  Kean Ming Tan; Daniela M Witten
Journal:  J Comput Graph Stat       Date:  2014       Impact factor: 2.302

3.  Biclustering via sparse singular value decomposition.

Authors:  Mihee Lee; Haipeng Shen; Jianhua Z Huang; J S Marron
Journal:  Biometrics       Date:  2010-12       Impact factor: 2.571

4.  Systematic quantification of developmental phenotypes at single-cell resolution during embryogenesis.

Authors:  Julia L Moore; Zhuo Du; Zhirong Bao
Journal:  Development       Date:  2013-08       Impact factor: 6.868

5.  Robust biclustering by sparse singular value decomposition incorporating stability selection.

Authors:  Martin Sill; Sebastian Kaiser; Axel Benner; Annette Kopp-Schneider
Journal:  Bioinformatics       Date:  2011-06-02       Impact factor: 6.937

Review 6.  Biclustering on expression data: A review.

Authors:  Beatriz Pontes; Raúl Giráldez; Jesús S Aguilar-Ruiz
Journal:  J Biomed Inform       Date:  2015-07-06       Impact factor: 6.317

7.  Generalized integrative principal component analysis for multi-type data with block-wise missing structure.

Authors:  Huichen Zhu; Gen Li; Eric F Lock
Journal:  Biostatistics       Date:  2020-04-01       Impact factor: 5.899

8.  Spectral biclustering of microarray data: coclustering genes and conditions.

Authors:  Yuval Kluger; Ronen Basri; Joseph T Chang; Mark Gerstein
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

9.  Interferon-beta therapy in multiple sclerosis: the short-term and long-term effects on the patients' individual gene expression in peripheral blood.

Authors:  Michael Hecker; Christiane Hartmann; Ole Kandulski; Brigitte Katrin Paap; Dirk Koczan; Hans-Juergen Thiesen; Uwe Klaus Zettl
Journal:  Mol Neurobiol       Date:  2013-05-01       Impact factor: 5.590

10.  Identifying Multi-Dimensional Co-Clusters in Tensors Based on Hyperplane Detection in Singular Vector Spaces.

Authors:  Hongya Zhao; Debby D Wang; Long Chen; Xinyu Liu; Hong Yan
Journal:  PLoS One       Date:  2016-09-06       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.