Coclustering of human cancer microarrays using Minimum Sum-Squared Residue coclustering.

Hyuk Cho, Inderjit S Dhillon.

Abstract

It is widely accepted in microarray analysis that identifying local patterns, characterized by coherent groups of genes and conditions, may shed light on previously undetectable cellular processes of genes as well as on macroscopic phenotypes of related samples. To cluster genes and conditions simultaneously, we previously developed a fast co-clustering algorithm, Minimum Sum-Squared Residue Co-clustering (MSSRCC), which employs an alternating minimization scheme and generates what we call co-clusters in a checkerboard structure. In this paper, we propose specific strategies that enable MSSRCC to escape poor local minima and to resolve the degeneracy problem in partitional clustering algorithms. The strategies are binormalization, deterministic spectral initialization, and incremental local search. We assess the effects of these strategies on both synthetic gene expression datasets and real human cancer microarrays, and provide empirical evidence that MSSRCC with the proposed strategies performs better than existing co-clustering and clustering algorithms. In particular, the combination of all three strategies leads to the best performance. Furthermore, we illustrate the coherence of the resulting co-clusters in a checkerboard structure, where genes in a co-cluster manifest the phenotype structure of the corresponding samples, and we evaluate the enrichment of functional annotations in Gene Ontology (GO).
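
The alternating minimization scheme mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not define the residue, so the code below assumes the simplest form, the difference between an entry and the mean of its (row cluster, column cluster) block. It also leaves out binormalization, spectral initialization, and incremental local search. All function names are hypothetical, and this is not the authors' implementation.

```python
def block_means(A, rows, cols, k, l):
    """Mean of each (row cluster, column cluster) block; empty blocks get 0.0."""
    sums = [[0.0] * l for _ in range(k)]
    counts = [[0] * l for _ in range(k)]
    for i, row in enumerate(A):
        for j, value in enumerate(row):
            sums[rows[i]][cols[j]] += value
            counts[rows[i]][cols[j]] += 1
    return [[sums[r][c] / counts[r][c] if counts[r][c] else 0.0
             for c in range(l)] for r in range(k)]


def sum_squared_residue(A, rows, cols, k, l):
    """Objective (assumed residue): squared distance of entries to block means."""
    mu = block_means(A, rows, cols, k, l)
    return sum((value - mu[rows[i]][cols[j]]) ** 2
               for i, row in enumerate(A) for j, value in enumerate(row))


def cocluster(A, rows, cols, k, l, max_iter=50):
    """Alternate row and column reassignment until the labels stop changing."""
    rows, cols = list(rows), list(cols)
    m, n = len(A), len(A[0])

    def row_cost(i, r, mu):
        # Cost of placing row i in row cluster r, column labels held fixed.
        return sum((A[i][j] - mu[r][cols[j]]) ** 2 for j in range(n))

    def col_cost(j, c, mu, row_labels):
        # Cost of placing column j in column cluster c, row labels held fixed.
        return sum((A[i][j] - mu[row_labels[i]][c]) ** 2 for i in range(m))

    for _ in range(max_iter):
        mu = block_means(A, rows, cols, k, l)
        new_rows = [min(range(k), key=lambda r: row_cost(i, r, mu))
                    for i in range(m)]
        mu = block_means(A, new_rows, cols, k, l)
        new_cols = [min(range(l), key=lambda c: col_cost(j, c, mu, new_rows))
                    for j in range(n)]
        if new_rows == rows and new_cols == cols:
            break  # converged, possibly to a local minimum
        rows, cols = new_rows, new_cols
    return rows, cols


# Toy checkerboard: two gene groups (rows) by two sample groups (columns).
A = [[1, 1, 5, 5],
     [1, 1, 5, 5],
     [9, 9, 3, 3],
     [9, 9, 3, 3]]
rows, cols = cocluster(A, [0, 0, 0, 1], [0, 0, 1, 1], k=2, l=2)
print(rows, cols, sum_squared_residue(A, rows, cols, 2, 2))
```

Like k-means, this scheme depends on its starting labels and can leave a cluster empty, which is the kind of poor local minimum and degeneracy that the proposed strategies are meant to address.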

Year: 2008   PMID: 18670042   DOI: 10.1109/TCBB.2007.70268

Source DB: PubMed   Journal: IEEE/ACM Trans Comput Biol Bioinform   ISSN: 1545-5963   Impact factor: 3.710


  6 in total

1.  Sparse Biclustering of Transposable Data.

Authors:  Kean Ming Tan; Daniela M Witten
Journal:  J Comput Graph Stat       Date:  2014       Impact factor: 2.302

2.  Efficient Mining of Discriminative Co-clusters from Gene Expression Data.

Authors:  Omar Odibat; Chandan K Reddy
Journal:  Knowl Inf Syst       Date:  2014-12       Impact factor: 2.822

3.  Evolutionary constraints and expression analysis of gene duplications in Rhodobacter sphaeroides 2.4.1.

Authors:  Anne E Peters; Anish Bavishi; Hyuk Cho; Madhusudan Choudhary
Journal:  BMC Res Notes       Date:  2012-04-25

4.  A systematic comparative evaluation of biclustering techniques.

Authors:  Victor A Padilha; Ricardo J G B Campello
Journal:  BMC Bioinformatics       Date:  2017-01-23       Impact factor: 3.169

5.  An Improved Kernel Credal Classification Algorithm Based on Regularized Mahalanobis Distance: Application to Microarray Data Analysis.

Authors:  Khawla El Bendadi; Yissam Lakhdar; El Hassan Sbai
Journal:  Comput Intell Neurosci       Date:  2018-06-27

6.  Network-aided Bi-Clustering for discovering cancer subtypes.

Authors:  Guoxian Yu; Xianxue Yu; Jun Wang
Journal:  Sci Rep       Date:  2017-04-21       Impact factor: 4.379
