Literature DB >> 15055797

Cluster analysis of gene expression data based on self-splitting and merging competitive learning.

Shuanhu Wu1, Alan Wee-Chung Liew, Hong Yan, Mengsu Yang.   

Abstract

Cluster analysis of gene expression data from a cDNA microarray is useful for identifying biologically relevant groups of genes. However, finding the natural clusters in the data and estimating the correct number of clusters are still two largely unsolved problems. In this paper, we propose a new clustering framework that is able to address both these problems. By using the one-prototype-take-one-cluster (OPTOC) competitive learning paradigm, the proposed algorithm can find natural clusters in the input data, and the clustering solution is not sensitive to initialization. In order to estimate the number of distinct clusters in the data, we propose a cluster splitting and merging strategy. We have applied the new algorithm to simulated gene expression data for which the correct distribution of genes over clusters is known a priori. The results show that the proposed algorithm can find natural clusters and give the correct number of clusters. The algorithm has also been tested on real gene expression changes during yeast cell cycle, for which the fundamental patterns of gene expression and assignment of genes to clusters are well understood from numerous previous studies. Comparative studies with several clustering algorithms illustrate the effectiveness of our method.

Entities:  

Mesh:

Year:  2004        PMID: 15055797     DOI: 10.1109/titb.2004.824724

Source DB:  PubMed          Journal:  IEEE Trans Inf Technol Biomed        ISSN: 1089-7771


  5 in total

1.  SMART: unique splitting-while-merging framework for gene clustering.

Authors:  Rui Fa; David J Roberts; Asoke K Nandi
Journal:  PLoS One       Date:  2014-04-08       Impact factor: 3.240

2.  An Empirical Analysis of Rough Set Categorical Clustering Techniques.

Authors:  Jamal Uddin; Rozaida Ghazali; Mustafa Mat Deris
Journal:  PLoS One       Date:  2017-01-09       Impact factor: 3.240

3.  Rough set based information theoretic approach for clustering uncertain categorical data.

Authors:  Jamal Uddin; Rozaida Ghazali; Jemal H Abawajy; Habib Shah; Noor Aida Husaini; Asim Zeb
Journal:  PLoS One       Date:  2022-05-13       Impact factor: 3.752

4.  Microarray missing data imputation based on a set theoretic framework and biological knowledge.

Authors:  Xiangchao Gan; Alan Wee-Chung Liew; Hong Yan
Journal:  Nucleic Acids Res       Date:  2006-03-20       Impact factor: 16.971

5.  Identification of coherent patterns in gene expression data using an efficient biclustering algorithm and parallel coordinate visualization.

Authors:  Kin-On Cheng; Ngai-Fong Law; Wan-Chi Siu; Alan Wee-Chung Liew
Journal:  BMC Bioinformatics       Date:  2008-04-23       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.