Literature DB >> 19527960

Estimating the number of clusters via system evolution for cluster analysis of gene expression data.

Kaijun Wang1, Jie Zheng, Junying Zhang, Jiyang Dong.   

Abstract

The estimation of the number of clusters (NC) is one of crucial problems in the cluster analysis of gene expression data. Most approaches available give their answers without the intuitive information about separable degrees between clusters. However, this information is useful for understanding cluster structures. To provide this information, we propose system evolution (SE) method to estimate NC based on partitioning around medoids (PAM) clustering algorithm. SE analyzes cluster structures of a dataset from the viewpoint of a pseudothermodynamics system. The system will go to its stable equilibrium state, at which the optimal NC is found, via its partitioning process and merging process. The experimental results on simulated and real gene expression data demonstrate that the SE works well on the data with well-separated clusters and the one with slightly overlapping clusters.

Mesh:

Year:  2009        PMID: 19527960     DOI: 10.1109/TITB.2009.2025119

Source DB:  PubMed          Journal:  IEEE Trans Inf Technol Biomed        ISSN: 1089-7771


  2 in total

Review 1.  A global approach to analysis and interpretation of metabolic data for plant natural product discovery.

Authors:  Manhoi Hur; Alexis Ann Campbell; Marcia Almeida-de-Macedo; Ling Li; Nick Ransom; Adarsh Jose; Matt Crispin; Basil J Nikolau; Eve Syrkin Wurtele
Journal:  Nat Prod Rep       Date:  2013-04       Impact factor: 13.423

2.  Clustering of High Throughput Gene Expression Data.

Authors:  Harun Pirim; Burak Ekşioğlu; Andy Perkins; Cetin Yüceer
Journal:  Comput Oper Res       Date:  2012-12       Impact factor: 4.008

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.